Novel Proteins and Nucleic Acids Encoding Same 




RELATED APPLICATIONS 

5 

This is a request for filing a new nonprovisional application under 37 C.F.R. §1 .53(b). This 
application claims priority to U.S.S.N. 60/274,322 filed on March 8, 2001 (Cura 590); 
U.S.S.N. 60/283,675 filed on April 13, 2001 (Cura 590D1); U.S.S.N. 60/338,092 filed on 
December 3, 2001 (Cura 590D2); U.S.S.N. 60/274,281 filed on March 8, 2001 (Cura 591); 

10 U.S.S.N. 60/274,101 filed on March 8, 2001 (Cura 592); U.S.S.N. 60/325,681 filed on 
September 27, 2001 (Cura 592J1); U.S.S.N. 60/304,354 filed on July 10, 2001 (Cura 
59211); U.S.S.N. 60/279,995 filed on March 30, 2001 (Cura 592H1); U.S.S.N. 60/294,899 
filed on May 31, 2001 (Cura 592E1); U.S.S.N. 60/287,424 filed on April 30, 2001 (Cura 
592D1); U.S.S.N. 60/299,027 filed on June 18, 2001 (Cura 592D2); U.S.S.N. 60/309,198 

15 filed on July 31, 2001 (Cura 592C1); U.S.S.N. 60/281,194 filed on April 4, 2001 (Cura 
592A1); U.S.S.N. 60/274,194 filed on March 8, 2001 (Cura 593); U.S.S.N. 60/274,849 
filed on March 9, 2001 (Cura 594); U.S.S.N. 60/330,380 filed on October 18, 2001 (Cura 
594C1); U.S.S.N. 60/275,235 filed on March 12, 2001 (Cura 595); U.S.S.N. 60/288,342 
filed on May 3, 2001 (Cura 595J1); U.S.S.N. 60/275,578 filed on March 13, 2001 (Cura 

20 596); U.S.S.N. 60/291,240 filed on May 16, 2001 (Cura 59611); U.S.S.N. 60/294,485 filed 
on May 30, 2001 (Cura 596B1); U.S.S.N. 60/299,310 filed on June 19, 2001 (Cura 
596A1); U.S.S.N. 60/275,579 filed on March 13, 2001 (Cura 597); U.S.S.N. 60/275,601 
filed on March 13, 2001 (Cura 598); U.S.S.N. 60/276,000 filed on March 14, 2001 (Cura 
599); U.S.S.N. 60/280,900 filed on April 2, 2001 (Cura 599E1); U.S.S.N. 60/276,776 filed 

25 on March 1 6, 2001 (Cura 600); U.S.S.N. 60/294,889 filed on May 3 1 , 200 1 (Cura 600G1); 
U.S.S.N. 60/318,770 filed on September 12, 2001 (Cura 600E1); U.S.S.N. 60/276,994 filed 
on March 19, 2001 (Cura 604); U.S.S.N. 60/277,338 filed on March 20, 2001 (Cura 607); 
U.S.S.N. 60/325,430 filed on September 27, 2001 (Cura 607J1); U.S.S.N. 60/332,094 filed 
on November 21, 2001 (Cura 607C1); U.S.S.N. 60/299,303 filed on June 19, 2001 (Cura 

30 607B1); U.S.S.N. 60/288,066 filed on May 2, 2001 (Cura 607A1); U.S.S.N. 60/277,321 
filed on March 20, 2001 (Cura 608); U.S.S.N. 60/280,822 filed on April 2, 2001 (Cura 
608A); U.S.S.N. 60/277,239 filed on March 20, 2001 (Cura 609); U.S.S.N. 60/277,327 
filed on March 20, 2001 (Cura 610); U.S.S.N. 60/277,791 filed on March 21, 2001 (Cura 
61 1); U.S.S.N. 60/333,1 84 filed on November 14, 2001 (Cura 611H1); U.S.S.N. 

35 60/277,833 filed on March 22, 2001 (Cura 612); U.S.S.N. 60/318,462 filed on September 

10, 2001 (Cura 612J1); U.S.S.N. 60/288,528 filed on May 3, 2001 (Cura 612A1); U.S.S.N. 
1 



60/278,152 filed on March 23, 2001 (Cura 613); U.S.S.N. 60/332,272 filed on November 
14, 2001 (Cura 613D1); U.S.S.N. 60/278,894 filed on March 26, 2001 (Cura 614); 
U.S.S.N. 60/312,903 filed on August 16, 2001 (Cura 614C1); U.S.S.N. 60/333,272 filed on 
November 14, 2001 (Cura 614C2); U.S.S.N. 60/279,036 filed on March 27, 2001 (Cura 

5 615); U.S.S.N. 60/332,172 filed on November 14, 2001 (Cura 61511); U.S.S.N. 60/337,426 
filed on December 3, 2001 (Cura 61512); U.S.S.N. 60/278,999 filed on March 27, 2001 
(Cura 616); U.S.S.N. 60/279,344 filed on March 28, 2001 (Cura 617); U.S.S.N. 60/332,271 
filed on November 14, 2001 (Cura 617J1); U.S.S.N. 60/291,099 filed on May 16, 2001 
(Cura 617H1); U.S.S.N. 60/291,190 filed on May 15, 2001 (Cura 617E1); U.S.S.N. 

1 0 60/280,233 filed on March 30, 200 1 (Cura 61 8); U.S.S.N. 60/280,802 filed on April 2, 
2001 (Cura 621); U.S.S.N. 60/335,301 filed on October 31, 2001 (Cura 621F1); U.S.S.N. 
60/337,185 filed on December 4, 2001 (Cura 621D1); and U.S.S.N. 60/345,705 filed on 
January 3, 2002 (Cura 621B1). 

15 

Field of the Invention 

The present invention relates to novel polypeptides that are targets of small 
20 molecule drugs and that have properties related to stimulation of biochemical or 

physiological responses in a cell, a tissue, an organ or an organism. More particularly, the 
novel polypeptides are gene products of novel genes, or are specified biologically active 
fragments or derivatives thereof. Methods of use encompass diagnostic and prognostic 
assay procedures as well as methods of treating diverse pathological conditions. The 
25 present invention discloses novel associations of proteins and polypeptides and the nucleic 
acids that encode them with various diseases or pathologies. The proteins and related 
proteins that are similar to them, are encoded by a cDNA and/or by genomic DNA. The 
proteins, polypeptides and their cognate nucleic acids were identified by Curagen 
Corporation in certain cases. The XYZase-encoded protein and any variants, thereof, are 
30 suitable as diagnostic markers, targets for an antibody therapeutic and targets for small 
molecule drugs. As such the current invention embodies the use of recombinantly 
expressed and/or endogenously expressed protein in various screens to identify such 
therapeutic antibodies and/or therapeutic small molecules. 
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Background 



Eukaryotic cells are characterized by biochemical and physiological processes 
which under normal conditions are exquisitely balanced to achieve the preservation and 
propagation of the cells. When such cells are components of multicellular organisms such 
as vertebrates, or more particularly organisms such as mammals, the regulation of the 
biochemical and physiological processes involves intricate signaling pathways. 
Frequently, such signaling pathways are constituted of extracellular signaling proteins, 
cellular receptors that bind the signaling proteins and signal transducing components 
located within the cells. 

Signaling proteins may be classified as endocrine effectors, paracrine effectors or 
autocrine effectors. Endocrine effectors are signaling molecules secreted by a given organ 
into the circulatory system, which are then transported to a distant target organ or tissue. 
The target cells include the receptors for the endocrine effector, and when the endocrine 
effector binds, a signaling cascade is induced. Paracrine effectors involve secreting cells 
and receptor cells in close proximity to each other, for example two different classes of 
cells in the same tissue or organ. One class of cells secretes the paracrine effector, which 
then reaches the second class of cells, for example by diffusion through the extracellular 
fluid. The second class of cells contains the receptors for the paracrine effector; binding of 
the effector results in induction of the signaling cascade that elicits the corresponding 
biochemical or physiological effect. Autocrine effectors are highly analogous to paracrine 
effectors, except that the same cell type that secretes the autocrine effector also contains the 
receptor. Thus the autocrine effector binds to receptors on the same cell, or on identical 
neighboring cells. The binding process then elicits the characteristic biochemical or 
physiological effect. 

Signaling processes may elicit a variety of effects on cells and tissues including by 
way of nonlimiting example induction of cell or tissue proliferation, suppression of growth 
or proliferation, induction of differentiation or maturation of a cell or tissue, and 
suppression of differentiation or maturation of a cell or tissue. 

Many pathological conditions involve dysregulation of expression of important 
effector proteins. In certain classes of pathologies the dysregulation is manifested as 
diminished or suppressed level of synthesis and secretion protein effectors. In a clinical 
setting a subject may be suspected of suffering from a condition brought on by diminished 
or suppressed levels of a protein effector of interest. Therefore there is a need to be able to 
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assay for the level of the protein effector of interest in a biological sample from such a 
subject, and to compare the level with that characteristic of a nonpathological condition. 
There further is a need to provide the protein effector as a product of manufacture. 
Administration of the effector to a subject in need thereof is useful in treatment of the 
pathological condition, or the protein effector deficiency or suppression may be favorably 
acted upon by the administration of another small molecule drug product. Accordingly, 
there is a need for a method of treatment of a pathological condition brought on by a 
diminished or suppressed levels of the protein effector of interest. 

Small molecule targets have been implicated in various disease states or 
pathologies. These targets may be proteins, and particularly enzymatic proteins, which are 
acted upon by small molecule drugs for the purpose of altering target function and 
achieving a desired result. Cellular, animal and clinical studies can be performed to 
elucidate the genetic contribution to the etiology and pathogenesis of conditions in which 
small molecule targets are implicated in a variety of physiologic, pharmacologic or native 
states. These studies utilize the core technologies at CuraGen Corporation to look at 
differential gene expression, protein-protein interactions, large-scale sequencing of 
expressed genes and the association of genetic variations such as, but not limited to, single 
nucleotide polymorphisms (SNPs) or splice variants in and between biological samples 
from experimental and control groups. The goal of such studies is to identify potential 
avenues for therapeutic intervention in order to prevent, treat the consequences or cure the 
conditions. 

In order to treat diseases, pathologies and other abnormal states or conditions in 
which a mammalian organism has been diagnosed as being, or as being at risk for 
becoming, other than in a normal state or condition, it is important to identify new 
therapeutic agents. Such a procedure includes at least the steps of identifying a target 
component within an affected tissue or organ, and identifying a candidate therapeutic agent 
that modulates the functional attributes of the target. The target component may be any 
biological macromolecule implicated in the disease or pathology. Commonly the target is a 
polypeptide or protein with specific functional attributes. Other classes of macromolecule 
may be a nucleic acid, a polysaccharide, a lipid such as a complex lipid or a glycolipid; in 
addition a target may be a sub-cellular structure or extra-cellular structure that is comprised 
of more than one of these classes of macromolecule. Once such a target has been 
identified, it may be employed in a screening assay in order to identify favorable candidate 
therapeutic agents from among a large population of substances or compounds. 



In many cases the objective of such screening assays is to identify small molecule 
candidates; this is commonly approached by the use of combinatorial methodologies to 
develop the population of substances to be tested. The implementation of high throughput 
screening methodologies is advantageous when working with large, combinatorial libraries 
of compounds. 

It is an objective of this invention to provide at least one target biopolymer that is 
intended to serve as the macromolecular component in a screening assay for identifying 
candidate pharmaceutical agents. 

It is another objective of the present invention to provide screening assays that 
positively identify candidate pharmaceutical agents from among a combinatorial library of 
low molecular weight substances or compounds. 

It is still a further objective of this invention to employ the candidate 
pharmaceutical agents in any of a variety of in vitro, ex vivo and in vivo assays in order to 
identify pharmaceutical agents with advantageous therapeutic applications in the treatment 
of a disease, pathology, or abnormal state or condition in a mammal. 

Summary Of The Invention 

The invention is based in part upon the discovery of nucleic acid sequences 
encoding novel polypeptides. These nucleic acids and polypeptides, as well as derivatives, 
homologs, analogs and fragments thereof, will hereinafter be collectively designated as 
"NOVX" nucleic acid, which represents the nucleotide sequence selected from the group 
consisting of SEQ ID NO: 2n-l, wherein n is an integer between 1 and 1 78, or polypeptide 
sequences, which represents the group consisting of SEQ ID NO: 2n, wherein n is an 
integer between 1 and 178. 

In one aspect, the invention provides an isolated polypeptide comprising a mature 
form of a NOVX amino acid. One example is a variant of a mature form of a NOVX 
amino acid sequence, wherein any amino acid in the mature form is changed to a different 
amino acid, provided that no more than 15% of the amino acid residues in the sequence of 
the mature form are so changed. The amino acid can be, for example, a NOVX amino 
acid sequence or a variant of a NOVX amino acid sequence, wherein any amino acid 
specified in the chosen sequence is changed to a different amino acid, provided that no 
more than 15% of the amino acid residues in the sequence are so changed. The invention 
also includes fragments of any of these. In another aspect, the invention also includes an 
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isolated nucleic acid that encodes a NOVX polypeptide, or a fragment, homolog, analog or 
derivative thereof. 

Also included in the invention is a NOVX polypeptide that is a naturally occurring 
allelic variant of a NOVX sequence. In one embodiment, the allelic variant includes an 
amino acid sequence that is the translation of a nucleic acid sequence differing by a single 
nucleotide from a NOVX nucleic acid sequence. In another embodiment, the NOVX 
polypeptide is a variant polypeptide described therein, wherein any amino acid specified in 
the chosen sequence is changed to provide a conservative substitution. In one embodiment, 
the invention discloses a method for determining the presence or amount of the NOVX 
polypeptide in a sample. The method involves the steps of: providing a sample; 
introducing the sample to an antibody that binds immunospecifically to the polypeptide; 
and determining the presence or amount of antibody bound to the NOVX polypeptide, 
thereby determining the presence or amount of the NOVX polypeptide in the sample. In 
another embodiment, the invention provides a method for determining the presence of or 
predisposition to a disease associated with altered levels of a NOVX polypeptide in a 
mammalian subject. This method involves the steps of: measuring the level of expression 
of the polypeptide in a sample from the first mammalian subject; and comparing the 
amount of the polypeptide in the sample of the first step to the amount of the polypeptide 
present in a control sample from a second mammalian subject known not to have, or not to 
be predisposed to, the disease, wherein an alteration in the expression level of the 
polypeptide in the first subject as compared to the control sample indicates the presence of 
or predisposition to the disease. 

In a further embodiment, the invention includes a method of identifying an agent 
that binds to a NOVX polypeptide. This method involves the steps of: introducing the 
polypeptide to the agent; and determining whether the agent binds to the polypeptide. In 
various embodiments, the agent is a cellular receptor or a downstream effector. 

In another aspect, the invention provides a method for identifying a potential 
therapeutic agent for use in treatment of a pathology, wherein the pathology is related to 
aberrant expression or aberrant physiological interactions of a NOVX polypeptide. The 
method involves the steps of: providing a cell expressing the NOVX polypeptide and 
having a property or function ascribable to the polypeptide; contacting the cell with a 
composition comprising a candidate substance; and determining whether the substance 
alters the property or function ascribable to the polypeptide; whereby, if an alteration 
observed in the presence of the substance is not observed when the cell is contacted with a 



composition devoid of the substance, the substance is identified as a potential therapeutic 
agent. In another aspect, the invention describes a method for screening for a modulator of 
activity or of latency or predisposition to a pathology associated with the NOVX 
polypeptide. This method involves the following steps: administering a test compound to a 
5 test animal at increased risk for a pathology associated with the NOVX polypeptide, 
wherein the test animal recombinantly expresses the NOVX polypeptide. This method 
involves the steps of measuring the activity of the NOVX polypeptide in the test animal 
after administering the compound of step; and comparing the activity of the protein in the 
test animal with the activity of the NOVX polypeptide in a control animal not administered 
rj 1 0 the polypeptide, wherein a change in the activity of the NOVX polypeptide in the test 

animal relative to the control animal indicates the test compound is a modulator of latency 
of, or predisposition to, a pathology associated with the NOVX polypeptide. In one 
embodiment, the test animal is a recombinant test animal that expresses a test protein 
transgene or expresses the transgene under the control of a promoter at an increased level 
15 relative to a wild-type test animal, and wherein the promoter is not the native gene 
promoter of the transgene. In another aspect, the invention includes a method for 
modulating the activity of the NOVX polypeptide, the method comprising introducing a 
cell sample expressing the NOVX polypeptide with a compound that binds to the 
polypeptide in an amount sufficient to modulate the activity of the polypeptide. 
20 The invention also includes an isolated nucleic acid that encodes a NOVX polypeptide, 
or a fragment, homolog, analog or derivative thereof. In a preferred embodiment, the 
nucleic acid molecule comprises the nucleotide sequence of a naturally occurring allelic 
nucleic acid variant. In another embodiment, the nucleic acid encodes a variant 
polypeptide, wherein the variant polypeptide has the polypeptide sequence of a naturally 
25 occurring polypeptide variant. In another embodiment, the nucleic acid molecule differs by 
a single nucleotide from a NOVX nucleic acid sequence. In one embodiment, the NOVX 
nucleic acid molecule hybridizes under stringent conditions to the nucleotide sequence 
selected from the group consisting of SEQ ID NO: 2n-l, wherein n is an integer between 1 
and 178, or a complement of the nucleotide sequence. In another aspect, the invention 
30 provides a vector or a cell expressing a NOVX nucleotide sequence. 

In one embodiment, the invention discloses a method for modulating the activity of 
a NOVX polypeptide. The method includes the steps of: introducing a cell sample 
expressing the NOVX polypeptide with a compound that binds to the polypeptide in an 
amount sufficient to modulate the activity of the polypeptide. In another embodiment, the 
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invention includes an isolated NOVX nucleic acid molecule comprising a nucleic acid 
sequence encoding a polypeptide comprising a NOVX amino acid sequence or a variant of 
a mature form of the NOVX amino acid sequence, wherein any amino acid in the mature 
form of the chosen sequence is changed to a different amino acid, provided that no more 
than 1 5% of the amino acid residues in the sequence of the mature form are so changed. In 
another embodiment, the invention includes an amino acid sequence that is a variant of the 
NOVX amino acid sequence, in which any amino acid specified in the chosen sequence is 
changed to a different amino acid, provided that no more than 15% of the amino acid 
residues in the sequence are so changed. 

In one embodiment, the invention discloses a NOVX nucleic acid fragment 
encoding at least a portion of a NOVX polypeptide or any variant of the polypeptide, 
wherein any amino acid of the chosen sequence is changed to a different amino acid, 
provided that no more than 10% of the amino acid residues in the sequence are so changed. 
In another embodiment, the invention includes the complement of any of the NOVX 
nucleic acid molecules or a naturally occurring allelic nucleic acid variant. In another 
embodiment, the invention discloses a NOVX nucleic acid molecule that encodes a variant 
polypeptide, wherein the variant polypeptide has the polypeptide sequence of a naturally 
occurring polypeptide variant. In another embodiment, the invention discloses a NOVX 
nucleic acid, wherein the nucleic acid molecule differs by a single nucleotide from a 
NOVX nucleic acid sequence. 

In another aspect, the invention includes a NOVX nucleic acid, wherein one or 
more nucleotides in the NOVX nucleotide sequence is changed to a different nucleotide 
provided that no more than 15% of the nucleotides are so changed. In one embodiment, the 
invention discloses a nucleic acid fragment of the NOVX nucleotide sequence and a 
nucleic acid fragment wherein one or more nucleotides in the NOVX nucleotide sequence 
is changed from that selected from the group consisting of the chosen sequence to a 
different nucleotide provided that no more than 1 5% of the nucleotides are so changed. In 
another embodiment, the invention includes a nucleic acid molecule wherein the nucleic 
acid molecule hybridizes under stringent conditions to a NOVX nucleotide sequence or a 
complement of the NOVX nucleotide sequence. In one embodiment, the invention 
includes a nucleic acid molecule, wherein the sequence is changed such that no more than 
15% of the nucleotides in the coding sequence differ from the NOVX nucleotide sequence 
or a fragment thereof. 



In a further aspect, the invention includes a method for determining the presence or 
amount of the NOVX nucleic acid in a sample. The method involves the steps of: 
providing the sample; introducing the sample to a probe that binds to the nucleic acid 
molecule; and determining the presence or amount of the probe bound to the NOVX 
5 nucleic acid molecule, thereby determining the presence or amount of the NOVX nucleic 
acid molecule in the sample. In one embodiment, the presence or amount of the nucleic 
acid molecule is used as a marker for cell or tissue type. 

In another aspect, the invention discloses a method for determining the presence of or 
predisposition to a disease associated with altered levels of the NOVX nucleic acid 

M 1 0 molecule of in a first mammalian subject. The method involves the steps of: measuring the 

O 

O amount of NOVX nucleic acid in a sample from the first mammalian subject; and 

comparing the amount of the nucleic acid in the sample of step (a) to the amount of NOVX 

in nucleic acid present in a control sample from a second mammalian subject known not to 

have or not be predisposed to, the disease; wherein an alteration in the level of the nucleic 

ss 15 acid in the first subject as compared to the control sample indicates the presence of or 

pi 

5 — predisposition to the disease. 

p Unless otherwise defined, all technical and scientific terms used herein have the 

Jf same meaning as commonly understood by one of ordinary skill in the art to which this 

RJ invention belongs. Although methods and materials similar or equivalent to those 

20 described herein can be used in the practice or testing of the present invent ion, suitable 
methods and materials are described below. All publications, patent applications, patents, 
and other references mentioned herein are incorporated by reference in their entirety. In 
the case of conflict, the present specification, including definitions, will control. In 
addition, the materials, methods, and examples are illustrative only and not intended to be 
25 limiting. 

Other features and advantages of the invention will be apparent from the following 
detailed description and claims. 

30 Detailed Description Of The Invention 

The present invention provides novel nucleotides and polypeptides encoded 
thereby. Included in the invention are the novel nucleic acid sequences, their encoded 
35 polypeptides, antibodies, and other related compounds. The sequences are collectively 
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referred to herein as "NOVX nucleic acids" or ,r NOVX polynucleotides" and the 
corresponding encoded polypeptides are referred to as "NOVX polypeptides" or "NOVX 
proteins." Unless indicated otherwise, "NOVX" is meant to refer to any of the novel 
sequences disclosed herein. Table 1 provides a summary of the NOVX nucleic acids and 
their encoded polypeptides. 



TABLE 1. Sequences and Corresponding SEQ ID Numbers 



NOVX 


Interna] Acc. No. 




Amino 


Ho niolo gy 


No. 




Acid 6 ' 0 








SEQ ID 


SEQ II) 








NO. 


NO. 




la 


vUJ OJZ.Z.\t 1 


1 


2 


-r j— — j— 

niiniQTl pi 316161 










activating factor 












2a 


CG58520-01 


— 


— 


GAB A. (A.) receptor 


2b 


CG5 8520-02 


5 






2c 


CG58520-03 





g 


GABA(A) receptor 


3a 


posxsi 8 m 

V A J J> OJ I O-U I 






vj/\d/\\s\ ) rccepLor 




LUjoj 1 O-U I 


— 


~ 


Beta transducin 


~T 


OH / J>-U 1 


— 


— 


Protein kinase 


— 

a 


LVJJO t /KJ-KJ 1 


— 


— 












acetylnexosamine 










pyroptosphorylase 


7a 


CG58593-01 


17 


18 


" 1C ^ U1 " 1 e — jTj 


8a 


v^VJ J fori -\J 1 






tousled lilce Kinase lilce 


9a 


CG5 85 90-01 


21 





gusnylate kinase lilce 


9b 


CG58590-02 


23 


24 


gucinylate kinase like 


10a 


CG58572-01 


25 


26 


glucosamine phosphate 










N acetyltransferase like 


10b 


CG5 8572-02 


27 


28 


glucosamine phosphate 










N acetyltransferase like 


11a 


CG58564-01 


29 


30 


Protein tyrosine 










phosphatase like 


lib 


CG58564-02 


31 


32 


Protein tyrosine 










phosphatase like 


11c 


CG58564-03 


33 


34 


Dual-Specificity 










phosphatase like 


lid 


CG5 8564-04 


35 


36 


Dual-Specificity 










phosphatase like 


12a 


CG578 19-01 


37 


38 


RPGR interacting 










protein 1 like 


13a 


CG57789-01 


39 


40 


RAS like protein 










RRP22 like 


13b 


CG57789-02 


41 


42 


RAS like protein 










RRP22 like 


14a 


CG57758-01 


43 


44 


sodium/lithium 










dependent 










dicarboxylate 










transporter like 


14b 


CG57758-02 


45 


46 


sodium/lithium 
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dependent 
dicarboxylate 
transporter like 


14c 


CG57758-03 


47 


48 


sodium/lithium 
dependent 
dicarboxylate 
transporter like 


14d 


CG57758-04 


49 


50 


sodium/lithium 
dependent 
dicarboxylate 
transporter like 


14e 


CG57758-05 


51 


52 


sodium/lithium 
dependent 
dicarboxylate 
transporter like 


15a 


CG57732-01 


53 


54 


Ca 2+ calmodulin 
dependent protein 
kinase IV kinase like 


15b 


CG57732-02 


55 


56 


Ca 2+ calmodulin 
dependent protein 
kinase IV kinase like 


15c 


CG57732-03 


57 


58 


Ca 2+ calmodulin 
dependent protein 
kinase IV kinase like 


16a 


CG57709-01 


59 


60 


TCE2 like 


17a 


CG57700-01 


61 


62 


hydoxyacylglutathione 
hydrolase like 


17b 


CG57700-02 


63 


64 


hydoxyacylglutathione 
hydrolase like 


17c 


CG57700-03 


65 


66 


hydoxyacylglutathione 
hydrolase like 


17d 


CG57700-04 


67 


68 


hydoxyacylglutathione 
hydrolase like 


18a 


CG58553-01 


69 


70 


vasopressin receptor 
like 


19a 


CG58626-01 


71 


72 


phosphatidic acid 
preferring 

phospholipase Al like 


20a 


CG57597-01 


73 


74 


hypothetical protein 
like 


21a 


CG57804-01 


75 


76 


Talin like 


22a 


CG57551-01 


77 


78 


NAC-1 like 


23a 


CG57411-01 


79 


80 


Kelch like 


24a 


CG57399-01 


81 


82 


phospholipase 
ADRAB-B precursor 
like 


24b 


CG57399-02 


83 


84 


phospholipase 
ADRAB-B precursor 
like 


24c 


CG57399-03 


85 


86 


phospholipase 
ADRAB-B precursor 
like 


25a 


CG59311-01 


87 


88 


acyl-coenzyme A 
thioester hydrolase 
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25b 


CG59311-02 


89 


90 


peroxisomal acyl- 
coenzyme A thioeseter 
hydrolase like 


25c 


CG59311-03 


91 


92 


peroxisomal acyl- 
coenzyme A thioeseter 
hydrolase like 


26a 


CG59309-01 


93 


94 


acyl-coenzyme A 
thioester hydrolase 


27a 


CG57364-01 


95 


96 


CG6896 


28a 


CG59348-01 


97 


98 


cytoplasmic protein 
(patent calls this Cyclin 
L-like) 


29a 


CG59245-01 


99 


100 


glucose 6-phosphatase 


29b 


CG59245-02 


101 


102 


glucose 6-phosphatase 


30a 


CG59241-01 


103 


104 


Amiloride-sensitive 
sodium channel 


31a 


CG58602-01 


105 


106 


FAD binding domain 
containing protein 


32a 


CG58468-01 


107 


108 


Serum Amyloid Protein 


33a 


CG58 183-01 


109 


110 


N-Methyl-D-Aspartate 
receptor 


34a 


CG593 15-01 


111 


112 


Connexin 


35a 


CG59203-01 


113 


114 


lysozyme C 


35b 


CG59203-02 


115 


116 


lysozyme C 


36a 


CG58662-01 


117 


118 


cytoplasmic protein 


36b 


CG58662-02 


119 


120 


cytoplasmic protein 


37a 


CG58584-01 


121 


122 


40S ribosomal protein 
S29 like 


38a 


CG58538-01 


123 


124 


Histone deacetylase 
complex protein 66 like 


39a 


CG59371-01 


125 


126 


expressed cytoplasmic 
protein like 


40a 


CG59346-01 


127 


128 


cortactin binding 
protein 1 like 


41a 


CG578 14-01 


129 


130 


Basic I 19 like homo 
sapiens 


41b 


CG578 14-02 


131 


132 


Basic I 19 like homo 
sapiens 


42a 


CG59327-01 


133 


134 


Monocarboxylate 
transporter 1 like 


43a 


CG59494-01 


135 


136 


myelin P2 like 


44a 


CG59432-01 


137 


138 


chloride channel like 


44b 


CG59432-02 


139 


140 


chloride channel like 


45a 


CG59394-01 


141 


142 


GPCR like 


46a 


CG59383-01 


143 


144 


D6MM5E PROTEIN 
like 


46b 


CG59383-02 


145 


146 


D6MM5E PROTEIN 
like 


47a 


CG5 8526-01 


147 


148 


scramblase like 


48a 


CG57851-01 


149 


150 


sulfotransferase like 


49a 


CG59377-01 


151 


152 


epsin like 


50a 


CG59258-01 


153 


154 


transcriptional activator 
like 


51a 


CG59492-01 


155 


156 


Myosin Head (Motor 



12 



ru 











Domain) like 


52a 


CG59564-01 


157 


158 


Sorting nexin 6 like 


53a 


CG59553-01 


159 


160 


Secretory protein SEC 8 
like 


54a 


CG59545-01 


161 


162 


Placental protein 13 
like 


55a 


CG59435-01 


163 


164 


Nedd-1 like 


55b 


CG59435-02 


165 


166 


Nedd-1 like 


56a 


CG59439-01 


167 


168 


Xenobiotic/medium- 
chain fatty acid:CoA 
ligase form XL-III like 


56b 


CG59439-02 


169 


170 


Xenobiotic/medium- 
chain fatty acidrCoA 
ligase form XL-III like 


57a 


CG59354-01 


171 


172 


phosducin like 


57b 


CG59354-02 


173 


174 


phosducin like 


57c 


CG59354-03 


175 


176 


phosducin like 


58a 


CG593 19-01 


177 


178 


phosducin like 


58b 


CG593 19-02 


179 


180 


phosducin like 


59a 


CG59576-01 


181 


182 


GPCR like 


60a 


CG59557-01 


183 


184 


GPCR like 


61a 


CG59555-01 


185 


186 


GPCR like 


62a 


CG59551-01 


187 


188 


GPCR like 


63a 


CG59540-01 


189 


190 


GPCR like 


64a 


CG59280-01 


191 


192 


GPCR like 


64b 


CG59280-02 


193 


194 


GPCR like 


65a 


CG59568-01 


195 


196 


GPCR like 


66a 


CG59224-01 


197 


198 


GPCR like 


67a 


CG59222-01 


199 


200 


GPCR like 


68a 


CG59220-01 


201 


202 


GPCR like 


69a 


CG592 18-01 


203 


204 


GPCR like 


70a 


CG592 16-01 


205 


206 


GPCR like 


71a 


CG592 14-01 


207 


208 


GPCR like 


72a 


CG59211-01 


209 


210 


GPCR like 


73a 


CG59276-01 


211 


212 


Dihydroorotate 
dehydrogenase like 


74a 


CG59268-01 


213 


214 


monooxygenase like 


75a 


CG59549-01 


215 


216 


H326 like (cytoplasmic 
protein with WD repeat 
domain) 


76a 


CG59641-01 


217 


218 


Acetyl-CoA 
Carboxylase 2 like 


77a 


CG59630-01 


219 


220 


Midnolin like 


78a 


CG59561-01 


221 


222 


ACYL COENZYME A 
THIOESTER 
HYDROLASE like 


79a 


CG59452-01 


223 


224 


CELL 

PROLIFERATION 
RELATED PROTEIN 
CAP like 


80a 


CG59572-01 


225 


226 


Pseudouridine Synthase 
3 like 


80b 


CG59572-02 


227 


228 


Pseudouridine Synthase 
3 like 
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81a 


CG59522-01 


229 


230 


Myosin like 


82a 


CG59520-01 


231 


232 


Farnesyl- 










pyrophosphate 










synthetase like 


83a 


CG59758-01 


233 


234 


UBIQUITIN like 


83b 


CG59758-02 


235 


236 


UBIQUITIN like 


84a 


CG59586-01 


237 


238 


glucokinase like 


85a 


CG59704-01 


239 


240 


serine/threonine kinase 










like 


86a 


CG59628-01 


241 


242 


Short-chain 










dehydrogenase like 


87a 


CG595 16-01 


243 


244 


Calponin like 


87b 


CG595 16-02 


245 


246 


Calponin like 


88a 


CG59671-02 


247 


248 


acyl-coenzyme A 










thioester hydrolase 


89a 


CG56870-01 


249 


250 


NDRG3 like 


89b 


CG56870-02 


251 


252 


NDRG3 like 


89c 


CG56870-03 


253 


254 


NDRG3 like 


89d 


CG56870-04 


255 


256 


NDRG3 like 


89e 


CG56870-05 


257 


258 


NDRG3 like 


90a 


CG59764-01 


259 


260 


Ferritin like 


91a 


CG59710-01 


261 


262 


PI 4 like 


92a 


CG59754-02 


263 


264 


Downs syndrome cell 










adhesion molecule like 


92b 


CG59754-01 


265 


266 


Downs syndrome cell 










adhesion molecule like 


93a 


CG59800-01 


267 


268 


HEPARAN SULFATE 










D-GLUCOSAMINYL 










3-0- 










SULFOTRANSFERAS 










E-3B like 


94a 


CG59761-01 


269 


270 


AXIN 1 (AXIS 










INHIBITION 










PROTEIN 1)(HAXIN) 










like 


95a 


CG59756-01 


271 


272 


JUNCTOPHILIN 










TYPE 2 like 


96a 


CG59708-01 


273 


274 


Ubiquitin carboxyl- 










terminal hydrolase 21 










like 


96b 


CG59708-02 


275 


276 


Ubiquitin carboxyl- 










terminal hydrolase 21 










like 


96c 


CG59708-03 


277 


278 


Ubiquitin carboxyl- 










terminal hydrolase 21 










like 


97a 


CG59559-01 


279 


280 


BA12M19.1.3 like 


98a 


CG59669-01 


281 


282 


carbonyl reductase 










(called NADPH- 










dependent carbonyl 










reductase-like in 
patent) 


99a 


CG5 8624-01 


283 


284 


metal transporter 


100a 


CG59679-01 


285 


286 


carbonyl reductase 
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101a 


CG59644-01 


287 


288 


CGI 2091 (putative 
protein phosphatase) 


102a 


CG59662-01 


289 


290 


Cyclophilin 


103a 


CG59773-01 


291 


292 


Myomegalin 


103b 


CG59773-02 


293 


294 


Myomegalin 


103c 


CG59773-03 


295 


296 


Myomegalin 


104a 


CG57460-01 


297 


298 


PEPTIDYL-PROLYL 
CIS-TRANS 
ISOMERASE like 


105a 


CG57464-01 


299 


300 


N- 

ACETYLTRANSFER 
ASE like 


106a 


CG57466-01 


301 


302 


Acetylglucosaminyltra 
nsferase like 


107a 


CG57468-01 


303 


304 


ABC transporter like 
homo sapiens 


108a 


CG59609-01 


305 


306 


PEPTIDYL-PROLYL 
CIS-TRANS 
ISOMERASE A like 


109a 


CG59613-01 


307 


308 


Proliferating cell 
nuclear antigen like 


110a 


CG596 19-01 


309 


310 


CYTOPLASMIC 
ACTIN 2 like 


111a 


CG59621-01 


311 


312 


SELENOPHOSPHAT 
E SYNTHETASE like 


112a 


CG59625-01 


313 


314 


glucose transporter like 


113a 


CG59887-01 


315 


316 


Amino Acid/Metabolite 
Permease like 


113b 


CG59887-02 


317 


318 


Amino Acid/Metabolite 
Permease like 


114a 


CG59861-01 


319 


320 


RIBULOSE-5- 
PHOSPHATE- 
EPIMERASE like 


114b 


CG59861-02 


321 


322 


RIBULOSE-5- 
PHOSPHATE- 
EPIMERASE like 


115a 


CG59857-01 


323 


324 


Rhotekin like homo 
sapiens 


116a 


CG59855-01 


325 


326 


ATP SYNTHASE 
SUBUNITCIik 


116b 


CG59855-02 


327 


328 


ATP SYNTHASE 
SUBUNITClike 


117a 


CG59807-01 


329 


330 


Zinc finger like 


118a 


CG59805-01 


331 


332 


Zinc finger like 


119a 


CG59928-01 


333 


334 


Universal Stress (USP) 
Domain Containing 
Protein like 


120a 


CG59947-01 


335 


336 


VOLTAGE-GATED 
POTASSIUM 
CHANNEL PROTEIN 
KV3.3 (KSHIIID) like 


121a 


CG59938-01 


337 


338 


arylsulfatase like homo 
sapiens 


122a 


CG59746-01 


339 


340 


ubiquitin-specific 
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processing protease 
like homo sapiens 


123a 


CG88613-01 


341 


342 


INOSITOL 1,4,5- 
TRISPHOSPHATE 3- 
KINASE 

ISOENZYME like 


124a 


CG59993-01 


343 


344 


synaptotagmin II like 


124b 


CG59993-02 


345 


346 


synaptotagmin II like 


125a 


CG59991-01 


347 


348 


ooplasm specific 
protein like 


126a 


CG59987-01 


349 


350 


GTP-RHO binding 
protein 1 (rhophilin) 
like 


126b 


CG59987-02 


351 


352 


GTP-RHO binding 
protein 1 (rhophilin) 
like 


127a 


CG59971-01 


353 


354 


Leucine rich repeat 
(LRR) like 


127b 


CG59971-02 


355 


356 


Leucine rich repeat 
(LRR) like 



Table 1 indicates homology of NOVX nucleic acids to known protein families. 
Thus, the nucleic acids and polypeptides, antibodies and related compounds according to 
the invention corresponding to a NOVX as identified in column 1 of Table 1 will be useful 
in therapeutic and diagnostic applications implicated in, for example, pathologies and 
disorders associated with the known protein families identified in column 5 of Table 1. 

NOVX nucleic acids and their encoded polypeptides are useful in a variety of 
applications and contexts. The various NOVX nucleic acids and polypeptides according to 
the invention are useful as novel members of the protein families according to the presence 
of domains and sequence relatedness to previously described proteins. Additionally, 
NOVX nucleic acids and polypeptides can also be used to identify proteins that are 
members of the family to which the NOVX polypeptides belong. 

Consistent with other known members of the family of proteins, identified in 
column 5 of Table 1, the NOVX polypeptides of the present invention show homology to, 
and contain domains that are characteristic of, other members of such protein 
families. Details of the sequence relatedness and domain analysis for each NOVX are 
presented in Example A. 

The NOVX nucleic acids and polypeptides can also be used to screen for 
molecules, which inhibit or enhance NOVX activity or function. Specifically, the nucleic 
acids and polypeptides according to the invention may be used as targets for the 
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identification of small molecules that modulate or inhibit diseases associated with the 
protein families listed in Table 1 . 

The NOVX nucleic acids and polypeptides are also useful for detecting specific cell 
types. Details of the expression analysis for each NOVX are presented in Example C. 
5 Accordingly, the NOVX nucleic acids, polypeptides, antibodies and related compounds 
according to the invention will have diagnostic and therapeutic applications in the detection 
of a variety of diseases with differential expression in normal vs. diseased tissues, e.g.a 
variety of cancers. 

Additional utilities for NOVX nucleic acids and polypeptides according to the 
10 invention are disclosed herein. 

The present invention is based on the identification of biological macromolecules 
differentially modulated in a pathologic state, disease, or an abnormal condition or state. 
Among the pathologies or diseases of present interest include metabolic diseases including 
those related to endocrinologic disorders, cancers, various tumors and neoplasias, 
15 inflammatory disorders, central nervous system disorders, and similar abnormal conditions 
or states. In very significant embodiments of the present invention, the biological 
macromolecules implicated in the pathologies and conditions are proteins and 
polypeptides, and in such cases the present invention is related as well to the nucleic acids 
pi that encode them. Methods that may be employed to identify relevant biological 

20 macromolecules include any procedures that detect differential expression of nucleic acids 
encoding proteins and polypeptides associated with the disorder, as well as procedures that 
detect the respective proteins and polypeptides themselves. Significant methods that have 
been employed by the present inventors, include GeneCalling ® technology and 
SeqCalling TM technology, disclosed respectively, in U. S. Patent No. 5,871,697, and in U. 
25 S. Ser. No. 09/417,386, filed Oct. 13, 1999, each of which is incorporated herein by 
reference in its entirety. GeneCalling ® is also described in Shimkets, et al., "Gene 
expression analysis by transcript profiling coupled to a gene database query" Nature 
Biotechnology 17:198-803 (1999). 

The invention provides polypeptides and nucleotides encoded thereby that have 
30 been identified as having novel associations with a disease or pathology, or an abnormal 
state or condition, in a mammal. The present invention further identifies a set of proteins 
and polypeptides, including naturally occurring polypeptides, precursor forms or 
proproteins, or mature forms of the polypeptides or proteins, which are implicated as 
targets for therapeutic agents in the treatment of various diseases, pathologies, abnormal 
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states and conditions. A target may be employed in any of a variety of screening 
methodologies in order to identify candidate therapeutic agents which interact with the 
target and in so doing exert a desired or favorable effect. The candidate therapeutic agent 
is identified by screening a large collection of substances or compounds in an important 
embodiment of the invention. Such a collection may comprise a combinatorial library of 
substances or compounds in which, in at least one subset of substances or compounds, the 
individual members are related to each other by simple structural variations based on a 
particular canonical or basic chemical structure. The variations may include, by way of 
nonlimiting example, changes in length or identity of a basic framework of bonded atoms; 
changes in number, composition and disposition of ringed structures, bridge structures, 
alicyclic rings, and aromatic rings; and changes in pendent or substituents atoms or groups 
that are bonded at particular positions to the basic framework of bonded atoms or to the 
ringed structures, the bridge structures, the alicyclic structures, or the aromatic structures. 

A polypeptide or protein described herein, and that serves as a target in the 
screening procedure, includes the product of a naturally occurring polypeptide or precursor 
form or proprotein. The naturally occurring polypeptide, precursor or proprotein includes, 
e.g., the full-length gene product, encoded by the corresponding gene. The naturally 
occurring polypeptide also includes the polypeptide, precursor or proprotein encoded by an 
open reading frame described herein. A "mature" form of a polypeptide or protein arises as 
a result of one or more naturally occurring processing steps as they may occur within the 
cell, including a host cell. The processing steps occur as the gene product arises, e.g., via 
cleavage of the amino-terminal methionine residue encoded by the initiation codon of an 
open reading frame, or the proteolytic cleavage of a signal peptide or leader sequence. 
Thus, a mature form arising from a precursor polypeptide or protein that has residues 1 to 
N, where residue 1 is the N-terminal methionine, would have residues 2 through N 
remaining. Alternatively, a mature form arising from a precursor polypeptide or protein 
having residues 1 to N, in which an amino-terminal signal sequence from residue 1 to 
residue M is cleaved, includes the residues from residue M+l to residue N remaining. A 
"mature" form of a polypeptide or protein may also arise from non-proteolytic post- 
translational modification. Such non-proteolytic processes include, e.g., glycosylation, 
myristylation or phosphorylation. In general, a mature polypeptide or protein may result 
from the operation of only one of these processes, or the combination of any of them. 

As used herein, "identical" residues correspond to those residues in a comparison 
between two sequences where the equivalent nucleotide base or amino acid residue in an 
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alignment of two sequences is the same residue. Residues are alternatively described as 
"similar" or "positive" when the comparisons between two sequences in an alignment show 
that residues in an equivalent position in a comparison are either the same amino acid or a 
conserved amino acid as defined below. 
5 As used herein, a "chemical composition" relates to a composition including at least 

one compound that is either synthesized or extracted from a natural source. A chemical 
compound may be the product of a defined synthetic procedure. Such a synthesized 
compound is understood herein to have defined properties in terms of molecular formula, 
M= molecular structure relating the association of bonded atoms to each other, physical 

g 10 properties such as chromatographic or spectroscopic characterizations, and the like. A 

W3 compound extracted from a natural source is advantageously analyzed by chemical and 

physical methods in order to provide a representation of its defined properties, including its 
molecular formula, molecular structure relating the association of bonded atoms to each 
r other, physical properties such as chromatographic or spectroscopic characterizations, and 

15 the like. 

□ As used herein, a "candidate therapeutic agent" is a chemical compound that 

2! includes at least one substance shown to bind to a target biopolymer. In important 

PJ embodiments of the invention, the target biopolymer is a protein or polypeptide, a nucleic 

acid, a polysaccharide or proteoglycan, or a lipid such as a complex lipid. The method of 
20 identifying compounds that bind to the target effectively eliminates compounds with little 
or no binding affinity, thereby increasing the potential that the identified chemical 
compound may have beneficial therapeutic applications. In cases where the "candidate 
therapeutic agent" is a mixture of more than one chemical compound, subsequent screening 
procedures may be carried out to identify the particular substance in the mixture that is the 
25 binding compound, and that is to be identified as a candidate therapeutic agent. 

As used herein, a "pharmaceutical agent" is provided by screening a candidate 
therapeutic agent using models for a disease state or pathology in order to identify a 
candidate exerting a desired or beneficial therapeutic effect with relation to the disease or 
pathology. Such a candidate that successfully provides such an effect is termed a 
30 pharmaceutical agent herein. Nonlimiting examples of model systems that may be used in 
such screens include particular cell lines, cultured cells, tissue preparations, whole tissues, 
organ preparations, intact organs, and nonhuman mammals. Screens employing at least 
one system, and preferably more than one system, may be employed in order to identify a 
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pharmaceutical agent. Any pharmaceutical agent so identified may be pursued in further 
investigation using human subjects. 



NOVX Nucleic Acids and Polypeptides 

5 

NOVX clones 

NOVX nucleic acids and their encoded polypeptides are useful in a variety of 
applications and contexts. The various NOVX nucleic acids and polypeptides according to 
the invention are useful as novel members of the protein families according to the presence 

10 of domains and sequence relatedness to previously described proteins. Additionally, 
NOVX nucleic acids and polypeptides can also be used to identify proteins that are 
members of the family to which the NOVX polypeptides belong. 

The NOVX genes and their corresponding encoded proteins are useful for 
preventing, treating or ameliorating medical conditions, e.g., by protein or gene therapy. 

1 5 Pathological conditions can be diagnosed by determining the amount of the new protein in 
a sample or by determining the presence of mutations in the new genes. Specific uses are 
described for each of the NOVX genes, based on the tissues in which they are most highly 
expressed. Uses include developing products for the diagnosis or treatment of a variety of 
diseases and disorders. 

20 The NOVX nucleic acids and proteins of the invention are useful in potential 

diagnostic and therapeutic applications and as a research tool. These include serving as a 
specific or selective nucleic acid or protein diagnostic and/or prognostic marker, wherein 
the presence or amount of the nucleic acid or the protein are to be assessed, as well as 
potential therapeutic applications such as the following: (i) a protein therapeutic, (ii) a 

25 small molecule drug target, (iii) an antibody target (therapeutic, diagnostic, drug 

targeting/cytotoxic antibody), (iv) a nucleic acid useful in gene therapy (gene delivery/gene 
ablation), and (v) a composition promoting tissue regeneration in vitro and in vivo (vi) 
biological defense weapon. 

In one specific embodiment, the invention includes an isolated polypeptide 

30 comprising an amino acid sequence selected from the group consisting of: (a) a mature 

form of the amino acid sequence selected from the group consisting of SEQ ID NO: 2n, 

wherein n is an integer between 1 and 1 78; (b) a variant of a mature form of the amino acid 

sequence selected from the group consisting of SEQ ID NO: 2n, wherein n is an integer 

between 1 and 178, wherein any amino acid in the mature form is changed to a different 

35 amino acid, provided that no more than 1 5% of the amino acid residues in the sequence of 
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the mature form are so changed; (c) an amino acid sequence selected from the group 
consisting of SEQ ID NO: 2n, wherein n is an integer between 1 and 178; (d) a variant of 
the amino acid sequence selected from the group consisting of SEQ ID NO:2n, wherein n is 
an integer between 1 and 178 wherein any amino acid specified in the chosen sequence is 
5 changed to a different amino acid, provided that no more than 1 5% of the amino acid 
residues in the sequence are so changed; and (e) a fragment of any of (a) through (d). 

In another specific embodiment, the invention includes an isolated nucleic acid 
molecule comprising a nucleic acid sequence encoding a polypeptide comprising an amino 
acid sequence selected from the group consisting of: (a) a mature form of the amino acid 

10 sequence given SEQ ID NO: 2n, wherein n is an integer between 1 and 178; (b) a variant of 
a mature form of the amino acid sequence selected from the group consisting of SEQ ID 
NO: 2n, wherein n is an integer between 1 and 178 wherein any amino acid in the mature 
form of the chosen sequence is changed to a different amino acid, provided that no more 
than 15% of the amino acid residues in the sequence of the mature form are so changed; (c) 

1 5 the amino acid sequence selected from the group consisting of SEQ ID NO: 2n, wherein n 
is an integer between 1 and 178; (d) a variant of the amino acid sequence selected from the 
group consisting of SEQ ID NO: 2n, wherein n is an integer between 1 and 178, in which 
any amino acid specified in the chosen sequence is changed to a different amino acid, 
provided that no more than 1 5% of the amino acid residues in the sequence are so changed; 

20 (e) a nucleic acid fragment encoding at least a portion of a polypeptide comprising the 

amino acid sequence selected from the group consisting of SEQ ID NO: 2n, wherein n is an 
integer between 1 and 178 or any variant of said polypeptide wherein any amino acid of the 
chosen sequence is changed to a different amino acid, provided that no more than 10% of 
the amino acid residues in the sequence are so changed; and (f) the complement of any of 

25 said nucleic acid molecules. 

In yet another specific embodiment, the invention includes an isolated nucleic acid 
molecule, wherein said nucleic acid molecule comprises a nucleotide sequence selected 
from the group consisting of: (a) the nucleotide sequence selected from the group 
consisting of SEQ ID NO: 2n-l, wherein n is an integer between 1 and 178; (b) a 

30 nucleotide sequence wherein one or more nucleotides in the nucleotide sequence selected 
from the group consisting of SEQ ID NO: 2n-l, wherein n is an integer between 1 and 178 
is changed from that selected from the group consisting of the chosen sequence to a 
different nucleotide provided that no more than 15% of the nucleotides are so changed; (c) 
a nucleic acid fragment of the sequence selected from the group consisting of SEQ ID NO: 



2n-l, wherein n is an integer between 1 and 178; and (d) a nucleic acid fragment wherein 
one or more nucleotides in the nucleotide sequence selected from the group consisting of 
SEQ ID NO: 2n-l, wherein n is an integer between 1 and 178 is changed from that 
selected from the group consisting of the chosen sequence to a different nucleotide 
provided that no more than 1 5% of the nucleotides are so changed. 

One aspect of the invention pertains to isolated nucleic acid molecules that encode 
NOVX polypeptides or biologically active portions thereof. Also included in the invention 
are nucleic acid fragments sufficient for use as hybridization probes to identify NOVX- 
encoding nucleic acids (e.g., NOVX mRNAs) and fragments for use as PCR primers for 
the amplification and/or mutation of NOVX nucleic acid molecules. As used herein, the 
term "nucleic acid molecule" is intended to include DNA molecules (e.g., cDNA or 
genomic DNA), RNA molecules (e.g., mRNA), analogs of the DNA or RNA generated 
using nucleotide analogs, and derivatives, fragments and homologs thereof. The nucleic 
acid molecule may be single-stranded or double-stranded, but preferably is comprised 
double-stranded DNA. 

An NOVX nucleic acid can encode a mature NOVX polypeptide. As used herein, a 
"mature" form of a polypeptide or protein disclosed in the present invention is the product 
of a naturally occurring polypeptide or precursor form or proprotein. The naturally 
occurring polypeptide, precursor or proprotein includes, by way of nonlimiting example, 
the full-length gene product, encoded by the corresponding gene. Alternatively, it may be 
defined as the polypeptide, precursor or proprotein encoded by an ORF described herein. 
The product "mature" form arises, again by way of nonlimiting example, as a result of one 
or .more naturally occurring processing steps as they may take place within the cell, or host 
cell, in which the gene product arises. Examples of such processing steps leading to a 
"mature" form of a polypeptide or protein include the cleavage of the N-terminal 
methionine residue encoded by the initiation codon of an ORF, or the proteolytic cleavage 
of a signal peptide or leader sequence. Thus a mature form arising from a precursor 
polypeptide or protein that has residues 1 to N, where residue 1 is the N-terminal 
methionine, would have residues 2 through N remaining after removal of the N-terminal 
methionine. Alternatively, a mature form arising from a precursor polypeptide or protein 
having residues 1 to N, in which an N-terminal signal sequence from residue 1 to residue 
M is cleaved, would have the residues from residue M+l to residue N remaining. Further 
as used herein, a "mature" form of a polypeptide or protein may arise from a step of post- 
translational modification other than a proteolytic cleavage event. Such additional 
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processes include, by way of non-limiting example, glycosylation, myristoylation or 
phosphorylation. In general, a mature polypeptide or protein may result from the operation 
of only one of these processes, or a combination of any of them. 

The term "probes", as utilized herein, refers to nucleic acid sequences of variable 
5 length, preferably between at least about 1 0 nucleotides (nt), 1 00 nt, or as many as 
approximately, e.g., 6,000 nt, depending upon the specific use. Probes are used in the 
detection of identical, similar, or complementary nucleic acid sequences. Longer length 
probes are generally obtained from a natural or recombinant source, are highly specific, 
and much slower to hybridize than shorter-length oligomer probes. Probes may be single- 

10 or double-stranded and designed to have specificity in PCR, membrane-based hybridization 
technologies, or ELISA-like technologies. 

The term "isolated" nucleic acid molecule, as utilized herein, is one, which is 
separated from other nucleic acid molecules which are present in the natural source of the 
nucleic acid. Preferably, an "isolated" nucleic acid is free of sequences which naturally 

1 5 flank the nucleic acid (i.e., sequences located at the 5'- and 3'-termini of the nucleic acid) in 
the genomic DNA of the organism from which the nucleic acid is derived. For example, in 
various embodiments, the isolated NOVX nucleic acid molecules can contain less than 
about 5 kb, 4 kb, 3 kb, 2 kb, 1 kb, 0.5 kb or 0.1 kb of nucleotide sequences which naturally 
flank the nucleic acid molecule in genomic DNA of the cell/tissue from which the nucleic 

20 acid is derived (e.g., brain, heart, liver, spleen, etc.). Moreover, an "isolated" nucleic acid 
molecule, such as a cDNA molecule, can be substantially free of other cellular material or 
culture medium when produced by recombinant techniques, or of chemical precursors or 
other chemicals when chemically synthesized. 

A nucleic acid molecule of the invention, e.g., a nucleic acid molecule having the 
25 nucleotide sequence SEQ ID NO: 2n-l , wherein n is an integer between 1 and 1 78, or a 
complement of this aforementioned nucleotide sequence, can be isolated using standard 
molecular biology techniques and the sequence information provided herein. Using all or a 
portion of the nucleic acid sequence of SEQ ID NO: 2n-l, wherein n is an integer between 
1 and 178 as a hybridization probe, NOVX molecules can be isolated using standard 
30 hybridization and cloning techniques (e.g., as described in Sambrook, et al, (eds.), 

Molecular Cloning: A Laboratory Manual 2 nd Ed., Cold Spring Harbor Laboratory 
Press, Cold Spring Harbor, NY, 1989; and Ausubel, et al, (eds.), Current Protocols IN 
Molecular Biology, John Wiley & Sons, New York, NY, 1993.) 
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A nucleic acid of the invention can be amplified using cDNA, mRNA or 
alternatively, genomic DNA, as a template and appropriate oligonucleotide primers 
according to standard PCR amplification techniques. The nucleic acid so amplified can be 
cloned into an appropriate vector and characterized by DNA sequence analysis. 
5 Furthermore, oligonucleotides corresponding to NOVX nucleotide sequences can be 
prepared by standard synthetic techniques, e.g., using an automated DNA synthesizer. 

As used herein, the term "oligonucleotide" refers to a series of linked nucleotide 
residues, which oligonucleotide has a sufficient number of nucleotide bases to be used in a 
PCR reaction. A short oligonucleotide sequence may be based on, or designed from, a 

10 genomic or cDNA sequence and is used to amplify, confirm, or reveal the presence of an 
identical, similar or complementary DNA or RNA in a particular cell or tissue. 
Oligonucleotides comprise portions of a nucleic acid sequence having about 10 nt, 50 nt, or 
1 00 nt in length, preferably about 1 5 nt to 30 nt in length. In one embodiment of the 
invention, an oligonucleotide comprising a nucleic acid molecule less than 1 00 nt in length 

1 5 would further comprise at least 6 contiguous nucleotides SEQ ID NO: 2n-l, wherein n is an 
integer between 1 and 178, or a complement thereof. Oligonucleotides may be chemically 
synthesized and may also be used as probes. 

In another embodiment, an isolated nucleic acid molecule of the invention 
comprises a nucleic acid molecule that is a complement of the nucleotide from the group 

20 consisting of SEQ ID NO: 2n-l , wherein n is an integer between 1 and 1 78, or a portion of 
this nucleotide sequence (e.g., a fragment that can be used as a probe or primer or a 
fragment encoding a biologically-active portion of an NOVX polypeptide). A nucleic acid 
molecule that is complementary to the nucleotide sequence from the group consisting of 
SEQ ID NO: 2n-l, wherein n is an integer between 1 and 178 is one that is sufficiently 

25 complementary to the nucleotide sequence from the group consisting of SEQ ID NO: 2n- 1 , 
wherein n is an integer between 1 and 1 78 that it can hydrogen bond with little or no 
mismatches to the nucleotide sequence from the group consisting of SEQ ID NO: 2n-l, 
wherein n is an integer between 1 and 1 78, thereby forming a stable duplex. 

As used herein, the term "complementary" refers to Watson-Crick or Hoogsteen 

30 base pairing between nucleotides units of a nucleic acid molecule, and the term "binding" 
means the physical or chemical interaction between two polypeptides or compounds or 
associated polypeptides or compounds or combinations thereof. Binding includes ionic, 
non-ionic, van der Waals, hydrophobic interactions, and the like. A physical interaction 
can be either direct or indirect. Indirect interactions may be through or due to the effects of 



another polypeptide or compound. Direct binding refers to interactions that do not take 
place through, or due to, the effect of another polypeptide or compound, but instead are 
without other substantial chemical intermediates. 

Fragments provided herein are defined as sequences of at least 6 (contiguous) 
nucleic acids or at least 4 (contiguous) amino acids, a length sufficient to allow for specific 
hybridization in the case of nucleic acids or for specific recognition of an epitope in the 
case of amino acids, respectively, and are at most some portion less than a full length 
sequence. Fragments may be derived from any contiguous portion of a nucleic acid or 
amino acid sequence of choice. Derivatives are nucleic acid sequences or amino acid 
sequences formed from the native compounds either directly or by modification or partial 
substitution. Analogs are nucleic acid sequences or amino acid sequences that have a 
structure similar to, but not identical to, the native compound but differs from it in respect 
to certain components or side chains. Analogs may be synthetic or from a different 
evolutionary origin and may have a similar or opposite metabolic activity compared to wild 
type. Homologs are nucleic acid sequences or amino acid sequences of a particular gene 
that are derived from different species. 

A full-length NOVX clone is identified as containing an ATG translation start 
codon and an in-frame stop codon. Any disclosed NOVX nucleotide sequence lacking an 
ATG start codon therefore encodes a truncated C-terminal fragment of the respective 
NOVX polypeptide, and requires that the corresponding full-length cDNA extend in the 5' 
direction of the disclosed sequence. Any disclosed NOVX nucleotide sequence lacking an 
in-frame stop codon similarly encodes a truncated N-terminal fragment of the respective 
NOVX polypeptide, and requires that the corresponding full-length cDNA extend in the 3' 
direction of the disclosed sequence. 

Derivatives and analogs may be full length or other than full length, if the 
derivative or analog contains a modified nucleic acid or amino acid, as described below. 
Derivatives or analogs of the nucleic acids or proteins of the invention include, but are not 
limited to, molecules comprising regions that are substantially homologous to the nucleic 
acids or proteins of the invention, in various embodiments, by at least about 70%, 80%, or 
95% identity (with a preferred identity of 80-95%) over a nucleic acid or amino acid 
sequence of identical size or when compared to an aligned sequence in which the alignment 
is done by a computer homology program known in the art, or whose encoding nucleic acid 
is capable of hybridizing to the complement of a sequence encoding the aforementioned 
proteins under stringent, moderately stringent, or low stringent conditions. See e.g. 

25 



Ausubel, et al, Current Protocols in Molecular Biology, John Wiley & Sons, New 
York, NY, 1 993, and below. 

A "homologous nucleic acid sequence" or "homologous amino acid sequence," or 
variations thereof, refer to sequences characterized by a homology at the nucleotide level or 
amino acid level as discussed above. Homologous nucleotide sequences encode those 
sequences coding for isoforms of NOVX polypeptides. Isoforms can be expressed in 
different tissues of the same organism as a result of, for example, alternative splicing of 
PvNA. Alternatively, isoforms can be encoded by different genes. In the invention, 
homologous nucleotide sequences include nucleotide sequences encoding for an NOVX 
polypeptide of species other than humans, including, but not limited to: vertebrates, and 
thus can include, e.g., frog, mouse, rat, rabbit, dog, cat cow, horse, and other organisms. 
Homologous nucleotide sequences also include, but are not limited to, naturally occurring 
allelic variations and mutations of the nucleotide sequences set forth herein. A 
homologous nucleotide sequence does not, however, include the exact nucleotide sequence 
encoding human NOVX protein. Homologous nucleic acid sequences include those 
nucleic acid sequences that encode conservative amino acid substitutions (see below) in 
SEQ ID NO: 2n-l, wherein n is an integer between 1 and 178, as well as a polypeptide 
possessing NOVX biological activity. Various biological activities of the NOVX proteins 
are described below. 

An NOVX polypeptide is encoded by the open reading frame ("ORF") of an 
NOVX nucleic acid. An ORF corresponds to a nucleotide sequence that could potentially 
be translated into a polypeptide. A stretch of nucleic acids comprising an ORF is 
uninterrupted by a stop codon. An ORF that represents the coding sequence for a full 
protein begins with an ATG "start" codon and terminates with one of the three "stop" 
codons, namely, TAA, TAG, or TGA. For the purposes of this invention, an ORF may be 
any part of a coding sequence, with or without a start codon, a stop codon, or both. For an 
ORF to be considered as a good candidate for coding for a bona fide cellular protein, a 
minimum size requirement is often set, e.g., a stretch of DNA that would encode a protein 
of 50 amino acids or more. 

The nucleotide sequences determined from the cloning of the human NOVX genes 
allows for the generation of probes and primers designed for use in identifying and/or 
cloning NOVX homologues in other cell types, e.g. from other tissues, as well as NOVX 
homoiogues from other vertebrates. The probe/primer typically comprises substantially 
purified oligonucleotide. The oligonucleotide typically comprises a region of nucleotide 
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sequence that hybridizes under stringent conditions to at least about 12, 25, 50, 100, 150, 
200, 250, 300, 350 or 400 consecutive sense strand nucleotide sequence SEQ ID NO: 2n-l, 
wherein n is an integer between 1 and 178; or an anti-sense strand nucleotide sequence of 
SEQ ID NO: 2n-l, wherein n is an integer between 1 and 178. 

Probes based on the human NOVX nucleotide sequences can be used to detect 
transcripts or genomic sequences encoding the same or homologous proteins. In various 
embodiments, the probe further comprises a label group attached thereto, e.g. the label 
group can be a radioisotope, a fluorescent compound, an enzyme, or an enzyme co-factor. 
Such probes can be used as a part of a diagnostic test kit for identifying cells or tissues 
which mis-express an NOVX protein, such as by measuring a level of an NOVX-encoding 
nucleic acid in a sample of cells from a subject e.g., detecting NOVX mRNA levels or 
determining whether a genomic NOVX gene has been mutated or deleted. 

"A polypeptide having a biologically-active portion of an NOVX polypeptide" 
refers to polypeptides exhibiting activity similar, but not necessarily identical to, an activity 
of a polypeptide of the invention, including mature forms, as measured in a particular 
biological assay, with or without dose dependency. A nucleic acid fragment encoding a 
"biologically-active portion of NOVX" can be prepared by isolating a portion SEQ ID NO: 
2n-l, wherein n is an integer between 1 and 178, that encodes a polypeptide having an 
NOVX biological activity (the biological activities of the NOVX proteins are described 
below), expressing the encoded portion of NOVX protein (e.g., by recombinant expression 
in vitro) and assessing the activity of the encoded portion of NOVX. 

NOVX Nucleic Acid and Polypeptide Variants 

The invention further encompasses nucleic acid molecules that differ from the 
nucleotide sequences shown in SEQ ID NO: 2n-l, wherein n is an integer between 1 and 
1 78 due to degeneracy of the genetic code and thus encode the same NOVX proteins as 
that encoded by the nucleotide sequences shown in SEQ ID NO: 2n-l, wherein n is an 
integer between 1 and 178. In another embodiment, an isolated nucleic acid molecule of 
the invention has a nucleotide sequence encoding a protein having an amino acid sequence 
shown in SEQ ID NO: 2n, wherein n is an integer between 1 and 178. 

In addition to the human NOVX nucleotide sequences shown in SEQ ID NO: 2n-l, 
wherein n is an integer between 1 and 178, it will be appreciated by those skilled in the art 
that DNA sequence polymorphisms that lead to changes in the amino acid sequences of the 
NOVX polypeptides may exist within a population (e.g., the human population). Such 
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genetic polymorphism in the NOVX genes may exist among individuals within a 
population due to natural allelic variation. As used herein, the terms "gene" and 
"recombinant gene" refer to nucleic acid molecules comprising an open reading frame 
(ORF) encoding an NOVX protein, preferably a vertebrate NOVX protein. Such natural 
allelic variations can typically result in 1-5% variance in the nucleotide sequence of the 
NOVX genes. Any and all such nucleotide variations and resulting amino acid 
polymorphisms in the NOVX polypeptides, which are the result of natural allelic variation 
and that do not alter the functional activity of the NOVX polypeptides, are intended to be 
within the scope of the invention. 

Moreover, nucleic acid molecules encoding NOVX proteins from other species, and 
thus that have a nucleotide sequence that differs from the human SEQ ID NO: 2n-l, 
wherein n is an integer between 1 and 1 78 are intended to be within the scope of the 
invention. Nucleic acid molecules corresponding to natural allelic variants and 
homologues of the NOVX cDNAs of the invention can be isolated based on their 
homology to the human NOVX nucleic acids disclosed herein using the human cDNAs, or 
a portion thereof, as a hybridization probe according to standard hybridization techniques 
under stringent hybridization conditions. 

Accordingly, in another embodiment, an isolated nucleic acid molecule of the 
invention is at least 6 nucleotides in length and hybridizes under stringent conditions to the 
nucleic acid molecule comprising the nucleotide sequence of SEQ ID NO: 2n-l, wherein n 
is an integer between 1 and 178. In another embodiment, the nucleic acid is at least 10, 25, 
50, 100, 250, 500, 750, 1000, 1500, or 2000 or more nucleotides in length. In yet another 
embodiment, an isolated nucleic acid molecule of the invention hybridizes to the coding 
region. As used herein, the term "hybridizes under stringent conditions" is intended to 
describe conditions for hybridization and washing under which nucleotide sequences at 
least 60% homologous to each other typically remain hybridized to each other. 

Homologs (i.e., nucleic acids encoding NOVX proteins derived from species other 
than human) or other related sequences (e.g., paralogs) can be obtained by low, moderate 
or high stringency hybridization with all or a portion of the particular human sequence as a 
probe using methods well known in the art for nucleic acid hybridization and cloning. 

As used herein, the phrase "stringent hybridization conditions" refers to conditions 
under which a probe, primer or oligonucleotide will hybridize to its target sequence, but to 
no other sequences. Stringent conditions are sequence-dependent and will be different in 
different circumstances. Longer sequences hybridize specifically at higher temperatures 
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than shorter sequences. Generally, stringent conditions are selected to be about 5 °C lower 
than the thermal melting point (Tm) for the specific sequence at a defined ionic strength 
and pH. The Tm is the temperature (under defined ionic strength, pH and nucleic acid 
concentration) at which 50% of the probes complementary to the target sequence hybridize 
5 to the target sequence at equilibrium. Since the target sequences are generally present at 
excess, at Tm, 50% of the probes are occupied at equilibrium. Typically, stringent 
conditions will be those in which the salt concentration is less than about 1.0 M sodium 
ion, typically about 0.01 to 1.0 M sodium ion (or other salts) at 
pH 7.0 to 8.3 and the temperature is at least about 30°C for short probes, primers or 

10 oligonucleotides (e.g., 10 nt to 50 nt) and at least about 60°C for longer probes, primers 
and oligonucleotides. Stringent conditions may also be achieved with the addition of 
destabilizing agents, such as formamide. 

Stringent conditions are known to those skilled in the art and can be found in 
Ausubel, et al, (eds.), Current Protocols IN Molecular BIOLOGY, John Wiley & Sons, 

15 N.Y. (1989), 6.3.1-6.3.6. Preferably, the conditions are such that sequences at least about 
65%, 70%, 75%, 85%, 90%, 95%, 98%, or 99% homologous to each other typically remain 
hybridized to each other. A non-limiting example of stringent hybridization conditions are 
hybridization in a high salt buffer comprising 6X SSC, 50 mM Tris-HCl (pH 7.5), 1 mM 
EDTA, 0.02% PVP, 0.02% Ficoll, 0.02% BSA, and 500 mg/ml denatured salmon sperm 

20 DNA at 65°C, followed by one or more washes in 0.2X SSC, 0.01 % BSA at 50°C. An 

isolated nucleic acid molecule of the invention that hybridizes under stringent conditions to 
the sequences SEQ ID NO: 2n-l, wherein n is an integer between 1 and 178, corresponds 
to a naturally-occurring nucleic acid molecule. As used herein, a "naturally-occurring" 
nucleic acid molecule refers to an RNA or DNA molecule having a nucleotide sequence 

25 that occurs in nature (e.g., encodes a natural protein). 

In a second embodiment, a nucleic acid sequence that is hybridizable to the nucleic 
acid molecule comprising the nucleotide sequence of SEQ ID NO: 2n-l, wherein n is an 
integer between 1 and 178, or fragments, analogs or derivatives thereof, under conditions 
of moderate stringency is provided. A non-limiting example of moderate stringency 

30 hybridization conditions are hybridization in 6X SSC, 5X Denhardt's solution, 0.5% SDS 
and 100 mg/ml denatured salmon sperm DNA at 55°C, followed by one or more washes in 
IX SSC, 0.1% SDS at 37°C. Other conditions of moderate stringency that may be used are 
well-known within the art. See, e.g., Ausubel, et al. (eds.), 1993, CURRENT PROTOCOLS IN 
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Molecular Biology, John Wiley & Sons, NY, and Kriegler, 1990; Gene Transfer and 
Expression, A Laboratory Manual, Stockton Press, NY. 

In a third embodiment, a nucleic acid that is hybridizable to the nucleic acid 
molecule comprising the nucleotide sequences SEQ ID NO: 2n-l, wherein n is an integer 
5 between 1 and 178, or fragments, analogs or derivatives thereof, under conditions of low 
stringency, is provided. A non-limiting example of low stringency hybridization 
conditions are hybridization in 35% formamide, 5X SSC, 50 mM Tris-HCl (pH 7.5), 5 mM 
EDTA, 0.02% PVP, 0.02% Ficoll, 0.2% BSA, 100 mg/ml denatured salmon sperm DNA, 
10% (wt/vol) dextran sulfate at 40°C, followed by one or more washes in 2X SSC, 25 mM 

1 0 Tris-HCl (pH 7.4), 5 mM EDTA, and 0. 1 % SDS at 50°C. Other conditions of low 

stringency that may be used are well known in the art (e.g., as employed for cross-species 
hybridizations). See, e.g., Ausubel, et oil. (eds.), 1993, CURRENT PROTOCOLS IN 
Molecular Biology, John Wiley & Sons, NY, and Kriegler, 1990, Gene Transfer and 
Expression, A Laboratory Manual, Stockton Press, NY; Shilo and Weinberg, 1981. 

1 5 Proc Natl Acad Sci USA 78: 6789-6792. 

Conservative Mutations 

In addition to naturally-occurring allelic variants of NOVX sequences that may 
20 exist in the population, the skilled artisan will further appreciate that changes can be 

introduced by mutation into the nucleotide sequences SEQ ID NO: 2n-l, wherein n is an 
integer between 1 and 178, thereby leading to changes in the amino acid sequences of the 
encoded NOVX proteins, without altering the functional ability of said NOVX proteins. 
For example, nucleotide substitutions leading to amino acid substitutions at "non-essential" 
25 amino acid residues can be made in the sequence SEQ ID NO: 2n, wherein n is an integer 
between 1 and 178. A "non-essential" amino acid residue is a residue that can be altered 
from the wild-type sequences of the NOVX proteins without altering their biological 
activity, whereas an "essential" amino acid residue is required for such biological activity. 
For example, amino acid residues that are conserved among the NOVX proteins of the 
30 invention are predicted to be particularly non-amenable to alteration. Amino acids for 
which conservative substitutions can be made are well-known within the art. 

Another aspect of the invention pertains to nucleic acid molecules encoding NOVX 
proteins that contain changes in amino acid residues that are not essential for activity. Such 
NOVX proteins differ in amino acid sequence from SEQ ID NO: 2n-l, wherein n is an 
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integer between 1 and 178 yet retain biological activity. In one embodiment, the isolated 
nucleic acid molecule comprises a nucleotide sequence encoding a protein, wherein the 
protein comprises an amino acid sequence at least about 45% homologous to the amino 
acid sequences SEQ ID NO: 2n, wherein n is an integer between 1 and 178. Preferably, the 
5 protein encoded by the nucleic acid molecule is at least about 60% homologous to SEQ ID 
NO: 2n, wherein n is an integer between 1 and 178; more preferably at least about 70% 
homologous SEQ ID NO: 2n, wherein n is an integer between 1 and 178; still more 
preferably at least about 80% homologous to SEQ ID NO: 2n, wherein n is an integer 
between 1 and 178; even more preferably at least about 90% homologous to SEQ ID NO: 

10 2n, wherein n is an integer between 1 and 178; and most preferably at least about 95% 
homologous to SEQ ID NO: 2n, wherein n is an integer between 1 and 178. 

An isolated nucleic acid molecule encoding an NOVX protein homologous to the 
protein of SEQ ID NO: 2n, wherein n is an integer between 1 and 178 can be created by 
introducing one or more nucleotide substitutions, additions or deletions into the nucleotide 

15 sequence of SEQ ID NO: 2n-l, wherein n is an integer between 1 and 178, such that one or 
more amino acid substitutions, additions or deletions are introduced into the encoded 
protein. 

Mutations can be introduced into SEQ ID NO: 2n-l, wherein n is an integer 
between 1 and 178 standard techniques, such as site-directed mutagenesis and 

20 PCR-mediated mutagenesis. Preferably, conservative amino acid substitutions are made at 
one or more predicted, non-essential amino acid residues. A "conservative amino acid 
substitution" is one in which the amino acid residue is replaced with an amino acid residue 
having a similar side chain. Families of amino acid residues having similar side chains 
have been defined within the art. These families include amino acids with basic side chains 

25 (e.g., lysine, arginine, histidine), acidic side chains {e.g., aspartic acid, glutamic acid), 
uncharged polar side chains (e.g., glycine, asparagine, glutamine, serine, threonine, 
tyrosine, cysteine), nonpolar side chains (e.g., alanine, valine, leucine, isoleucine, proline, 
phenylalanine, methionine, tryptophan), beta-branched side chains (e.g., threonine, valine, 
isoleucine) and aromatic side chains (e.g., tyrosine, phenylalanine, tryptophan, histidine). 

30 Thus, a predicted non-essential amino acid residue in the NOVX protein is replaced with 
another amino acid residue from the same side chain family. Alternatively, in another 
embodiment, mutations can be introduced randomly along all or part of an NOVX coding 
sequence, such as by saturation mutagenesis, and the resultant mutants can be screened for 
NOVX biological activity to identify mutants that retain activity. Following mutagenesis 



SEQ ID NO: 2n-l, wherein n is an integer between 1 and 178, the encoded protein can be 
expressed by any recombinant technology known in the art and the activity of the protein 
can be determined. 

The relatedness of amino acid families may also be determined based on side chain 
5 interactions. Substituted amino acids may be fully conserved "strong" residues or fully 
conserved "weak" residues. The "strong" group of conserved amino acid residues may be 
any one of the following groups: STA, NEQK, NHQK, NDEQ, QHRK, MILV, MILF, HY, 
FYW, wherein the single letter amino acid codes are grouped by those amino acids that 
may be substituted for each other. Likewise, the "weak" group of conserved residues may 
10 be any one of the following: CSA, ATV, SAG, STNK, STPA, SGND, SNDEQK, 

NDEQHK, NEQHRK, HFY, wherein the letters within each group represent the single 
letter amino acid code. 

In one embodiment, a mutant NOVX protein can be assayed for (z) the ability to 
form protein:protein interactions with other NOVX proteins, other cell-surface proteins, or 
15 biologically-active portions thereof, (//) complex formation between a mutant NOVX 
protein and an NOVX ligand; or (Hi) the ability of a mutant NOVX protein to bind to an 
intracellular target protein or biologically-active portion thereof; (e.g. avidin proteins). 

In yet another embodiment, a mutant NOVX protein can be assayed for the ability 
to regulate a specific biological function (e.g., regulation of insulin release). 

20 

Antisense Nucleic Acids 

Another aspect of the invention pertains to isolated antisense nucleic acid molecules 
that are hybridizable to or complementary to the nucleic acid molecule comprising the 

25 nucleotide sequence of SEQ ID NO: 2n-l, wherein n is an integer between 1 and 178, or 
fragments, analogs or derivatives thereof. An "antisense" nucleic acid comprises a 
nucleotide sequence that is complementary to a "sense" nucleic acid encoding a protein 
(e.g., complementary to the coding strand of a double-stranded cDNA molecule or 
complementary to an mRNA sequence). In specific aspects, antisense nucleic acid 

30 molecules are provided that comprise a sequence complementary to at least about 10, 25, 
50, 1 00, 250 or 500 nucleotides or an entire NOVX coding strand, or to only a portion 
thereof. Nucleic acid molecules encoding fragments, homologs, derivatives and analogs of 
an NOVX protein of SEQ ID NO: 2n, wherein n is an integer between 1 and 1 78, or 
antisense nucleic acids complementary to an NOVX nucleic acid sequence of SEQ ID NO: 

35 2n-l, wherein n is an integer between 1 and 178, are additionally provided. 
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In one embodiment, an antisense nucleic acid molecule is antisense to a "coding 
region" of the coding strand of a nucleotide sequence encoding an NOVX protein. The 
term "coding region" refers to the region of the nucleotide sequence comprising codons 
which are translated into amino acid residues. In another embodiment, the antisense 
nucleic acid molecule is antisense to a "noncoding region" of the coding strand of a 
nucleotide sequence encoding the NOVX protein. The term "noncoding region" refers to 5' 
and 3' sequences which flank the coding region that are not translated into amino acids 
(i.e., also referred to as 5' and 3' untranslated regions). 

Given the coding strand sequences encoding the NOVX protein disclosed herein, 
antisense nucleic acids of the invention can be designed according to the rules of Watson 
and Crick or Hoogsteen base pairing. The antisense nucleic acid molecule can be 
complementary to the entire coding region of NOVX mRNA, but more preferably is an 
oligonucleotide that is antisense to only a portion of the coding or noncoding region of 
NOVX mRNA. For example, the antisense oligonucleotide can be complementary to the 
region surrounding the translation start site of NOVX mRNA. An antisense 
oligonucleotide can be, for example, about 5, 10, 15, 20, 25, 30, 35, 40, 45 or 50 
nucleotides in length. An antisense nucleic acid of the invention can be constructed using 
chemical synthesis or enzymatic ligation reactions using procedures known in the art. For 
example, an antisense nucleic acid (e.g., an antisense oligonucleotide) can be chemically 
synthesized using naturally-occurring nucleotides or variously modified nucleotides 
designed to increase the biological stability of the molecules or to increase the physical 
stability of the duplex formed between the antisense and sense nucleic acids (e.g., 
phosphorothioate derivatives and acridine substituted nucleotides can be used). 

Examples of modified nucleotides that can be used to generate the antisense nucleic 
acid include: 5-fluorouracil, 5-bromouracil, 5-chlorouracil, 5-iodouracil, hypoxanthine, 
xanthine, 4-acetylcytosine, 5-(carboxyhydroxy I methyl) uracil, 
5-carboxymethylaminomethyl-2-thiouridine, 5-carboxymethylaminomethyluracil, 
dihydrouracil, beta-D-galactosylqueosine, inosine, N6-isopentenyladenine, 

1- methylguanine, 1 -methyl inosine, 2,2-dimethylguanine, 2-methyladenine, 

2- methylguanine, 3-methylcytosine, 5-methylcytosine, N6-adenine, 7-methylguanine, 
5-methylaminomethyluracil, 5-methoxyaminomethyl-2-thiouracil, 
beta-D-mannosylqueosine, 5'-methoxycarboxymethyluracil, 5-methoxyuracil, 
2-methylthio-N6-isopentenyladenine, uracil-5-oxyacetic acid (v), wybutoxosine, 
pseudouracil, queosine, 2-thiocytosine, 5-methyl-2-thiouraciI, 2-thiouraciI, 4-thiouracil, 
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5-methyluracil, uracil-5-oxyacetic acid methylester, uracil-5-oxyacetic acid (v), 
5-methyl-2-thiouracil, 3-(3-amino-3-N-2-carboxypropyl) uracil, (acp3)w, and 
2,6-diaminopurine. Alternatively, the antisense nucleic acid can be produced biologically 
using an expression vector into which a nucleic acid has been subcloned in an antisense 
orientation (i.e., RNA transcribed from the inserted nucleic acid will be of an antisense 
orientation to a target nucleic acid of interest, described further in the following 
subsection). 

The antisense nucleic acid molecules of the invention are typically administered to 
a subject or generated in situ such that they hybridize with or bind to cellular mRNA and/or 
genomic DNA encoding an NOVX protein to thereby inhibit expression of the protein 
(e.g., by inhibiting transcription and/or translation). The hybridization can be by 
conventional nucleotide complementarity to form a stable duplex, or, for example, in the 
case of an antisense nucleic acid molecule that binds to DNA duplexes, through specific 
interactions in the major groove of the double helix. An example of a route of 
administration of antisense nucleic acid molecules of the invention includes direct 
injection at a tissue site. Alternatively, antisense nucleic acid molecules can be modified to 
target selected cells and then administered systemically. For example, for systemic 
administration, antisense molecules can be modified such that they specifically bind to 
receptors or antigens expressed on a selected cell surface (e.g., by linking the antisense 
nucleic acid molecules to peptides or antibodies that bind to cell surface receptors or 
antigens). The antisense nucleic acid molecules can also be delivered to cells using the 
vectors described herein. To achieve sufficient nucleic acid molecules, vector constructs in 
which the antisense nucleic acid molecule is placed under the control of a strong pol II or 
pol III promoter are preferred. 

In yet another embodiment, the antisense nucleic acid molecule of the invention is 
an ot-anomeric nucleic acid molecule. An a-anomeric nucleic acid molecule forms specific 
double-stranded hybrids with complementary RNA in which, contrary to the usual p-units, 
the strands run parallel to each other. See, e.g., Gaultier, et ah, 1987. Nucl. Acids Res. 15: 
6625-6641 . The antisense nucleic acid molecule can also comprise a 

2'-o-methylribonucIeotide (See, e.g., Inoue, et al. 1 987. Nucl Acids Res. 15: 6 1 3 1 -6 1 48) or 
a chimeric RNA-DNA analogue (See, e.g., Inoue, et al, 1987. FEBS Lett. 215: 327-330. 
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Ribozymes and PNA Moieties 

Nucleic acid modifications include, by way of non-limiting example, modified 
bases, and nucleic acids whose sugar phosphate backbones are modified or derivatized. 
These modifications are carried out at least in part to enhance the chemical stability of the 
5 modified nucleic acid, such that they may be used, for example, as antisense binding 
nucleic acids in therapeutic applications in a subject. 

In one embodiment, an antisense nucleic acid of the invention is a ribozyme. 
Ribozymes are catalytic RNA molecules with ribonuclease activity that are capable of 
cleaving a single-stranded nucleic acid, such as an mRNA, to which they have a 

10 complementary region. Thus, ribozymes (e.g., hammerhead ribozymes as described in 
Haselhoff and Gerlach 1988. Nature 334: 585-591) can be used to catalytically cleave 
NOVX mRNA transcripts to thereby inhibit translation of NOVX mRNA. A ribozyme 
having specificity for an NOVX-encoding nucleic acid can be designed based upon the 
nucleotide sequence of an NOVX cDNA disclosed herein (i.e., SEQ ID NO: 2n-l, wherein 

15 n is an integer between 1 and 178). For example, a derivative of a Tetrahymena L-19 IVS 
RNA can be constructed in which the nucleotide sequence of the active site is 
complementary to the nucleotide sequence to be cleaved in an NOVX-encoding mRNA. 
See, e.g., U.S. Patent 4,987,071 to Cech, etal. and U.S. Patent 5,1 16,742 to Cech, et al. 
NOVX mRNA can also be used to select a catalytic RNA having a specific ribonuclease 

20 activity from a pool of RNA molecules. See, e.g., Bartel et al., (1993) Science 
261:1411-1418. 

Alternatively, NOVX gene expression can be inhibited by targeting nucleotide 
sequences complementary to the regulatory region of the NOVX nucleic acid (e.g., the 
NOVX promoter and/or enhancers) to form triple helical structures that prevent 

25 transcription of the NOVX gene in target cells. See, e.g., Helene, 1 991 . Anticancer Drug 
Des. 6: 569-84; Helene, et al. 1992. Ann. N.Y. Acad. Sci. 660: 27-36; Maher, 1992. 
Bioassays 14: 807-15. 

In various embodiments, the NOVX nucleic acids can be modified at the base 
moiety, sugar moiety or phosphate backbone to improve, e.g., the stability, hybridization, 

30 or solubility of the molecule. For example, the deoxyribose phosphate backbone of the 
nucleic acids can be modified to generate peptide nucleic acids. See, e.g., Hyrup, et al., 
1996. Bioorg Med Chem 4: 5-23. As used herein, the terms "peptide nucleic acids" or 
"PNAs" refer to nucleic acid mimics (e.g., DNA mimics) in which the deoxyribose 
phosphate backbone is replaced by a pseudopeptide backbone and only the four natural 



nucleobases are retained. The neutral backbone of PNAs has been shown to allow for 
specific hybridization to DNA and RNA under conditions of low ionic strength. The 
synthesis of PNA oligomers can be performed using standard solid phase peptide synthesis 
protocols as described in Hyrup, et al, 1996. supra; Perry-O'Keefe, et al, 1996. Proc. 
5 Natl. Acad. Sci. USA 93: 14670-14675. 

PNAs of NOVX can be used in therapeutic and diagnostic applications. For 
example, PNAs can be used as antisense or antigene agents for sequence-specific 
modulation of gene expression by, e.g., inducing transcription or translation arrest or 
inhibiting replication. PNAs of NOVX can also be used, for example, in the analysis of 

1 0 single base pair mutations in a gene (e.g. , PNA directed PCR clamping; as artificial 

restriction enzymes when used in combination with other enzymes, e.g., Si nucleases (See, 
Hyrup, et al, \996. supra); or as probes or primers for DNA sequence and hybridization 
(See, Hyrup, et al, 1996, supra; Perry-O'Keefe, et al, 1 996. supra). 

In another embodiment, PNAs of NOVX can be modified, e.g., to enhance their 

1 5 stability or cellular uptake, by attaching lipophilic or other helper groups to PNA, by the 
formation of PNA-DNA chimeras, or by the use of liposomes or other techniques of drug 
delivery known in the art. For example, PNA-DNA chimeras of NOVX can be generated 
that may combine the advantageous properties of PNA and DNA. Such chimeras allow 
DNA recognition enzymes (e.g., RNase H and DNA polymerases) to interact with the 

20 DNA portion while the PNA portion would provide high binding affinity and specificity. 
PNA-DNA chimeras can be linked using linkers of appropriate lengths selected in terms of 
base stacking, number of bonds between the nucleobases, and orientation (see, Hyrup, et 
al., 1996. supra). The synthesis of PNA-DNA chimeras can be performed as described in 
Hyrup, et al, 1996. supra and Finn, et al, 1996. Nucl Acids Res 24: 3357-3363. For 

25 example, a DNA chain can be synthesized on a solid support using standard 
phosphoramidite coupling chemistry, and modified nucleoside analogs, e.g., 
5'-(4-methoxytrityl)amino-5'-deoxy-thymidine phosphoramidite, can be used between the 
PNA and the 5' end of DNA. See, e.g., Mag, et al, 1989. Nucl Acid Res 17: 5973-5988. 
PNA monomers are then coupled in a stepwise manner to produce a chimeric molecule 

30 with a 5' PNA segment and a 3' DNA segment. See, e.g., Finn, et al, 1996. supra. 

Alternatively, chimeric molecules can be synthesized with a 5' DNA segment and a 3' PNA 
segment. See, e.g., Petersen, et al, 1975. Bioorg. Med. Chem. Lett. 5:1119-11 124. 

In other embodiments, the oligonucleotide may include other appended groups such 
as 



peptides (e.g., for targeting host cell receptors in vivo), or agents facilitating transport 
across the cell membrane (see, e.g., Letsinger, et al, 1989. Proc. Natl. Acad. Sci. U.S.A. 
86: 6553-6556; Lemaitre, et al., 1987. Proc. Natl. Acad. Sci. 84: 648-652; PCT Publication 
No. WO88/09810) or the blood-brain barrier (see, e.g., PCT Publication No. WO 
5 89/1 01 34). In addition, oligonucleotides can be modified with hybridization triggered 
cleavage agents (see, e.g., Krol, et al, 1988. BioTechniques 6:958-976) or intercalating 
agents (see, e.g., Zon, 1988. Pharm. Res. 5: 539-549). To this end, the oligonucleotide 
may be conjugated to another molecule, e.g., a peptide, a hybridization triggered 
cross-linking agent, a transport agent, a hybridization-triggered cleavage agent, and the 
10 like. 

NOVX Polypeptides 

A polypeptide according to the invention includes a polypeptide including the 

1 5 amino acid sequence of NOVX polypeptides whose sequences are provided in SEQ ID 
NO: 2n, wherein n is an integer between 1 and 178. The invention also includes a mutant 
or variant protein any of whose residues may be changed from the corresponding residues 
shown in SEQ ID NO: 2n, wherein n is an integer between 1 and 178 while still encoding a 
protein that maintains its NOVX activities and physiological functions, or a functional 

20 fragment thereof. 

In general, an NOVX variant that preserves NOVX-like function includes any 
variant in which residues at a particular position in the sequence have been substituted by 
other amino acids, and further include the possibility of inserting an additional residue or 
residues between two residues of the parent protein as well as the possibility of deleting 

25 one or more residues from the parent sequence. Any amino acid substitution, insertion, or 
deletion is encompassed by the invention. In favorable circumstances, the substitution is a 
conservative substitution as defined above. 

One aspect of the invention pertains to isolated NOVX proteins, and biologically- 
active portions thereof, or derivatives, fragments, analogs or homologs thereof. Also 

30 provided are polypeptide fragments suitable for use as immunogens to raise anti-NOVX 
antibodies. In one embodiment, native NOVX proteins can be isolated from cells or tissue 
sources by an appropriate purification scheme using standard protein purification 
techniques. In another embodiment, NOVX proteins are produced by recombinant DNA 
techniques. Alternative to recombinant expression, an NOVX protein or polypeptide can 

35 be synthesized chemically using standard peptide synthesis techniques. 
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An "isolated" or "purified" polypeptide or protein or biologically-active portion 
thereof is substantially free of cellular material or other contaminating proteins from the 
cell or tissue source from which the NOVX protein is derived, or substantially free from 
chemical precursors or other chemicals when chemically synthesized. The language 
5 "substantially free of cellular material" includes preparations of NOVX proteins in which 
the protein is separated from cellular components of the cells from which it is isolated or 
recombinantly-produced. In one embodiment, the language "substantially free of cellular 
material" includes preparations of NOVX proteins having less than about 30% (by dry 
weight) of non-NOVX proteins (also referred to herein as a "contaminating protein"), more 

1 0 preferably less than about 20% of non-NOVX proteins, still more preferably less than 

about 10% of non-NOVX proteins, and most preferably less than about 5% of non-NOVX 
proteins. When the NOVX protein or biologically-active portion thereof is recombinantly- 
produced, it is also preferably substantially free of culture medium, i.e., culture medium 
represents less than about 20%, more preferably less than about 10%, and most preferably 

1 5 less than about 5% of the volume of the NOVX protein preparation. 

The language "substantially free of chemical precursors or other chemicals" 
includes preparations of NOVX proteins in which the protein is separated from chemical 
precursors or other chemicals that are involved in the synthesis of the protein. In one 
embodiment, the language "substantially free of chemical precursors or other chemicals" 

20 includes preparations of NOVX proteins having less than about 30% (by dry weight) of 
chemical precursors or non-NOVX chemicals, more preferably less than about 20% 
chemical precursors or non-NOVX chemicals, still more preferably less than about 10% 
chemical precursors or non-NOVX chemicals, and most preferably less than about 5% 
chemical precursors or non-NOVX chemicals. 

25 Biologically-active portions of NOVX proteins include peptides comprising amino 

acid sequences sufficiently homologous to or derived from the amino acid sequences of the 
NOVX proteins (e.g., the amino acid sequence shown in SEQ ID NO: 2n, wherein n is an 
integer between 1 and 178) that include fewer amino acids than the full-length NOVX 
proteins, and exhibit at least one activity of an NOVX protein. Typically, biologically- 

30 active portions comprise a domain or motif with at least one activity of the NOVX protein. 
A biologically-active portion of an NOVX protein can be a polypeptide which is, for 
example, 10, 25, 50, 100 or more amino acid residues in length. 
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Moreover, other biologically-active portions, in which other regions of the protein are 
deleted, can be prepared by recombinant techniques and evaluated for one or more of the 
functional activities of a native NOVX protein. 

In an embodiment, the NOVX protein has an amino acid sequence shown SEQ ID 
NO: 2n, wherein n is an integer between 1 and 178. In other embodiments, the NOVX 
protein is substantially homologous to SEQ ID NO: 2n, wherein n is an integer between 1 
and 1 78, and retains the functional activity of the protein of SEQ ID NO: 2n, wherein n is 
an integer between 1 and 178, yet differs in amino acid sequence due to natural allelic 
variation or mutagenesis, as described in detail, below. Accordingly, in another 
embodiment, the NOVX protein is a protein that comprises an amino acid sequence at least 
about 45% homologous to the amino acid sequence SEQ ID NO: 2n, wherein n is an 
integer between 1 and 178, and retains the functional activity of the NOVX proteins of 
SEQ ID NO: 2n, wherein n is an integer between 1 and 178. 

Determining Homology Between Two or More Sequences 

To determine the percent homology of two amino acid sequences or of two nucleic 
acids, the sequences are aligned for optimal comparison purposes (e.g., gaps can be 
introduced in the sequence of a first amino acid or nucleic acid sequence for optimal 
alignment with a second amino or nucleic acid sequence). The amino acid residues or 
nucleotides at corresponding amino acid positions or nucleotide positions are then 
compared. When a position in the first sequence is occupied by the same amino acid 
residue or nucleotide as the corresponding position in the second sequence, then the 
molecules are homologous at that position (i.e., as used herein amino acid or nucleic acid 
"homology" is equivalent to amino acid or nucleic acid "identity"). 

The nucleic acid sequence homology may be determined as the degree of identity between 
two sequences. The homology may be determined using computer programs known in the 
art, such as GAP software provided in the GCG program package. See, Needleman and 
Wunsch, 1970. J Mol Biol 48: 443-453. Using GCG GAP software with the following 
settings for nucleic acid sequence comparison: GAP creation penalty of 5.0 and GAP 
extension penalty of 0.3, the coding region of the analogous nucleic acid sequences 
referred to above exhibits a degree of identity preferably of at least 70%, 75%, 80%, 85%, 
90%, 95%, 98%, or 99%, with the CDS (encoding) part of the DNA from the group 
consisting of SEQ ID NO: 2n-l, wherein n is an integer between 1 and 1 78. 
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The term "sequence identity" refers to the degree to which two polynucleotide or 
polypeptide sequences are identical on a residue-by-residue basis over a particular region 
of comparison. The term "percentage of sequence identity" is calculated by comparing two 
optimally aligned sequences over that region of comparison, determining the number of 
positions at which the identical nucleic acid base (e.g., A, T, C, G, U, or I, in the case of 
nucleic acids) occurs in both sequences to yield the number of matched positions, dividing 
the number of matched positions by the total number of positions in the region of 
comparison (i.e., the window size), and multiplying the result by 1 00 to yield the 
percentage of sequence identity. The term "substantial identity" as used herein denotes a 
characteristic of a polynucleotide sequence, wherein the polynucleotide comprises a 
sequence that has at least 80 percent sequence identity, preferably at least 85 percent 
identity and often 90 to 95 percent sequence identity, more usually at least 99 percent 
sequence identity as compared to a reference sequence over a comparison region. 

Chimeric and Fusion Proteins 

The invention also provides NOVX chimeric or fusion proteins. As used herein, an 
NOVX "chimeric protein" or "fusion protein" comprises an NOVX polypeptide 
operatively-linked to a non-NOVX polypeptide. An "NOVX polypeptide" refers to a 
polypeptide having an amino acid sequence corresponding to an NOVX protein SEQ ID 
NO: 2n, wherein n is an integer between 1 and 178, whereas a "non-NOVX polypeptide" 
refers to a polypeptide having an amino acid sequence corresponding to a protein that is not 
substantially homologous to the NOVX protein, e.g., a protein that is different from the 
NOVX protein and that is derived from the same or a different organism. Within an 
NOVX fusion protein the NOVX polypeptide can correspond to all or a portion of an 
NOVX protein. In one embodiment, an NOVX fusion protein comprises at least one 
biologically-active portion of an NOVX protein. In another embodiment, an NOVX fusion 
protein comprises at least two biologically-active portions of an NOVX protein. In yet 
another embodiment, an NOVX fusion protein comprises at least three biologically-active 
portions of an NOVX protein. Within the fusion protein, the term "operatively-linked" is 
intended to indicate that the NOVX polypeptide and the non-NOVX polypeptide are fused 
in-frame with one another. The non-NOVX polypeptide can be fused to the N-terminus or 
C-terminus of the NOVX polypeptide. 

In one embodiment, the fusion protein is a GST-NO VX fusion protein in which the 
NOVX sequences are fused to the C-terminus of the GST (glutathione S-transferase) 
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sequences. Such fusion proteins can facilitate the purification of recombinant NOVX 
polypeptides. 

In another embodiment, the fusion protein is an NOVX protein containing a heterologous 
signal sequence at its N-terminus. In certain host cells (e.g., mammalian host cells), 
5 expression and/or secretion of NOVX can be increased through use of a heterologous 
signal sequence. 

In yet another embodiment, the fusion protein is an NOVX-immunoglobulin fusion 
protein in which the NOVX sequences are fused to sequences derived from a member of 
the immunoglobulin protein family. The NOVX-immunoglobulin fusion proteins of the 

1 0 invention can be incorporated into pharmaceutical compositions and administered to a 
subject to inhibit an interaction between an NOVX ligand and an NOVX protein on the 
surface of a cell, to thereby suppress NOVX-mediated signal transduction in vivo. The 
NOVX-immunoglobulin fusion proteins can be used to affect the bioavailability of an 
NOVX cognate ligand. Inhibition of the NOVX ligand/NOVX interaction may be useful 

1 5 therapeutically for both the treatment of proliferative and differentiative disorders, as well 
as modulating (e.g. promoting or inhibiting) cell survival. Moreover, the 
NOVX-immunoglobulin fusion proteins of the invention can be used as immunogens to 
produce anti-NOVX antibodies in a subject, to purify NOVX ligands, and in screening 
assays to identify molecules that inhibit the interaction of NOVX with an NOVX ligand. 

20 An NOVX chimeric or fusion protein of the invention can be produced by standard 

recombinant DNA techniques. For example, DNA fragments coding for the different 
polypeptide sequences are Iigated together in-frame in accordance with conventional 
techniques, e.g., by employing blunt-ended or stagger-ended termini for ligation, restriction 
enzyme digestion to provide for appropriate termini, filling-in of cohesive ends as 

25 appropriate, alkaline phosphatase treatment to avoid undesirable joining, and enzymatic 
ligation. In another embodiment, the fusion gene can be synthesized by conventional 
techniques including automated DNA synthesizers. Alternatively, PCR amplification of 
gene fragments can be carried out using anchor primers that give rise to complementary 
overhangs between two consecutive gene fragments that can subsequently be annealed and 

30 reamplified to generate a chimeric gene sequence (see, e.g., Ausubel, et al. (eds.) Current 
Protocols in Molecular Biology, John Wiley & Sons, 1992). Moreover, many 
expression vectors are commercially available that already encode a fusion moiety (e.g., a 
GST polypeptide). An NOVX-encoding nucleic acid can be cloned into such an expression 
vector such that the fusion moiety is linked in-frame to the NOVX protein. 
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NOVX Agonists and Antagonists 

The invention also pertains to variants of the NOVX proteins that function as either 
5 NOVX agonists (i.e., mimetics) or as NOVX antagonists. Variants of the NOVX protein 
can be generated by mutagenesis (e.g., discrete point mutation or truncation of the NOVX 
protein). An agonist of the NOVX protein can retain substantially the same, or a subset of, 
the biological activities of the naturally occurring form of the NOVX protein. An 
antagonist of the NOVX protein can inhibit one or more of the activities of the naturally 

1 0 occurring form of the NOVX protein by, for example, competitively binding to a 

downstream or upstream member of a cellular signaling cascade which includes the NOVX 
protein. Thus, specific biological effects can be elicited by treatment with a variant of 
limited function. In one embodiment, treatment of a subject with a variant having a subset 
of the biological activities of the naturally occurring form of the protein has fewer side 

1 5 effects in a subject relative to treatment with the naturally occurring form of the NOVX 
proteins. 

Variants of the NOVX proteins that function as either NOVX agonists (i.e., 
mimetics) or as NOVX antagonists can be identified by screening combinatorial libraries of 
mutants (e.g., truncation mutants) of the NOVX proteins for NOVX protein agonist or 

20 antagonist activity. In one embodiment, a variegated library of NOVX variants is 

generated by combinatorial mutagenesis at the nucleic acid level and is encoded by a 
variegated gene library. A variegated library of NOVX variants can be produced by, for 
example, enzymatically ligating a mixture of synthetic oligonucleotides into gene 
sequences such that a degenerate set of potential NOVX sequences is expressible as 

25 individual polypeptides, or alternatively, as a set of larger fusion proteins (e.g. , for phage 
display) containing the set of NOVX sequences therein. There are a variety of methods 
which can be used to produce libraries of potential NOVX variants from a degenerate 
oligonucleotide sequence. Chemical synthesis of a degenerate gene sequence can be 
performed in an automatic DNA synthesizer, and the synthetic gene then ligated into an 

30 appropriate expression vector. Use of a degenerate set of genes allows for the provision, in 
one mixture, of all of the sequences encoding the desired set of potential NOVX sequences. 
Methods for synthesizing degenerate oligonucleotides are well-known within the art. See, 
e.g., Narang, 1983. Tetrahedron 39: 3; Itakura, etal., 1984. Annu. Rev. Biochem. 53: 323; 
Itakura, et al, 1984. Science 198: 1056; Ike, et al., 1983. Nucl. Acids Res. 1 1 : 477. 
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Polypeptide Libraries 

In addition, libraries of fragments of the NOVX protein coding sequences can be 
5 used to generate a variegated population of NOVX fragments for screening and subsequent 
selection of variants of an NOVX protein. In one embodiment, a library of coding 
sequence fragments can be generated by treating a double stranded PCR fragment of an 
NOVX coding sequence with a nuclease under conditions wherein nicking occurs only 
about once per molecule, denaturing the double stranded DNA, renaturing the DNA to 

1 0 form double-stranded DNA that can include sense/antisense pairs from different nicked 
products, removing single stranded portions from reformed duplexes by treatment with Si 
nuclease, and ligating the resulting fragment library into an expression vector. By this 
method, expression libraries can be derived which encodes N-terminal and internal 
fragments of various sizes of the NOVX proteins. 

1 5 Various techniques are known in the art for screening gene products of 

combinatorial libraries made by point mutations or truncation, and for screening cDNA 
libraries for gene products having a selected property. Such techniques are adaptable for 
rapid screening of the gene libraries generated by the combinatorial mutagenesis of NOVX 
proteins. The most widely used techniques, which are amenable to high throughput 

20 analysis, for screening large gene libraries typically include cloning the gene library into 
replicable expression vectors, transforming appropriate cells with the resulting library of 
vectors, and expressing the combinatorial genes under conditions in which detection of a 
desired activity facilitates isolation of the vector encoding the gene whose product was 
detected. Recursive ensemble mutagenesis (REM), a new technique that enhances the 

25 frequency of functional mutants in the libraries, can be used in combination with the 
screening assays to identify NOVX variants. See, e.g., Arkin and Yourvan, 1992. Proc. 
Natl. Acad. Sci. USA 89: 781 1-7815; Delgrave, et ah, 1993. Protein Engineering 
6:327-331. 

30 NOVX Antibodies 

The term "antibody" as used herein refers to immunoglobulin molecules and 
immunologically active portions of immunoglobulin (Ig) molecules, i.e., molecules that 
contain an antigen binding site that specifically binds (immunoreacts with) an antigen. 
Such antibodies include, but are not limited to, polyclonal, monoclonal, chimeric, single 
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chain, F a b, F ab - and F (ab ')2 fragments, and an F ab expression library. In general, antibody 
molecules obtained from humans relates to any of the classes IgG, IgM, IgA, IgE and IgD, 
which differ from one another by the nature of the heavy chain present in the molecule. 
Certain classes have subclasses as well, such as IgGi, IgG 2 , and others. Furthermore, in 
5 humans, the light chain may be a kappa chain or a lambda chain. Reference herein to 

antibodies includes a reference to all such classes, subclasses and types of human antibody 
species. 

An isolated protein of the invention intended to serve as an antigen, or a portion or 
M= fragment thereof, can be used as an immunogen to generate antibodies that 

O 1 0 immunospecifically bind the antigen, using standard techniques for polyclonal and 

monoclonal antibody preparation. The full-length protein can be used or, alternatively, the 
u3 invention provides antigenic peptide fragments of the antigen for use as immunogens. An 

5? antigenic peptide fragment comprises at least 6 amino acid residues of the amino acid 

s sequence of the full length protein, such as an amino acid sequence shown in SEQ ID NO: 

J°i 1 5 2n, wherein n is an integer between 1 and 1 78, and encompasses an epitope thereof such 

that an antibody raised against the peptide forms a specific immune complex with the full 
p length protein or with any fragment that contains the epitope. Preferably, the antigenic 

MJ peptide comprises at least 1 0 amino acid residues, or at least 1 5 amino acid residues, or at 

least 20 amino acid residues, or at least 30 amino acid residues. Preferred epitopes 
20 encompassed by the antigenic peptide are regions of the protein that are located on its 

surface; commonly these are hydrophilic regions. 

In certain embodiments of the invention, at least one epitope encompassed by the 
antigenic peptide is a region of NOVX that is located on the surface of the protein, e.g., a 
hydrophilic region. A hydrophobicity analysis of the human NOVX protein sequence will 

25 indicate which regions of a NOVX polypeptide are particularly hydrophilic and, therefore, 
are likely to encode surface residues useful for targeting antibody production. As a means 
for targeting antibody production, hydropathy plots showing regions of hydrophilicity and 
hydrophobicity may be generated by any method well known in the art, including, for 
example, the Kyte Doolittle or the Hopp Woods methods, either with or without Fourier 

30 transformation. See, e.g., Hopp and Woods, 1 981 , Proc. Nat. Acad. Set. USA 78: 3824- 
3828; Kyte and Doolittle 1982, J. Mol. Biol. 1 57: 105-142, each incorporated herein by 
reference in their entirety. Antibodies that are specific for one or more domains within an 
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antigenic protein, or derivatives, fragments, analogs or homologs thereof, are also provided 
herein. 

A protein of the invention, or a derivative, fragment, analog, homolog or ortholog 
thereof, may be utilized as an immunogen in the generation of antibodies that 
immunospecifically bind these protein components. 

Various procedures known within the art may be used for the production of 
polyclonal or monoclonal antibodies directed against a protein of the invention, or against 
derivatives, fragments, analogs homologs or orthologs thereof (see, for example, 
Antibodies: A Laboratory Manual, Harlow E, and Lane D, 1988, Cold Spring Harbor 
Laboratory Press, Cold Spring Harbor, NY, incorporated herein by reference). Some of 
these antibodies are discussed below. 

Polyclonal Antibodies 

For the production of polyclonal antibodies, various suitable host animals (e.g., 
rabbit, goat, mouse or other mammal) may be immunized by one or more injections with 
the native protein, a synthetic variant thereof, or a derivative of the foregoing. An 
appropriate immunogenic preparation can contain, for example, the naturally occurring 
immunogenic protein, a chemically synthesized polypeptide representing the immunogenic 
protein, or a recombinantly expressed immunogenic protein. Furthermore, the protein may 
be conjugated to a second protein known to be immunogenic in the mammal being 
immunized. Examples of such immunogenic proteins include but are not limited to 
keyhole limpet hemocyanin, serum albumin, bovine thyroglobulin, and soybean trypsin 
inhibitor. The preparation can further include an adjuvant. Various adjuvants used to 
increase the immunological response include, but are not limited to, Freund's (complete 
and incomplete), mineral gels (e.g., aluminum hydroxide), surface active substances (e.g., 
lysolecithin, pluronic polyols, polyanions, peptides, oil emulsions, dinitrophenol, etc.), 
adjuvants usable in humans such as Bacille Calmette-Guerin and Corynebacterium parvum, 
or similar immunostimulatory agents. Additional examples of adjuvants which can be 
employed include MPL-TDM adjuvant (monophosphoryl Lipid A, synthetic trehalose 
dicorynomycolate). 

The polyclonal antibody molecules directed against the immunogenic protein can 
be isolated from the mammal (e.g., from the blood) and further purified by well known 
techniques, such as affinity chromatography using protein A or protein G, which provide 
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primarily the IgG fraction of immune serum. Subsequently, or alternatively, the specific 
antigen which is the target of the immunoglobulin sought, or an epitope thereof, may be 
immobilized on a column to purify the immune specific antibody by immunoaffinity 
chromatography. Purification of immunoglobulins is discussed, for example, by D. 
Wilkinson (The Scientist, published by The Scientist, Inc., Philadelphia PA, Vol. 14, No. 8 
(April 17, 2000), pp. 25-28). 

Monoclonal Antibodies 

The term "monoclonal antibody" (MAb) or "monoclonal antibody composition", as 
used herein, refers to a population of antibody molecules that contain only one molecular 
species of antibody molecule consisting of a unique light chain gene product and a unique 
heavy chain gene product. In particular, the complementarity determining regions (CDRs) 
of the monoclonal antibody are identical in all the molecules of the population. MAbs thus 
contain an antigen binding site capable of immunoreacting with a particular epitope of the 
antigen characterized by a unique binding affinity for it. 

Monoclonal antibodies can be prepared using hybridoma methods, such as those 
described by Kohler and Milstein, Nature, 256:495 ( 1 975). In a hybridoma method, a 
mouse, hamster, or other appropriate host animal, is typically immunized with an 
immunizing agent to elicit lymphocytes that produce or are capable of producing antibodies 
that will specifically bind to the immunizing agent. Alternatively, the lymphocytes can be 
immunized in vitro. 

The immunizing agent will typically include the protein antigen, a fragment thereof 
or a fusion protein thereof. Generally, either peripheral blood lymphocytes are used if cells 
of human origin are desired, or spleen cells or lymph node cells are used if non-human 
mammalian sources are desired. The lymphocytes are then fused with an immortalized cell 
line using a suitable fusing agent, such as polyethylene glycol, to form a hybridoma cell 
[Goding, Monoclonal Antibodies: Principles and Practice , Academic Press, (1986) pp. 59- 
103]. Immortalized cell lines are usually transformed mammalian cells, particularly 
myeloma cells of rodent, bovine and human origin. Usually, rat or mouse myeloma cell 
lines are employed. The hybridoma cells can be cultured in a suitable culture medium that 
preferably contains one or more substances that inhibit the growth or survival of the 
unfused, immortalized cells. For example, if the parental cells lack the enzyme 
hypoxanthine guanine phosphoribosyl transferase (HGPRT or HPRT), the culture medium 
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for the hybridomas typically will include hypoxanthine, amine-pterin, and thymidine ("HAT 
medium"), which substances prevent the growth of HGPRT-deficient cells. 
Preferred immortalized cell lines are those that fuse efficiently, support stable high level 
expression of antibody by the selected antibody-producing cells, and are sensitive to a 
5 medium such as HAT medium. More preferred immortalized cell lines are murine 
myeloma lines, which can be obtained, for instance, from the Salk Institute Cell 
Distribution Center, San Diego, California and the American Type Culture Collection, 
Manassas, Virginia. Human myeloma and mouse-human heteromyeloma cell lines also 
have been described for the production of human monoclonal antibodies [Kozbor, J. 

1 0 Immunol., 133 :300 1 ( 1 984); Brodeur et al., Monoclonal Antibody Production Techniques 
and Applications. Marcel Dekker, Inc., New York, (1987) pp. 51-63]. 

The culture medium in which the hybridoma cells are cultured can then be assayed 
for the presence of monoclonal antibodies directed against the antigen. Preferably, the 
binding specificity of monoclonal antibodies produced by the hybridoma cells is 

1 5 determined by immunoprecipitation or by an in vitro binding assay, such as 

radioimmunoassay (RIA) or enzyme-linked immunoabsorbent assay (ELISA). Such 
techniques and assays are known in the art. The binding affinity of the monoclonal 
antibody can, for example, be determined by the Scatchard analysis of Munson and Pollard, 
Anal. Biochem.. 107:220 (1980). It is an objective, especially important in therapeutic 

20 applications of monoclonal antibodies, to identify antibodies having a high degree of 
specificity and a high binding affinity for the target antigen. 

After the desired hybridoma cells are identified, the clones can be subcloned by 
limiting dilution procedures and grown by standard methods (Goding,1986). Suitable 
culture media for this purpose include, for example, Dulbecco's Modified Eagle's Medium 

25 and RPMI-1640 medium. Alternatively, the hybridoma cells can be grown in vivo as 
ascites in a mammal. 

The monoclonal antibodies secreted by the subclones can be isolated or purified from the 
culture medium or ascites fluid by conventional immunoglobulin purification procedures 
such as, for example, protein A-Sepharose, hydroxylapatite chromatography, gel 
30 electrophoresis, dialysis, or affinity chromatography. 

The monoclonal antibodies can also be made by recombinant DNA methods, such 
as those described in U.S. Patent No. 4,816,567. DNA encoding the monoclonal antibodies 
of the invention can be readily isolated and sequenced using conventional procedures (e.g., 
by using oligonucleotide probes that are capable of binding specifically to genes encoding 



the heavy and light chains of murine antibodies). The hybridoma cells of the invention 
serve as a preferred source of such DNA. Once isolated, the DNA can be placed into 
expression vectors, which are then transfected into host cells such as simian COS cells, 
Chinese hamster ovary (CHO) cells, or myeloma cells that do not otherwise produce 
immunoglobulin protein, to obtain the synthesis of monoclonal antibodies in the 
recombinant host cells. The DNA also can be modified, for example, by substituting the 
coding sequence for human heavy and light chain constant domains in place of the 
homologous murine sequences (U.S. Patent No. 4,816,567; Morrison, Nature 368, 812-13 
(1994)) or by covalently joining to the immunoglobulin coding sequence all or part of the 
coding sequence for a non-immunoglobulin polypeptide. Such a non-immunoglobulin 
polypeptide can be substituted for the constant domains of an antibody of the invention, or 
can be substituted for the variable domains of one antigen-combining site of an antibody of 
the invention to create a chimeric bivalent antibody. 

Humanized Antibodies 

The antibodies directed against the protein antigens of the invention can further 
comprise humanized antibodies or human antibodies. These antibodies are suitable for 
administration to humans without engendering an immune response by the human against 
the administered immunoglobulin. Humanized forms of antibodies are chimeric 
immunoglobulins, immunoglobulin chains or fragments thereof (such as Fv, Fab, Fab', 
F(ab') 2 or other antigen-binding subsequences of antibodies) that are principally comprised 
of the sequence of a human immunoglobulin, and contain minimal sequence derived from a 
non-human immunoglobulin. Humanization can be performed following the method of 
Winter and co-workers (Jones et ah, Nature, 321:522-525 (1986); Riechmann et al., Nature , 
332:323-327 (1988); Verhoeyen et al., Science. 239:1534-1536 (1988)), by substituting 
rodent CDRs or CDR sequences for the corresponding sequences of a human antibody. 
(See also U.S. Patent No. 5,225,539.) In some instances, Fv framework residues of the 
human immunoglobulin are replaced by corresponding non-human residues. Humanized 
antibodies can also comprise residues which are found neither in the recipient antibody nor 
in the imported CDR or framework sequences. In general, the humanized antibody will 
comprise substantially all of at least one, and typically two, variable domains, in which all 
or substantially all of the CDR regions correspond to those of a non-human 
immunoglobulin and all or substantially all of the framework regions are those of a human 
immunoglobulin consensus sequence. The humanized antibody optimally also will 
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comprise at least a portion of an immunoglobulin constant region (Fc), typically that of a 
human immunoglobulin (Jones et al., 1986; Riechmann et al., 1988; and Presta, Curr. Op. 
Struct. Biol.. 2:593-596 (1992)). 



5 Human Antibodies 

Fully human antibodies essentially relate to antibody molecules in which the entire 
sequence of both the light chain and the heavy chain, including the CDRs, arise from 
human genes. Such antibodies are termed "human antibodies", or "fully human 
antibodies" herein. Human monoclonal antibodies can be prepared by the trioma 

10 technique; the human B-cell hybridoma technique (see Kozbor, et al., 1983 Immunol 

Today 4: 72) and the EBV hybridoma technique to produce human monoclonal antibodies 
(see Cole, etal., 1985 In: Monoclonal Antibodies and Cancer Therapy, Alan R. Liss, 
Inc., pp. 77-96). Human monoclonal antibodies may be utilized in the practice of the 
present invention and may be produced by using human hybridomas (see Cote, et al., 1983. 

1 5 Proc Natl Acad Sci USA 80: 2026-2030) or by transform ing human B-cells with Epstein 
Barr Virus in vitro (see Cole, et al., 1 985 In: Monoclonal Antibodies and Cancer 
Therapy, Alan R. Liss, Inc., pp. 77-96). 

In addition, human antibodies can also be produced using additional techniques, 
including phage display libraries (Hoogenboom and Winter, J. Mol. Biol.. 227:381 (1991); 

20 Marks et al., J. Mol. Biol.. 222:581 (1991)). Similarly, human antibodies can be made by 
introducing human immunoglobulin loci into transgenic animals, e.g., mice in which the 
endogenous immunoglobulin genes have been partially or completely inactivated. Upon 
challenge, human antibody production is observed, which closely resembles that seen in 
humans in all respects, including gene rearrangement, assembly, and antibody repertoire. 

25 This approach is described, for example, in U.S. Patent Nos. 5,545,807; 5,545,806; 

5,569,825; 5,625,126; 5,633,425; 5,661,016, and in Marks et al. (Bio/Technology 10. 779- 
783 (1992)); Lonberg et al. (Nature 368 856-859 (1994)); Morrison ( Nature 368. 812-13 
(1994)); Fishwild et al,( Nature Biotechnology 14. 845-51 (1996)); Neuberger (Nature 
Biotechnology 14, 826 (1996)); and Lonberg and Huszar ( Intern. Rev. Immunol. 13 65-93 

30 (1995)). 

Human antibodies may additionally be produced using transgenic nonhuman 
animals which are modified so as to produce fully human antibodies rather than the 
animal's endogenous antibodies in response to challenge by an antigen. (See PCT 
publication WO94/02602). The endogenous genes encoding the heavy and light 



immunoglobulin chains in the nonhuman host have been incapacitated, and active loci 
encoding human heavy and light chain immunoglobulins are inserted into the host's 
genome. The human genes are incorporated, for example, using yeast artificial 
chromosomes containing the requisite human DNA segments. An animal which provides 
all the desired modifications is then obtained as progeny by crossbreeding intermediate 
transgenic animals containing fewer than the full complement of the modifications. The 
preferred embodiment of such a nonhuman animal is a mouse, and is termed the 
Xenomouse™ as disclosed in PCT publications WO 96/33735 and WO 96/34096. This 
animal produces B cells which secrete fully human immunoglobulins. The antibodies can 
be obtained directly from the animal after immunization with an immunogen of interest, as, 
for example, a preparation of a polyclonal antibody, or alternatively from immortalized B 
cells derived from the animal, such as hybridomas producing monoclonal antibodies. 
Additionally, the genes encoding the immunoglobulins with human variable regions can be 
recovered and expressed to obtain the antibodies directly, or can be further modified to 
obtain analogs of antibodies such as, for example, single chain Fv molecules. 

An example of a method of producing a nonhuman host, exemplified as a mouse, 
lacking expression of an endogenous immunoglobulin heavy chain is disclosed in U.S. 
Patent No. 5,939,598. It can be obtained by a method including deleting the J segment 
genes from at least one endogenous heavy chain locus in an embryonic stem cell to prevent 
rearrangement of the locus and to prevent formation of a transcript of a rearranged 
immunoglobulin heavy chain locus, the deletion being effected by a targeting vector 
containing a gene encoding a selectable marker; and producing from the embryonic stem 
cell a transgenic mouse whose somatic and germ cells contain the gene encoding the 
selectable marker. 

A method for producing an antibody of interest, such as a human antibody, is 
disclosed in U.S. Patent No. 5,91 6,771 . It includes introducing an expression vector that 
contains a nucleotide sequence encoding a heavy chain into one mammalian host cell in 
culture, introducing an expression vector containing a nucleotide sequence encoding a light 
chain into another mammalian host cell, and fusing the two cells to form a hybrid cell. The 
hybrid cell expresses an antibody containing the heavy chain and the light chain. 

In a further improvement on this procedure, a method for identifying a clinically 
relevant epitope on an immunogen, and a correlative method for selecting an antibody that 
binds immunospecifically to the relevant epitope with high affinity, are disclosed in PCT 
publication WO 99/53049. 
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F ab Fragments and Single Chain Antibodies 

According to the invention, techniques can be adapted for the production of 
single-chain antibodies specific to an antigenic protein of the invention (see e.g., U.S. 
Patent No. 4,946,778). In addition, methods can be adapted for the construction of F ab 
expression libraries (see e.g., Huse, et al., 1989 Science 246: 1275-1281) to allow rapid and 
effective identification of monoclonal F ab fragments with the desired specificity for a 
protein or derivatives, fragments, analogs or homologs thereof. Antibody fragments that 
contain the idiotypes to a protein antigen may be produced by techniques known in the art 
including, but not limited to: (i) an F (ab )2 fragment produced by pepsin digestion of an 
antibody molecule; (ii) an F ab fragment generated by reducing the disulfide bridges of an 
F(ab )2 fragment; (iii) an F ab fragment generated by the treatment of the antibody molecule 
with papain and a reducing agent and (iv) F v fragments. 

Bispecific Antibodies 

Bispecific antibodies are monoclonal, preferably human or humanized, antibodies 
that have binding specificities for at least two different antigens. In the present case, one of 
the binding specificities is for an antigenic protein of the invention. The second binding 
target is any other antigen, and advantageously is a cell-surface protein or receptor or 
receptor subunit. 

Methods for making bispecific antibodies are known in the art. Traditionally, the 
recombinant production of bispecific antibodies is based on the co-expression of two 
immunoglobulin heavy-chain/light-chain pairs, where the two heavy chains have different 
specificities (Milstein and Cuello, Nature, 305:537-539 (1983)). Because of the random 
assortment of immunoglobulin heavy and light chains, these hybridomas (quadromas) 
produce a potential mixture of ten different antibody molecules, of which only one has the 
correct bispecific structure. The purification of the correct molecule is usually 
accomplished by affinity chromatography steps. Similar procedures are disclosed in WO 
93/08829, published 13 May 1993, and in Traunecker et al., EMBO J.. 10:3655-3659 
(1991). 

Antibody variable domains with the desired binding specificities (antibody-antigen 
combining sites) can be fused to immunoglobulin constant domain sequences. The fusion 
preferably is with an immunoglobulin heavy-chain constant domain, comprising at least 
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part of the hinge, CH2, and CH3 regions. It is preferred to have the first heavy-chain 
constant region (CHI) containing the site necessary for light-chain binding present in at 
least one of the fusions. DNAs encoding the immunoglobulin heavy-chain fusions and, if 
desired, the immunoglobulin light chain, are inserted into separate expression vectors, and 
are co-transfected into a suitable host organism. For further details of generating bispecific 
antibodies see, for example, Suresh et al., Methods in Enzvmologv. 121:210 (1986). 

According to another approach described in WO 96/2701 1, the interface between a 
pair of antibody molecules can be engineered to maximize the percentage of heterodimers 
which are recovered from recombinant cell culture. The preferred interface comprises at 
least a part of the CH3 region of an antibody constant domain. In this method, one or more 
small amino acid side chains from the interface of the first antibody molecule are replaced 
with larger side chains (e.g. tyrosine or tryptophan). Compensatory "cavities" of identical 
or similar size to the large side chain(s) are created on the interface of the second antibody 
molecule by replacing large amino acid side chains with smaller ones (e.g. alanine or 
threonine). This provides a mechanism for increasing the yield of the heterodimer over 
other unwanted end-products such as homodimers. 

Bispecific antibodies can be prepared as full length antibodies or antibody fragments (e.g. 
F(ab') 2 bispecific antibodies). Techniques for generating bispecific antibodies from 
antibody fragments have been described in the literature. For example, bispecific 
antibodies can be prepared using chemical linkage. Brennan et al., Science 229:81 (1985) 
describe a procedure wherein intact antibodies are proteolytically cleaved to generate 
F(ab') 2 fragments. These fragments are reduced in the presence of the dithiol complexing 
agent sodium arsenite to stabilize vicinal dithiols and prevent intermolecular disulfide 
formation. The Fab' fragments generated are then converted to thionitrobenzoate (TNB) 
derivatives. One of the Fab'-TNB derivatives is then reconverted to the Fab'-thiol by 
reduction with mercaptoethylamine and is mixed with an equimolar amount of the other 
Fab'-TNB derivative to form the bispecific antibody. The bispecific antibodies produced 
can be used as agents for the selective immobilization of enzymes. 

Additionally, Fab' fragments can be directly recovered from E. coli and chemically 
coupled to form bispecific antibodies. Shalaby et al., J. Exp. Med. 175:217-225 (1992) 
describe the production of a fully humanized bispecific antibody F(ab') 2 molecule. Each 
Fab' fragment was separately secreted from E. coli and subjected to directed chemical 
coupling in vitro to form the bispecific antibody. The bispecific antibody thus formed was 
able to bind to cells overexpressing the ErbB2 receptor and normal human T cells, as well 
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as trigger the lytic activity of human cytotoxic lymphocytes against human breast tumor 
targets. 

Various techniques for making and isolating bispecific antibody fragments directly 
from recombinant cell culture have also been described. For example, bispecific antibodies 
5 have been produced using leucine zippers. Kostelny et al., J. Immunol. 148(5):1547-1553 
(1992). The leucine zipper peptides from the Fos and Jun proteins were linked to the Fab' 
portions of two different antibodies by gene fusion. The antibody homodimers were 
reduced at the hinge region to form monomers and then re-oxidized to form the antibody 
heterodimers. This method can also be utilized for the production of antibody 

1 0 homodimers. The "diabody" technology described by Hollinger et al., Proc. Natl. Acad. 
Sci. USA 90:6444-6448 (1993) has provided an alternative mechanism for making 
bispecific antibody fragments. The fragments comprise a heavy-chain variable domain 
(V H ) connected to a light-chain variable domain (V L ) by a linker which is too short to allow 
pairing between the two domains on the same chain. Accordingly, the V H and V L domains 

1 5 of one fragment are forced to pair with the complementary V L and V H domains of another 
fragment, thereby forming two antigen-binding sites. Another strategy for making 
bispecific antibody fragments by the use of single-chain Fv (sFv) dimers has also been 
reported. See, Gruber et al., J. Immunol. 152:5368 (1994). 

Antibodies with more than two valencies are contemplated. For example, trispecific 

20 antibodies can be prepared. Tutt et al., J. Immunol. 147:60 (199 1). 

Exemplary bispecific antibodies can bind to two different epitopes, at least one of 
which originates in the protein antigen of the invention. Alternatively, an anti-antigenic 
arm of an immunoglobulin molecule can be combined with an arm which binds to a 
triggering molecule on a leukocyte such as a T-cell receptor molecule (e.g. CD2, CD3, 

25 CD28, or B7), or Fc receptors for IgG (FcyR), such as FcyRI (CD64), FcyRII (CD32) and 
FcyRIII (CD16) so as to focus cellular defense mechanisms to the cell expressing the 
particular antigen. Bispecific antibodies can also be used to direct cytotoxic agents to cells 
which express a particular antigen. These antibodies possess an antigen-binding arm and 
an arm which binds a cytotoxic agent or a radionuclide chelator, such as EOTUBE, DPTA, 

30 DOTA, or TETA. Another bispecific antibody of interest binds the protein antigen 
described herein and further binds tissue factor (TF). 
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Heterocon jugate Antibodies 

Heteroconjugate antibodies are also within the scope of the present invention. 
Heteroconjugate antibodies are composed of two covalently joined antibodies. Such 
antibodies have, for example, been proposed to target immune system cells to unwanted 
5 cells (U.S. Patent No. 4,676,980), and for treatment of HIV infection (WO 91/00360; WO 
92/200373; EP 03089). It is contemplated that the antibodies can be prepared in vitro using 
known methods in synthetic protein chemistry, including those involving crosslinking 
agents. For example, immunotoxins can be constructed using a disulfide exchange reaction 
or by forming a thioether bond. Examples of suitable reagents for this purpose include 
1 0 iminothiolate and methyl-4-mercaptobutyrimidate and those disclosed, for example, in U.S. 
Patent No. 4,676,980. 

Effector Function Engineering 

It can be desirable to modify the antibody of the invention with respect to effector 
1 5 function, so as to enhance, e.g., the effectiveness of the antibody in treating cancer. For 
example, cysteine residue(s) can be introduced into the Fc region, thereby allowing 
interchain disulfide bond formation in this region. The homodimeric antibody thus 
generated can have improved internalization capability and/or increased complement- 
mediated cell killing and antibody-dependent cellular cytotoxicity (ADCC). See Caron et 
20 ah, J. Exp Med .. 176 : 1 191-1 195 (1992) and Shopes, J. Immunol .. 148 : 2918-2922 (1992). 
Homodimeric antibodies with enhanced anti-tumor activity can also be prepared using 
heterobifunctional cross-linkers as described in Wolff et al. Cancer Research. 53: 2560- 
2565 (1993). Alternatively, an antibody can be engineered that has dual Fc regions and can 
thereby have enhanced complement lysis and ADCC capabilities. See Stevenson et al., 
25 Anti-Cancer Drug Design. 3:219-230(1989). 

Im m u nocon j ugates 

The invention also pertains to immunoconj ugates comprising an antibody 
conjugated to a cytotoxic agent such as a chemotherapeutic agent, toxin (e.g., an 
30 enzymatically active toxin of bacterial, fungal, plant, or animal origin, or fragments 
thereof), or a radioactive isotope (i.e., a radioconjugate). 

Chemotherapeutic agents useful in the generation of such immunoconjugates have 
been described above. Enzymatically active toxins and fragments thereof that can be used 
include diphtheria A chain, nonbinding active fragments of diphtheria toxin, exotoxin A 
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chain (from Pseudomonas aeruginosa), ricin A chain, abrin A chain, modeccin A chain, 
alpha-sarcin, Aleurites fordii proteins, dianthin proteins, Phytolaca americana proteins 
(PAPI, PAPII, and PAP-S), momordica charantia inhibitor, curcin, crotin, sapaonaria 
officinalis inhibitor, gelonin, mitogellin, restrictocin, phenomycin, enomycin, and the 
tricothecenes. A variety of radionuclides are available for the production of 
radioconjugated antibodies. Examples include 212 Bi, 131 1, 131 In, 90 Y, and 1S6 Re. 

Conjugates of the antibody and cytotoxic agent are made using a variety of 
bifunctional protein-coupling agents such as N-succinimidyl-3-(2-pyridyldithiol) 
propionate (SPDP), iminothiolane (IT), bifunctional derivatives of imidoesters (such as 
dimethyl adipimidate HCL), active esters (such as disuccinimidyl suberate), aldehydes 
(such as glutareldehyde), bis-azido compounds (such as bis (p-azidobenzoyl) 
hexanediamine), bis-diazonium derivatives (such as bis-(p-diazoniumbenzoyl)- 
ethylenediamine), diisocyanates (such as tolyene 2,6-diisocyanate), and bis-active fluorine 
compounds (such as l,5-difiuoro-2,4-dinitrobenzene). For example, a ricin immunotoxin 
can be prepared as described in Vitetta et al., Science. 238 : 1098 (1987). Carbon-14- 
labeled l-isothiocyanatobenzyl-3-methyldiethylene triaminepentaacetic acid (MX-DTPA) 
is an exemplary chelating agent for conjugation of radionucleotide to the antibody. See 
WO94/11026. 

In another embodiment, the antibody can be conjugated to a "receptor" (such 
streptavidin) for utilization in tumor pretargeting wherein the antibody-receptor conjugate 
is administered to the patient, followed by removal of unbound conjugate from the 
circulation using a clearing agent and then administration of a "ligand" (e.g., avidin) that is 
in turn conjugated to a cytotoxic agent. 

Immunoliposomes 

The antibodies disclosed herein can also be formulated as immunoliposomes. 
Liposomes containing the antibody are prepared by methods known in the art, such as 
described in Epstein et al., Proc. Natl. Acad. Sci. USA. 82: 3688 (1985); Hwang et al., 
Proc. Natl Acad. Sci. USA . 77: 4030 (1980); and U.S. Pat Nos. 4,485,045 and 4,544,545. 
Liposomes with enhanced circulation time are disclosed in U.S. Patent No. 5,013,556. 

Particularly useful liposomes can be generated by the reverse-phase evaporation 
method with a lipid composition comprising phosphatidylcholine, cholesterol, and PEG- 
derivatized phosphatidylethanolamine (PEG-PE). Liposomes are extruded through filters 
of defined pore size to yield liposomes with the desired diameter. Fab' fragments of the 
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antibody of the present invention can be conjugated to the liposomes as described in Martin 
et al J. Biol. Chem.. 257: 286-288 (1982) via a disulfide-interchange reaction. A 
chemotherapeutic agent (such as Doxorubicin) is optionally contained within the liposome. 
See Gabizon et al., J. National Cancer Inst.. 81.(19): 1484 (1989). 

5 

Diagnostic Applications of Antibodies Directed Against the Proteins of the Invention 

Antibodies directed against a protein of the invention may be used in methods 
known within the art relating to the localization and/or quantitation of the protein (e.g., for 
use in measuring levels of the protein within appropriate physiological samples, for use in 
10 diagnostic methods, for use in imaging the protein, and the like). In a given embodiment, 
antibodies against the proteins, or derivatives, fragments, analogs or homologs thereof, that 
contain the antigen binding domain, are utilized as pharmacologically-active compounds 
(see below). 

An antibody specific for a protein of the invention can be used to isolate the protein 

1 5 by standard techniques, such as immunoaffinity chromatography or immunoprecipitation. 
Such an antibody can facilitate the purification of the natural protein antigen from cells and 
of recombinantly produced antigen expressed in host cells. Moreover, such an antibody 
can be used to detect the antigenic protein (e.g., in a cellular lysate or cell supernatant) in 
order to evaluate the abundance and pattern of expression of the antigenic protein. 

20 Antibodies directed against the protein can be used diagnostically to monitor protein levels 
in tissue as part of a clinical testing procedure, e.g., to, for example, determine the efficacy 
of a given treatment regimen. Detection can be facilitated by coupling (i.e., physically 
linking) the antibody to a detectable substance. Examples of detectable substances include 
various enzymes, prosthetic groups, fluorescent materials, luminescent materials, 

25 bioluminescent materials, and radioactive materials. Examples of suitable enzymes include 
horseradish peroxidase, alkaline phosphatase, p-galactosidase, or acetylcholinesterase; 
examples of suitable prosthetic group complexes include streptavidin/biotin and 
avidin/biotin; examples of suitable fluorescent materials include umbel liferone, fluorescein, 
fluorescein isothiocyanate, rhodamine, dichlorotriazinylamine fluorescein, dansyl chloride 

30 or phycoerythrin; an example of a luminescent material includes Iuminol; examples of 
bioluminescent materials include luciferase, luciferin, and aequorin, and examples of 
suitable radioactive material include ,25 I, 13, I, 35 S or 3 H. 
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Antibody Therapeutics 

Antibodies of the invention, including polyclonal, monoclonal, humanized and fully 
human antibodies, may used as therapeutic agents. Such agents will generally be employed 
to treat or prevent a disease or pathology in a subject. An antibody preparation, preferably 
5 one having high specificity and high affinity for its target antigen, is administered to the 
subject and will generally have an effect due to its binding with the target. Such an effect 
may be one of two kinds, depending on the specific nature of the interaction between the 
given antibody molecule and the target antigen in question. In the first instance, 
administration of the antibody may abrogate or inhibit the binding of the target with an 

10 endogenous ligand to which it naturally binds. In this case, the antibody binds to the target 
and masks a binding site of the naturally occurring ligand, wherein the ligand serves as an 
effector molecule. Thus the receptor mediates a signal transduction pathway for which 
ligand is responsible. 

Alternatively, the effect may be one in which the antibody elicits a physiological 

1 5 result by virtue of binding to an effector binding site on the target molecule. In this case 
the target, a receptor having an endogenous ligand which may be absent or defective in the 
disease or pathology, binds the antibody as a surrogate effector ligand, initiating a receptor- 
based signal transduction event by the receptor. 

A therapeutically effective amount of an antibody of the invention relates generally 

20 to the amount needed to achieve a therapeutic objective. As noted above, this may be a 
binding interaction between the antibody and its target antigen that, in certain cases, 
interferes with the functioning of the target, and in other cases, promotes a physiological 
response. The amount required to be administered will furthermore depend on the binding 
affinity of the antibody for its specific antigen, and will also depend on the rate at which an 

25 administered antibody is depleted from the free volume other subject to which it is 
administered. Common ranges for therapeutically effective dosing of an antibody or 
antibody fragment of the invention may be, by way of nonlimiting example, from about 0.1 
mg/kg body weight to about 50 mg/kg body weight. Common dosing frequencies may 
range, for example, from twice daily to once a week. 

30 

Pharmaceutical Compositions of Antibodies 

Antibodies specifically binding a protein of the invention, as well as other 
molecules identified by the screening assays disclosed herein, can be administered for the 
treatment of various disorders in the form of pharmaceutical compositions. Principles and 
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considerations involved in preparing such compositions, as well as guidance in the choice 
of components are provided, for example, in Remington : The Science And Practice Of 
Pharmacy 19th ed. (Alfonso R. Gennaro, et al., editors) Mack Pub. Co., Easton, Pa. : 1995; 
Drug Absorption Enhancement : Concepts, Possibilities, Limitations, And Trends, 
Harwood Academic Publishers, Langhorne, Pa., 1994; and Peptide And Protein Drug 
Delivery (Advances In Parenteral Sciences, Vol. 4), 1991, M. Dekker, New York. 

If the antigenic protein is intracellular and whole antibodies are used as inhibitors, 
internalizing antibodies are preferred. However, liposomes can also be used to deliver the 
antibody, or an antibody fragment, into cells. Where antibody fragments are used, the 
smallest inhibitory fragment that specifically binds to the binding domain of the target 
protein is preferred. For example, based upon the variable-region sequences of an 
antibody, peptide molecules can be designed that retain the ability to bind the target protein 
sequence. Such peptides can be synthesized chemically and/or produced by recombinant 
DNA technology. See, e.g., Marasco et al., Proc. Natl. Acad. Sci. USA, 90: 7889-7893 
(1993). The formulation herein can also contain more than one active compound as 
necessary for the particular indication being treated, preferably those with complementary 
activities that do not adversely affect each other. Alternatively, or in addition, the 
composition can comprise an agent that enhances its function, such as, for example, a 
cytotoxic agent, cytokine, chemotherapeutic agent, or growth-inhibitory agent. Such 
molecules are suitably present in combination in amounts that are effective for the purpose 
intended. 

The active ingredients can also be entrapped in microcapsules prepared, for 
example, by coacervation techniques or by interfacial polymerization, for example, 
hydroxymethylcellulose or gelatin-microcapsules and poly-(methylmethacrylate) 
microcapsules, respectively, in colloidal drug delivery systems (for example, liposomes, 
albumin microspheres, microemulsions, nano-particles, and nanocapsules) or in 
macroemulsions. 

The formulations to be used for in vivo administration must be sterile. This is 
readily accomplished by filtration through sterile filtration membranes. 

Sustained-release preparations can be prepared. Suitable examples of sustained- 
release preparations include semipermeable matrices of solid hydrophobic polymers 
containing the antibody, which matrices are in the form of shaped articles, e.g., films, or 
microcapsules. Examples of sustained-release matrices include polyesters, hydrogels (for 
example, poly(2-hydroxyethyI-methacrylate), or poly(vinylalcohol)), polylactides (U.S. 
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Pat. No. 3,773,919), copolymers of L-glutamic acid and y ethyl-L-glutamate, non- 
degradable ethylene-vinyl acetate, degradable lactic acid-glycolic acid copolymers such as 
the LUPRON DEPOT ™ (injectable microspheres composed of lactic acid-glycolic acid 
copolymer and leuprolide acetate), and poly-D-(-)-3-hydroxybutyric acid. While polymers 
5 such as ethylene-vinyl acetate and lactic acid-glycolic acid enable release of molecules for 
over 100 days, certain hydrogels release proteins for shorter time periods. 



ELISA Assay 

An agent for detecting an analyte protein is an antibody capable of binding to an 
analyte protein, preferably an antibody with a detectable label. Antibodies can be 
polyclonal, or more preferably, monoclonal. An intact antibody, or a fragment thereof 
(e.g., F ab or F (ab) 2) can be used. The term "labeled", with regard to the probe or antibody, is 
intended to encompass direct labeling of the probe or antibody by coupling (i.e., physically 
linking) a detectable substance to the probe or antibody, as well as indirect labeling of the 
probe or antibody by reactivity with another reagent that is directly labeled. Examples of 
indirect labeling include detection of a primary antibody using a fluorescently-labeled 
secondary antibody and end-labeling of a DNA probe with biotin such that it can be 
detected with fluorescently-labeled streptavidin. The term "biological sample" is intended 
to include tissues, cells and biological fluids isolated from a subject, as well as tissues, cells 
and fluids present within a subject. Included within the usage of the term "biological 
sample", therefore, is blood and a fraction or component of blood including blood serum, 
blood plasma, or lymph. That is, the detection method of the invention can be used to 
detect an analyte mRNA, protein, or genomic DNA in a biological sample in vitro as well 
as in vivo. For example, in vitro techniques for detection of an analyte mRNA include 
Northern hybridizations and in situ hybridizations. In vitro techniques for detection of an 
analyte protein include enzyme linked immunosorbent assays (ELISAs), Western blots, 
immunoprecipitations, and immunofluorescence. In vitro techniques for detection of an 
analyte genomic DNA include Southern hybridizations. Procedures for conducting 
immunoassays are described, for example in "ELISA: Theory and Practice: Methods in 
Molecular Biology", Vol. 42, J. R. Crowther (Ed.) Human Press, Totowa, NJ, 1995; 
"Immunoassay", E. Diamandis and T. Christopoulus, Academic Press, Inc., San Diego, 
CA, 1996; and "Practice and Thory of Enzyme Immunoassays", P. Tijssen, Elsevier 
Science Publishers, Amsterdam, 1985. Furthermore, in vivo techniques for detection of an 
analyte protein include introducing into a subject a labeled anti-an analyte protein antibody. 
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For example, the antibody can be labeled with a radioactive marker whose presence and 
location in a subject can be detected by standard imaging techniques. 

NOVX Recombinant Expression Vectors and Host Cells 

Another aspect of the invention pertains to vectors, preferably expression vectors, 
containing a nucleic acid encoding an NOVX protein, or derivatives, fragments, analogs or 
homologs thereof. As used herein, the term "vector" refers to a nucleic acid molecule 
capable of transporting another nucleic acid to which it has been linked. One type of 
vector is a "plasmid", which refers to a circular double stranded DNA loop into which 
additional DNA segments can be ligated. Another type of vector is a viral vector, wherein 
additional DNA segments can be ligated into the viral genome. Certain vectors are capable 
of autonomous replication in a host cell into which they are introduced (e.g., bacterial 
vectors having a bacterial origin of replication and episomal mammalian vectors). Other 
vectors (e.g., non-episomal mammalian vectors) are integrated into the genome of a host 
cell upon introduction into the host cell, and thereby are replicated along with the host 
genome. Moreover, certain vectors are capable of directing the expression of genes to 
which they are operatively-linked. Such vectors are referred to herein as "expression 
vectors". In general, expression vectors of utility in recombinant DNA techniques are often 
in the form of plasmids. In the present specification, "plasmid" and "vector" can be used 
interchangeably as the plasmid is the most commonly used form of vector. However, the 
Invention is intended to include such other forms of expression vectors, such as viral 
vectors (e.g., replication defective retroviruses, adenoviruses and adeno-associated 
viruses), which serve equivalent functions. 

The recombinant expression vectors of the invention comprise a nucleic acid of the 
invention in a form suitable for expression of the nucleic acid in a host cell, which means 
that the recombinant expression vectors include one or more regulatory sequences, selected 
on the basis of the host cells to be used for expression, that is operatively-linked to the 
nucleic acid sequence to be expressed. Within a recombinant expression vector, "operably- 
linked" is intended to mean that the nucleotide sequence of interest is linked to the 
regulatory sequence(s) in a manner that allows for expression of the nucleotide sequence 
(e.g., in an in vitro transcription/translation system or in a host cell when the vector is 
introduced into the host cell). 
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The term "regulatory sequence" is intended to includes promoters, enhancers and 
other expression control elements {e.g., polyadenylation signals). Such regulatory 
sequences are described, for example, in Goeddel, Gene Expression Technology: 
Methods in Enzymology 185, Academic Press, San Diego, Calif. (1990). Regulatory 
sequences include those that direct constitutive expression of a nucleotide sequence in 
many types of host cell and those that direct expression of the nucleotide sequence only in 
certain host cells {e.g., tissue-specific regulatory sequences). It will be appreciated by 
those skilled in the art that the design of the expression vector can depend on such factors 
as the choice of the host cell to be transformed, the level of expression of protein desired, 
etc. The expression vectors of the invention can be introduced into host cells to thereby 
produce proteins or peptides, including fusion proteins or peptides, encoded by nucleic 
acids as described herein {e.g., NOVX proteins, mutant forms of NOVX proteins, fusion 
proteins, etc.). 

The recombinant expression vectors of the invention can be designed for expression 
of NOVX proteins in prokaryotic or eukaryotic cells. For example, NOVX proteins can be 
expressed in bacterial cells such as Escherichia coli, insect cells (using baculovirus 
expression vectors) yeast cells or mammalian cells. Suitable host cells are discussed further 
in Goeddel, Gene Expression Technology: Methods in Enzymology 1 85, Academic 
Press, San Diego, Calif. (1990). Alternatively, the recombinant expression vector can be 
transcribed and translated in vitro, for example using T7 promoter regulatory sequences 
and T7 polymerase. 

Expression of proteins in prokaryotes is most often carried out in Escherichia coli with 
vectors containing constitutive or inducible promoters directing the expression of either 
fusion or non-fusion proteins. Fusion vectors add a number of amino acids to a protein 
encoded therein, usually to the amino terminus of the recombinant protein. Such fusion 
vectors typically serve three purposes: (/) to increase expression of recombinant protein; 
(/'/") to increase the solubility of the recombinant protein; and {Hi) to aid in the purification 
of the recombinant protein by acting as a ligand in affinity purification. Often, in fusion 
expression vectors, a proteolytic cleavage site is introduced at the junction of the fusion 
moiety and the recombinant protein to enable separation of the recombinant protein from 
the fusion moiety subsequent to purification of the fusion protein. Such enzymes, and their 
cognate recognition sequences, include Factor Xa, thrombin and enterokinase. Typical 
fusion expression vectors include pGEX (Pharmacia Biotech Inc; Smith and Johnson, 
1988. Gene 67: 31-40), pMAL (New England Biolabs, Beverly, Mass.) and pRIT5 
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(Pharmacia, Piscataway, N.J.) that fuse glutathione S-transferase (GST), maltose E binding 

protein, or protein A, respectively, to the target recombinant protein. 

Examples of suitable inducible non-fusion E. coli expression vectors include pTrc (Amrann 

et al, (1988) Gene 69:301-315) and pET 1 Id (Studier et al, Gene Expression 

Technology: Methods in Enzymology 185, Academic Press, San Diego, Calif. (1990) 

60-89). 

One strategy to maximize recombinant protein expression in E. coli is to express the 
protein in a host bacteria with an impaired capacity to proteolytically cleave the 
recombinant protein. See, e.g., Gottesman, Gene Expression Technology: Methods in 
Enzymology 1 85, Academic Press, San Diego, Calif. (1990) 1 19-128. Another strategy is 
to alter the nucleic acid sequence of the nucleic acid to be inserted into an expression 
vector so that the individual codons for each amino acid are those preferentially utilized in 
E. coli (see, e.g., Wada, et al, 1992. Nucl. Acids Res. 20: 21 1 1-21 18). Such alteration of 
nucleic acid sequences of the invention can be carried out by standard DNA synthesis 
techniques. 

In another embodiment, the NOVX expression vector is a yeast expression vector. 
Examples of vectors for expression in yeast Saccharomyces cerivisae include pYepSecl 
(Baldari, et al, 1987. EMBO J. 6: 229-234), pMFa (Kurjan and Herskowitz, 1982. Ce//30: 
933-943), pJRY88 (Schultz et al., 1987. Gene 54: 1 13-123), pYES2 (Invitrogen 
Corporation, San Diego, Calif.), and picZ (InVitrogen Corp, San Diego, Calif.). 
Alternatively, NOVX can be expressed in insect cells using baculovirus expression vectors. 
Baculovirus vectors available for expression of proteins in cultured insect cells (e.g., SF9 
cells) include the pAc series (Smith, et al., 1983. Mol. Cell Biol. 3: 2156-2165) and the 
pVL series (Lucklow and Summers, 1989. Virology 170: 31-39). 

In yet another embodiment, a nucleic acid of the invention is expressed in 
mammalian cells using a mammalian expression vector. Examples of mammalian 
expression vectors include pCDM8 (Seed, 1987. Nature 329: 840) and pMT2PC 
(Kaufman, et al, 1987. EMBO J. 6: 187-195). When used in mammalian cells, the 
expression vector's control functions are often provided by viral regulatory elements. For 
example, commonly used promoters are derived from polyoma, adenovirus 2, 
cytomegalovirus, and simian virus 40. For other suitable expression systems for both 
prokaryotic and eukaryotic cells see, e.g., Chapters 16 and 1 7 of Sambrook, et al, 
Molecular Cloning: A Laboratory Manual. 2nd ed., Cold Spring Harbor 
Laboratory, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y., 1 989. 
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In another embodiment, the recombinant mammalian expression vector is capable 
of directing expression of the nucleic acid preferentially in a particular cell type {e.g., 
tissue-specific regulatory elements are used to express the nucleic acid). Tissue-specific 
regulatory elements are known in the art. Non-limiting examples of suitable tissue-specific 
5 promoters include the albumin promoter (liver-specific; Pinkert, et al., 1 987. Genes Dev. 1 : 
268-277), lymphoid-specific promoters (Calame and Eaton, 1988. Adv. Immunol. 43: 
235-275), in particular promoters of T cell receptors (Winoto and Baltimore, 1989. EMBO 
J. 8: 729-733) and immunoglobulins (Banerji, et al, 1983. Cell 33: 729-740; Queen and 
Baltimore, 1983. Cell 33: 741-748), neuron-specific promoters {e.g., the neurofilament 

10 promoter; Byrne and Ruddle, 1989. Proc. Natl. Acad. Sci. USA 86: 5473-5477), 

pancreas-specific promoters (Edlund, et al, 1985. Science 230: 912-916), and mammary 
gland-specific promoters {e.g., milk whey promoter; U.S. Pat. No. 4,873,316 and European 
Application Publication No. 264,166). Developmentally-regulated promoters are also 
encompassed, e.g., the murine hox promoters (Kessel and Gruss, 1990. Science 249: 

1 5 374-379) and the oc-fetoprotein promoter (Campes and Tilghman, 1 989. Genes Dev. 3: 
537-546). 

The invention further provides a recombinant expression vector comprising a DNA 
molecule of the invention cloned into the expression vector in an antisense orientation. 
That is, the DNA molecule is operatively-1 inked to a regulatory sequence in a manner that 

20 allows for expression (by transcription of the DNA molecule) of an RNA molecule that is 
antisense to NOVX mRNA. Regulatory sequences operatively linked to a nucleic acid 
cloned in the antisense orientation can be chosen that direct the continuous expression of 
the antisense RNA molecule in a variety of cell types, for instance viral promoters and/or 
enhancers, or regulatory sequences can be chosen that direct constitutive, tissue specific or 

25 cell type specific expression of antisense RNA. The antisense expression vector can be in 
the form of a recombinant plasmid, phagemid or attenuated virus in which antisense 
nucleic acids are produced under the control of a high efficiency regulatory region, the 
activity of which can be determined by the cell type into which the vector is introduced. 
For a discussion of the regulation of gene expression using antisense genes see, e.g., 

30 Weintraub, et al., "Antisense RNA as a molecular tool for genetic analysis," Reviews- 
Trends in Genetics, Vol. 1(1) 1986. 

Another aspect of the invention pertains to host cells into which a recombinant 
expression vector of the invention has been introduced. The terms "host cell" and 
"recombinant host cell" are used interchangeably herein. It is understood that such terms 
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refer not only to the particular subject cell but also to the progeny or potential progeny of 
such a cell. Because certain modifications may occur in succeeding generations due to 
either mutation or environmental influences, such progeny may not, in fact, be identical to 
the parent cell, but are still included within the scope of the term as used herein. 
5 A host cell can be any prokaryotic or eukaryotic cell. For example, NOVX protein 

can be expressed in bacterial cells such as E. coli, insect cells, yeast or mammalian cells 
(such as Chinese hamster ovary cells (CHO) or COS cells). Other suitable host cells are 
known to those skilled in the art. 

Vector DNA can be introduced into prokaryotic or eukaryotic cells via conventional 
H 10 transformation or transfection techniques. As used herein, the terms "transformation" and 

"transfection" are intended to refer to a variety of art-recognized techniques for introducing 
fy foreign nucleic acid (e.g., DNA) into a host cell, including calcium phosphate or calcium 

% chloride co-precipitation, DEAE-dextran-mediated transfection, lipofection, or 

O electroporation. Suitable methods for transforming or transfecting host cells can be found 

L 1 5 in Sambrook, et al. (MOLECULAR CLONING: A LABORATORY MANUAL. 2nd ed., Cold 

W Spring Harbor Laboratory, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, 

'TJ; N.Y., 1989), and other laboratory manuals. 

O For stable transfection of mammalian cells, it is known that, depending upon the 

expression vector and transfection technique used, only a small fraction of cells may 

20 integrate the foreign DNA into their genome. In order to identify and select these 
integrants, a gene that encodes a selectable marker (e.g., resistance to antibiotics) is 
generally introduced into the host cells along with the gene of interest. Various selectable 
markers include those that confer resistance to drugs, such as G41 8, hygromycin and 
methotrexate. Nucleic acid encoding a selectable marker can be introduced into a host cell 

25 on the same vector as that encoding NOVX or can be introduced on a separate vector. 

Cells stably transfected with the introduced nucleic acid can be identified by drug selection 
(e.g., cells that have incorporated the selectable marker gene will survive, while the other 
cells die). 

A host cell of the invention, such as a prokaryotic or eukaryotic host cell in culture, 
30 can be used to produce (i.e., express) NOVX protein. Accordingly, the invention further 
provides methods for producing NOVX protein using the host cells of the invention. In 
one embodiment, the method comprises culturing the host cell of invention (into which a 
recombinant expression vector encoding NOVX protein has been introduced) in a suitable 
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medium such that NOVX protein is produced. In another embodiment, the method further 
comprises isolating NOVX protein from the medium or the host cell. 

Transgenic NOVX Animals 

5 

The host cells of the invention can also be used to produce non-human transgenic 
animals. For example, in one embodiment, a host cell of the invention is a fertilized oocyte 
or an embryonic stem cell into which NOVX protein-coding sequences have been 
introduced. Such host cells can then be used to create non-human transgenic animals in 

1 0 which exogenous NOVX sequences have been introduced into their genome or 

homologous recombinant animals in which endogenous NOVX sequences have been 
altered. Such animals are useful for studying the function and/or activity of NOVX protein 
and for identifying and/or evaluating modulators of NOVX protein activity. As used 
herein, a "transgenic animal" is a non-human animal, preferably a mammal, more 

1 5 preferably a rodent such as a rat or mouse, in which one or more of the cells of the animal 
includes a transgene. Other examples of transgenic animals include non-human primates, 
sheep, dogs, cows, goats, chickens, amphibians, etc. A transgene is exogenous DNA that is 
integrated into the genome of a cell from which a transgenic animal develops and that 
remains in the genome of the mature animal, thereby directing the expression of an 

20 encoded gene product in one or more cell types or tissues of the transgenic animal. As 
used herein, a "homologous recombinant animal" is a non-human animal, preferably a 
mammal, more preferably a mouse, in which an endogenous NOVX gene has been altered 
by homologous recombination between the endogenous gene and an exogenous DNA 
molecule introduced into a cell of the animal, e.g., an embryonic cell of the animal, prior to 

25 development of the animal. 

A transgenic animal of the invention can be created by introducing 
NOVX-encoding nucleic acid into the male pronuclei of a fertilized oocyte (e.g., by 
microinjection, retroviral infection) and allowing the oocyte to develop in a 
pseudopregnant female foster animal. The human NOVX cDNA sequences SEQ ID NO: 

30 2n-l, wherein n is an integer between 1 and 1 78 can be introduced as a transgene into the 
genome of a non-human animal. Alternatively, a non-human homologue of the human 
NOVX gene, such as a mouse NOVX gene, can be isolated based on hybridization to the 
human NOVX cDNA (described further supra) and used as a transgene. Intronic 
sequences and polyadenylation signals can also be included in the transgene to increase the 

35 efficiency of expression of the transgene. A tissue-specific regulatory sequence(s) can be 
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operably-linked to the NOVX transgene to direct expression of NOVX protein to particular 
cells. Methods for generating transgenic animals via embryo manipulation and 
microinjection, particularly animals such as mice, have become conventional in the art and 
are described, for example, in U.S. Patent Nos. 4,736,866; 4,870,009; and 4,873,191; and 
Hogan, 1 986. In: Manipulating the Mouse Embryo, Cold Spring Harbor Laboratory 
Press, Cold Spring Harbor, N.Y. Similar methods are used for production of other 
transgenic animals. A transgenic founder animal can be identified based upon the presence 
of the NOVX transgene in its genome and/or expression of NOVX mRNA in tissues or 
cells of the animals. A transgenic founder animal can then be used to breed additional 
animals carrying the transgene. Moreover, transgenic animals carrying a transgene- 
encoding NOVX protein can further be bred to other transgenic animals carrying other 
transgenes. 

To create a homologous recombinant animal, a vector is prepared which contains at 
least a portion of an NOVX gene into which a deletion, addition or substitution has been 
introduced to thereby alter, e.g., functionally disrupt, the NOVX gene. The NOVX gene 
can be a human gene (e.g., the cDNA of SEQ ID NO: 2n-l , wherein n is an integer 
between 1 and 178), but more preferably, is a non-human homologue of a human NOVX 
gene. For example, a mouse homologue of human NOVX gene of SEQ ID NO: 2n-l, 
wherein n is an integer between 1 and 178 can be used to construct a homologous 
recombination vector suitable for altering an endogenous NOVX gene in the mouse 
genome. In one embodiment, the vector is designed such that, upon homologous 
recombination, the endogenous NOVX gene is functionally disrupted (i.e., no longer 
encodes a functional protein; also referred to as a "knock out" vector). 

Alternatively, the vector can be designed such that, upon homologous 
recombination, the endogenous NOVX gene is mutated or otherwise altered but still 
encodes functional protein (e.g., the upstream regulatory region can be altered to thereby 
alter the expression of the endogenous NOVX protein). In the homologous recombination 
vector, the altered portion of the NOVX gene is flanked at its 5'- and 3'-termini by 
additional nucleic acid of the NOVX gene to allow for homologous recombination to occur 
between the exogenous NOVX gene carried by the vector and an endogenous NOVX gene 
in an embryonic stem cell. The additional flanking NOVX nucleic acid is of sufficient 
length for successful homologous recombination with the endogenous gene. Typically, 
several kilobases of flanking DNA (both at the 5'- and 3'-termini) are included in the 
vector. See, e.g., Thomas, et ah, 1987. Cell 51: 503 for a description of homologous 
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recombination vectors. The vector is ten introduced into an embryonic stem cell line (e.g., 
by electroporation) and cells in which the introduced NOVX gene has homologously- 
recombined with the endogenous NOVX gene are selected. See, e.g., Li, et ah, 1992. Cell 
69: 915. 

The selected cells are then injected into a blastocyst of an animal {e.g., a mouse) to 
form aggregation chimeras. See, e.g., Bradley, 1987. In: Teratocarcinomas and 
Embryonic Stem Cells: A Practical Approach, Robertson, ed. IRL, Oxford, pp. 
1 13-152. A chimeric embryo can then be implanted into a suitable pseudopregnant female 
foster animal and the embryo brought to term. Progeny harboring the homologously- 
recombined DNA in their germ cells can be used to breed animals in which all cells of the 
animal contain the homologously-recombined DNA by germline transmission of the 
transgene. Methods for constructing homologous recombination vectors and homologous 
recombinant animals are described further in Bradley, 1991. Curr. Opin. Biotechnol. 2: 
823-829; PCT International Publication Nos.: WO 90/1 1354; WO 91/01 140; WO 92/0968; 
and WO 93/04169. 

In another embodiment, transgenic non-humans animals can be produced that 
contain selected systems that allow for regulated expression of the transgene. One example 
of such a system is the cre/loxP recombinase system of bacteriophage PI. For a description 
of the cre/loxP recombinase system, See, e.g., Lakso, et ah, 1992. Proc. Natl. Acad. Sci. 
USA 89: 6232-6236. Another example of a recombinase system is the FLP recombinase 
system of Saccharomyces cerevisiae. See, O'Gorman, et ah, 1 991 . Science 25 1 :1 35 1-1 355. 
If a cre/loxP recombinase system is used to regulate expression of the transgene, animals 
containing transgenes encoding both the Cre recombinase and a selected protein are 
required. Such animals can be provided through the construction of "double" transgenic 
animals, e.g., by mating two transgenic animals, one containing a transgene encoding a 
selected protein and the other containing a transgene encoding a recombinase. 

Clones of the non-human transgenic animals described herein can also be produced 
according to the methods described in Wilmut, et ah, 1997. Nature 385: 810-813. In brief, 
a cell {e.g., a somatic cell) from the transgenic animal can be isolated and induced to exit 
the growth cycle and enter G 0 phase. The quiescent cell can then be fused, e.g., through the 
use of electrical pulses, to an enucleated oocyte from an animal of the same species from 
which the quiescent cell is isolated. The reconstructed oocyte is then cultured such that it 
develops to morula or blastocyte and then transferred to pseudopregnant female foster 
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animal. The offspring borne of this female foster animal will be a clone of the animal from 
which the cell (e.g., the somatic cell) is isolated. 



Pharmaceutical Compositions 

5 

The NOVX nucleic acid molecules, NOVX proteins, and anti-NOVX antibodies 
(also referred to herein as "active compounds") of the invention, and derivatives, 
fragments, analogs and homologs thereof, can be incorporated into pharmaceutical 
compositions suitable for administration. Such compositions typically comprise the 

10 nucleic acid molecule, protein, or antibody and a pharmaceutically acceptable carrier. As 
used herein, "pharmaceutically acceptable carrier" is intended to include any and all 
solvents, dispersion media, coatings, antibacterial and antifungal agents, isotonic and 
absorption delaying agents, and the like, compatible with pharmaceutical administration. 
Suitable carriers are described in the most recent edition of Remington's Pharmaceutical 

1 5 Sciences, a standard reference text in the field, which is incorporated herein by reference. 
Preferred examples of such carriers or diluents include, but are not limited to, water, saline, 
finger's solutions, dextrose solution, and 5% human serum albumin. Liposomes and non- 
aqueous vehicles such as fixed oils may also be used. The use of such media and agents 
for pharmaceutically active substances is well known in the art. Except insofar as any 

20 conventional media or agent is incompatible with the active compound, use thereof in the 
compositions is contemplated. Supplementary active compounds can also be incorporated 
into the compositions. 

A pharmaceutical composition of the invention is formulated to be compatible with 
its intended route of administration. Examples of routes of administration include 

25 parenteral, e.g., intravenous, intradermal, subcutaneous, oral (e.g., inhalation), transdermal 
(i.e., topical), transmucosal, and rectal administration. Solutions or suspensions used for 
parenteral, intradermal, or subcutaneous application can include the following components: 
a sterile diluent such as water for injection, saline solution, fixed oils, polyethylene glycols, 
glycerine, propylene glycol or other synthetic solvents; antibacterial agents such as berayl 

30 alcohol or methyl parabens; antioxidants such as ascorbic acid or sodium bisulfite; 

chelating agents such as ethyl enediaminetetraacetic acid (EDTA); buffers such as acetates, 
citrates or phosphates, and agents for the adjustment of tonicity such as sodium chloride or 
dextrose. The pH can be adjusted with acids or bases, such as hydrochloric acid or sodium 
hydroxide. The parenteral preparation can be enclosed in ampoules, disposable syringes or 

35 multiple dose vials made of glass or plastic. 
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Pharmaceutical compositions suitable for injectable use include sterile aqueous 
solutions (where water soluble) or dispersions and sterile powders for the extemporaneous 
preparation of sterile injectable solutions or dispersion. For intravenous administration, 
suitable carriers include physiological saline, bacteriostatic water, Cremophor EL™ (BASF, 
5 Parsippany, N.J.) or phosphate buffered saline (PBS). In all cases, the composition must be 
sterile and should be fluid to the extent that easy syringeability exists. It must be stable 
under the conditions of manufacture and storage and must be preserved against the 
contaminating action of microorganisms such as bacteria and fungi. The carrier can be a 
^ solvent or dispersion medium containing, for example, water, ethanol, polyol (for example, 

O 10 glycerol, propylene glycol, and liquid polyethylene glycol, and the like), and suitable 

yfi mixtures thereof. The proper fluidity can be maintained, for example, by the use of a 

coating such as lecithin, by the maintenance of the required particle size in the case of 
dispersion and by the use of surfactants. Prevention of the action of microorganisms can be 
achieved by various antibacterial and antifungal agents, for example, parabens, 
O 15 chlorobutanol, phenol, ascorbic acid, thimerosal, and the like. In many cases, it will be 

2[ preferable to include isotonic agents, for example, sugars, polyalcohols such as manitol, 

% sorbitol, sodium chloride in the composition. Prolonged absorption of the injectable 

compositions can be brought about by including in the composition an agent which delays 
absorption, for example, aluminum monostearate and gelatin. 
20 Sterile injectable solutions can be prepared by incorporating the active compound 

(e.g., an NOVX protein or anti-NOVX antibody) in the required amount in an appropriate 
solvent with one or a combination of ingredients enumerated above, as required, followed 
by filtered sterilization. Generally, dispersions are prepared by incorporating the active 
compound into a sterile vehicle that contains a basic dispersion medium and the required 
25 other ingredients from those enumerated above. In the case of sterile powders for the 
preparation of sterile injectable solutions, methods of preparation are vacuum drying and 
freeze-drying that yields a powder of the active ingredient plus any additional desired 
ingredient from a previously sterile-filtered solution thereof. 

Oral compositions generally include an inert diluent or an edible carrier. They can 
30 be enclosed in gelatin capsules or compressed into tablets. For the purpose of oral 

therapeutic administration, the active compound can be incorporated with excipients and 
used in the form of tablets, troches, or capsules. Oral compositions can also be prepared 
using a fluid carrier for use as a mouthwash, wherein the compound in the fluid carrier is 
applied orally and swished and expectorated or swallowed. Pharmaceutically compatible 



binding agents, and/or adjuvant materials can be included as part of the composition. The 
tablets, pills, capsules, troches and the like can contain any of the following ingredients, or 
compounds of a similar nature: a binder such as microcrystalline cellulose, gum tragacanth 
or gelatin; an excipient such as starch or lactose, a disintegrating agent such as alginic acid, 
5 Primogel, or corn starch; a lubricant such as magnesium stearate or Sterotes; a glidant such 
as colloidal silicon dioxide; a sweetening agent such as sucrose or saccharin; or a flavoring 
agent such as peppermint, methyl salicylate, or orange flavoring. 

For administration by inhalation, the compounds are delivered in the form of an 
aerosol spray from pressured container or dispenser which contains a suitable propellant, 

10 e.g. , a gas such as carbon dioxide, or a nebulizer. 

Systemic administration can also be by transmucosal or transdermal means. For 
transmucosal or transdermal administration, penetrants appropriate to the barrier to be 
permeated are used in the formulation. Such penetrants are generally known in the art, and 
include, for example, for transmucosal administration, detergents, bile salts, and fusidic 

1 5 acid derivatives. Transmucosal administration can be accomplished through the use of 
nasal sprays or suppositories. For transdermal administration, the active compounds are 
formulated into ointments, salves, gels, or creams as generally known in the art. 

The compounds can also be prepared in the form of suppositories (e.g., with 
conventional suppository bases such as cocoa butter and other glycerides) or retention 

20 enemas for rectal delivery. 

In one embodiment, the active compounds are prepared with carriers that will 
protect the compound against rapid elimination from the body, such as a controlled release 
formulation, including implants and microencapsulated delivery systems. Biodegradable, 
biocompatible polymers can be used, such as ethylene vinyl acetate, polyanhydrides, 

25 polyglycolic acid, collagen, polyorthoesters, and polylactic acid. Methods for preparation 
of such formulations will be apparent to those skilled in the art. The materials can also be 
obtained commercially from Alza Corporation and Nova Pharmaceuticals, Inc. Liposomal 
suspensions (including liposomes targeted to infected cells with monoclonal antibodies to 
viral antigens) can also be used as pharmaceutically acceptable carriers. These can be 

30 prepared according to methods known to those skilled in the art, for example, as described 
in U.S. Patent No. 4,522,81 1 . 

It is especially advantageous to formulate oral or parenteral compositions in dosage 
unit form for ease of administration and uniformity of dosage. Dosage unit form as used 
herein refers to physically discrete units suited as unitary dosages for the subject to be 
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treated; each unit containing a predetermined quantity of active compound calculated to 
produce the desired therapeutic effect in association with the required pharmaceutical 
carrier. The specification for the dosage unit forms of the invention are dictated by and 
directly dependent on the unique characteristics of the active compound and the particular 
5 therapeutic effect to be achieved, and the limitations inherent in the art of compounding 
such an active compound for the treatment of individuals. 

The nucleic acid molecules of the invention can be inserted into vectors and used as 
gene therapy vectors. Gene therapy vectors can be delivered to a subject by, for example, 
intravenous injection, local administration (see, e.g., U.S. Patent No. 5,328,470) or by 

10 stereotactic injection (see, e.g., Chen, et al, 1994. Proc. Natl. Acad. Set USA 91 : 

3054-3057). The pharmaceutical preparation of the gene therapy vector can include the 
gene therapy vector in an acceptable diluent, or can comprise a slow release matrix in 
which the gene delivery vehicle is imbedded. Alternatively, where the complete gene 
delivery vector can be produced intact from recombinant cells, e.g., retroviral vectors, the 

1 5 pharmaceutical preparation can include one or more cells that produce the gene delivery 
system. 

The pharmaceutical compositions can be included in a container, pack, or dispenser 
together with instructions for administration. 

20 Screening and Detection Methods 

The isolated nucleic acid molecules of the invention can be used to express NOVX 
protein (e.g., via a recombinant expression vector in a host cell in gene therapy 
applications), to detect NOVX mRNA (e.g., in a biological sample) or a genetic lesion in 

25 an NOVX gene, and to modulate NOVX activity, as described further, below. In addition, 
the NOVX proteins can be used to screen drugs or compounds that modulate the NOVX 
protein activity or expression as well as to treat disorders characterized by insufficient or 
excessive production of NOVX protein or production of NOVX protein forms that have 
decreased or aberrant activity compared to NOVX wild-type protein (e.g.; diabetes 

30 (regulates insulin release); obesity (binds and transport lipids); metabolic disturbances 
associated with obesity, the metabolic syndrome X as well as anorexia and wasting 
disorders associated with chronic diseases and various cancers, and infectious 
disease(possesses anti-microbial activity) and the various dyslipidemias. In addition, the 
anti-NOVX antibodies of the invention can be used to detect and isolate NOVX proteins 

35 and modulate NOVX activity. In yet a further aspect, the invention can be used in methods 
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to influence appetite, absorption of nutrients and the disposition of metabolic substrates in 
both a positive and negative fashion. 

The invention further pertains to novel agents identified by the screening assays 
described herein and uses thereof for treatments as described, supra. 

Screening Assays 

The invention provides a method (also referred to herein as a "screening assay") for 
identifying modulators, i.e., candidate or test compounds or agents {e.g., peptides, 
peptidomimetics, small molecules or other drugs) that bind to NOVX proteins or have a 
stimulatory or inhibitory effect on, e.g., NOVX protein expression or NOVX protein 
activity. The invention also includes compounds identified in the screening assays 
described herein. 

In one embodiment, the invention provides assays for screening candidate or test 
compounds which bind to or modulate the activity of the membrane-bound form of an 
NOVX protein or polypeptide or biologically-active portion thereof. The test compounds 
of the invention can be obtained using any of the numerous approaches in combinatorial 
library methods known in the art, including: biological libraries; spatially addressable 
parallel solid phase or solution phase libraries; synthetic library methods requiring 
deconvolution; the "one-bead one-compound" library method; and synthetic library 
methods using affinity chromatography selection. The biological library approach is 
limited to peptide libraries, while the other four approaches are applicable to peptide, 
non-peptide oligomer or small molecule libraries of compounds. See, e.g., Lam, 1997. 
Anticancer Drug Design 12: 1 45 . 

A "small molecule" as used herein, is meant to refer to a composition that has a 
molecular weight of less than about 5 kD and most preferably less than about 4 kD. Small 
molecules can be, e.g., nucleic acids, peptides, polypeptides, peptidomimetics, 
carbohydrates, lipids or other organic or inorganic molecules. Libraries of chemical and/or 
biological mixtures, such as fungal, bacterial, or algal extracts, are known in the art and can 
be screened with any of the assays of the invention. 

Examples of methods for the synthesis of molecular libraries can be found in the 
art, for example in: DeWitt, et ah, 1993. Proc. Natl. Acad. Sci. U.S.A. 90: 6909; Erb, et ah, 
1994. Proc. Natl. Acad. Sci. U.S.A. 91 : 1 1422; Zuckermann, et al, 1994. J. Med. Chem. 37: 
2678; Cho,etal., 1 993. Science 261: 1303; Carrell, et al., \99A.Angew. Chem. Int. Ed. 
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Engl. 33: 2059; Carell, et al, 1994. Angew. Chem. Int. Ed. Engl. 33: 2061; and Gallop, et 
al, 1994. J. Med. Chem. 37: 1233. 

Libraries of compounds may be presented in solution {e.g., Houghten, 1992. 
Biotechniques 13: 412-421), or on beads (Lam, 1991. Nature 354: 82-84), on chips (Fodor, 
5 1 993. Nature 364: 555-556), bacteria (Ladner, U.S. Patent No. 5,223,409), spores (Ladner, 
U.S. Patent 5,233,409), plasmids (Cull, et al, 1992. Proc. Natl. Acad. Sci. USA 89: 
1865-1869) or on phage (Scott and Smith, 1990. Science 249: 386-390; Devlin, 1990. 
Science 249: 404-406; Cwirla, et al, 1990. Proc. Natl. Acad. Sci. U.S.A. 87: 6378-6382; 
Felici, 1991. J. Mol. Biol. 222: 301-310; Ladner, U.S. Patent No. 5,233,409.). 

10 In one embodiment, an assay is a cell-based assay in which a cell which expresses a 

membrane-bound form of NOVX protein, or a biologically-active portion thereof, on the 
cell surface is contacted with a test compound and the ability of the test compound to bind 
to an NOVX protein determined. The cell, for example, can of mammalian origin or a 
yeast cell. Determining the ability of the test compound to bind to the NOVX protein can 

15 be accomplished, for example, by coupling the test compound with a radioisotope or 
enzymatic label such that binding of the test compound to the NOVX protein or 
biologically-active portion thereof can be determined by detecting the labeled compound in 
a complex. For example, test compounds can be labeled with ,25 1, 35 S, I4 C, or 3 H, either 
directly or indirectly, and the radioisotope detected by direct counting of radioemission or 

20 by scintillation counting. Alternatively, test compounds can be enzymatically-labeled with, 
for example, horseradish peroxidase, alkaline phosphatase, or Iuciferase, and the enzymatic 
label detected by determination of conversion of an appropriate substrate to product. In 
one embodiment, the assay comprises contacting a cell which expresses a membrane-bound 
form of NOVX protein, or a biologically-active portion thereof, on the cell surface with a 

25 known compound which binds NOVX to form an assay mixture, contacting the assay 

mixture with a test compound, and determining the ability of the test compound to interact 
with an NOVX protein, wherein determining the ability of the test compound to interact 
with an NOVX protein comprises determining the ability of the test compound to 
preferentially bind to NOVX protein or a biologically-active portion thereof as compared to 

30 the known compound. 

In another embodiment, an assay is a cell-based assay comprising contacting a cell 
expressing a membrane-bound form of NOVX protein, or a biologically-active portion 
thereof, on the cell surface with a test compound and determining the ability of the test 
compound to modulate {e.g., stimulate or inhibit) the activity of the NOVX protein or 
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biologically-active portion thereof. Determining the ability of the test compound to 
modulate the activity of NOVX or a biologically-active portion thereof can be 
accomplished, for example, by determining the ability of the NOVX protein to bind to or 
interact with an NOVX target molecule. As used herein, a "target molecule" is a molecule 
with which an NOVX protein binds or interacts in nature, for example, a molecule on the 
surface of a cell which expresses an NOVX interacting protein, a molecule on the surface 
of a second cell, a molecule in the extracellular milieu, a molecule associated with the 
internal surface of a cell membrane or a cytoplasmic molecule. An NOVX target molecule 
can be a non-NOVX molecule or an NOVX protein or polypeptide of the invention. In one 
embodiment, an NOVX target molecule is a component of a signal transduction pathway 
that facilitates transduction of an extracellular signal (e.g. a signal generated by binding of 
a compound to a membrane-bound NOVX molecule) through the cell membrane and into 
the cell. The target, for example, can be a second intercellular protein that has catalytic 
activity or a protein that facilitates the association of downstream signaling molecules with 
NOVX. 

Determining the ability of the NOVX protein to bind to or interact with an NOVX 
target molecule can be accomplished by one of the methods described above for 
determining direct binding. In one embodiment, determining the ability of the NOVX 
protein to bind to or interact with an NOVX target molecule can be accomplished by 
determining the activity of the target molecule. For example, the activity of the target 
molecule can be determined by detecting induction of a cellular second messenger of the 
target (i.e. intracellular Ca 2+ , diacylglycerol, IP 3 , etc.), detecting catalytic/enzymatic 
activity of the target an appropriate substrate, detecting the induction of a reporter gene 
(comprising an NOVX-responsive regulatory element operatively linked to a nucleic acid 
encoding a detectable marker, e.g., luciferase), or detecting a cellular response, for 
example, cell survival, cellular differentiation, or cell proliferation. 

In yet another embodiment, an assay of the invention is a cell-free assay comprising 
contacting an NOVX protein or biologically-active portion thereof with a test compound 
and determining the ability of the test compound to bind to the NOVX protein or 
biologically-active portion thereof. Binding of the test compound to the NOVX protein can 
be determined either directly or indirectly as described above. In one such embodiment, 
the assay comprises contacting the NOVX protein or biologically-active portion thereof 
with a known compound which binds NOVX to form an assay mixture, contacting the 
assay mixture with a test compound, and determining the ability of the test compound to 
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interact with an NOVX protein, wherein determining the ability of the test compound to 
interact with an NOVX protein comprises determining the ability of the test compound to 
preferentially bind to NOVX or biologically-active portion thereof as compared to the 
known compound. 

In still another embodiment, an assay is a cell-free assay comprising contacting 
NOVX protein or biologically-active portion thereof with a test compound and determining 
the ability of the test compound to modulate (e.g. stimulate or inhibit) the activity of the 
NOVX protein or biologically-active portion thereof. Determining the ability of the test 
compound to modulate the activity of NOVX can be accomplished, for example, by 
determining the ability of the NOVX protein to bind to an NOVX target molecule by one 
of the methods described above for determining direct binding. In an alternative 
embodiment, determining the ability of the test compound to modulate the activity of 
NOVX protein can be accomplished by determining the ability of the NOVX protein 
further modulate an NOVX target molecule. For example, the catalytic/enzymatic activity 
of the target molecule on an appropriate substrate can be determined as described, supra. 

In yet another embodiment, the cell-free assay comprises contacting the NOVX 
protein or biologically-active portion thereof with a known compound which binds NOVX 
protein to form an assay mixture, contacting the assay mixture with a test compound, and 
determining the ability of the test compound to interact with an NOVX protein, wherein 
determining the ability of the test compound to interact with an NOVX protein comprises 
determining the ability of the NOVX protein to preferentially bind to or modulate the 
activity of an NOVX target molecule. 

The cell-free assays of the invention are amenable to use of both the soluble form or the 
membrane-bound form of NOVX protein. In the case of cell-free assays comprising the 
membrane-bound form of NOVX protein, it may be desirable to utilize a solubilizing agent 
such that the membrane-bound form of NOVX protein is maintained in solution. Examples 
of such solubilizing agents include non-ionic detergents such as n-octylglucoside, 
n-dodecylglucoside, n-dodecylmaltoside, octanoyl-N-methylglucamide, 
decanoyl-N-methylglucamide, Triton® X-100, Triton® X-l 14, Thesit®, 
Isotridecypoly(ethylene glycol ether) n , N-dodecyl--N,N-dimethyl-3-ammonio-l-propane 
sulfonate, 3-(3-cholamidopropyl) dimethylamminiol-1 -propane sulfonate (CHAPS), or 
3-(3-cholamidopropyl)dimethylamminiol-2-hydroxy-l -propane sulfonate (CHAPSO). 

In more than one embodiment of the above assay methods of the invention, it may 
be desirable to immobilize either NOVX protein or its target molecule to facilitate 
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separation of complexed from uncomplexed forms of one or both of the proteins, as well as 
to accommodate automation of the assay. Binding of a test compound to NOVX protein, 
or interaction of NOVX protein with a target molecule in the presence and absence of a 
candidate compound, can be accomplished in any vessel suitable for containing the 
5 reactants. Examples of such vessels include microtiter plates, test tubes, and 

micro-centrifuge tubes. In one embodiment, a fusion protein can be provided that adds a 
domain that allows one or both of the proteins to be bound to a matrix. For example, GST- 
NOVX fusion proteins or GST-target fusion proteins can be adsorbed onto glutathione 
sepharose beads (Sigma Chemical, St. Louis, MO) or glutathione derivatized microtiter 

1 0 plates, that are then combined with the test compound or the test compound and either the 
non-adsorbed target protein or NOVX protein, and the mixture is incubated under 
conditions conducive to complex formation {e.g., at physiological conditions for salt and 
pH). Following incubation, the beads or microtiter plate wells are washed to remove any 
unbound components, the matrix immobilized in the case of beads, complex determined 

1 5 either directly or indirectly, for example, as described, supra. Alternatively, the complexes 
can be dissociated from the matrix, and the level of NOVX protein binding or activity 
determined using standard techniques. 

Other techniques for immobilizing proteins on matrices can also be used in the 
screening assays of the invention. For example, either the NOVX protein or its target 

20 molecule can be immobilized utilizing conjugation of biotin and streptavidin. Biotinylated 
NOVX protein or target molecules can be prepared from biotin-NHS 
(N-hydroxy-succinimide) using techniques well-known within the art {e.g., biotinylation 
kit, Pierce Chemicals, Rockford, 111.), and immobilized in the wells of streptavidin-coated 
96 well plates (Pierce Chemical). Alternatively, antibodies reactive with NOVX protein or 

25 target molecules, but which do not interfere with binding of the NOVX protein to its target 
molecule, can be derivatized to the wells of the plate, and unbound target or NOVX protein 
trapped in the wells by antibody conjugation. Methods for detecting such complexes, in 
addition to those described above for the GST-immobilized complexes, include 
immunodetection of complexes using antibodies reactive with the NOVX protein or target 

30 molecule, as well as enzyme-linked assays that rely on detecting an enzymatic activity 
associated with the NOVX protein or target molecule. 

In another embodiment, modulators of NOVX protein expression are identified in a 
method wherein a cell is contacted with a candidate compound and the expression of 
NOVX mRNA or protein in the cell is determined. The level of expression of NOVX 



mRNA or protein in the presence of the candidate compound is compared to the level of 
expression of NOVX mRNA or protein in the absence of the candidate compound. The 
candidate compound can then be identified as a modulator of NOVX mRNA or protein 
expression based upon this comparison. For example, when expression of NOVX mRNA 
5 or protein is greater (i.e., statistically significantly greater) in the presence of the candidate 
compound than in its absence, the candidate compound is identified as a stimulator of 
NOVX mRNA or protein expression. Alternatively, when expression of NOVX mRNA or 
protein is less (statistically significantly less) in the presence of the candidate compound 
than in its absence, the candidate compound is identified as an inhibitor of NOVX mRNA 

10 or protein expression. The level of NOVX mRNA or protein expression in the cells can be 
determined by methods described herein for detecting NOVX mRNA or protein. 

In yet another aspect of the invention, the NOVX proteins can be used as "bait 
proteins" in a two-hybrid assay or three hybrid assay (see, e.g., U.S. Patent No. 5,283,317; 
Zervos, et al., 1993. Cell 72: 223-232; Madura, et al, 1993. J. Biol. Chem. 268: 

1 5 1 2046-1 2054; Battel, et al, 1 993. Biotechniques 14: 920-924; Iwabuchi, et al, 1 993 . 

Oncogene 8: 1693-1696; and Brent WO 94/10300), to identify other proteins that bind to or 
interact with NOVX ("NOVX-binding proteins" or "NOVX-bp") and modulate NOVX 
activity. Such NOVX-binding proteins are also likely to be involved in the propagation of 
signals by the NOVX proteins as, for example, upstream or downstream elements of the 

20 NOVX pathway. 

The two-hybrid system is based on the modular nature of most transcription factors, 
which consist of separable DNA-binding and activation domains. Briefly, the assay 
utilizes two different DNA constructs. In one construct, the gene that codes for NOVX is 
fused to a gene encoding the DNA binding domain of a known transcription factor (e.g., 

25 GAL-4). In the other construct, a DNA sequence, from a library of DNA sequences, that 
encodes an unidentified protein ("prey" or "sample") is fused to a gene that codes for the 
activation domain of the known transcription factor. If the "bait" and the "prey" proteins 
are able to interact, in vivo, forming an NOVX-dependent complex, the DNA-binding and 
activation domains of the transcription factor are brought into close proximity. This 

30 proximity allows transcription of a reporter gene (e.g., LacZ) that is operably linked to a 
transcriptional regulatory site responsive to the transcription factor. Expression of the 
reporter gene can be detected and cell colonies containing the functional transcription 
factor can be isolated and used to obtain the cloned gene that encodes the protein which 
interacts with NOVX. 



The invention further pertains to novel agents identified by the aforementioned 
screening assays and uses thereof for treatments as described herein. 

Detection Assays 

5 

Portions or fragments of the cDNA sequences identified herein (and the 
corresponding complete gene sequences) can be used in numerous ways as polynucleotide 
reagents. By way of example, and not of limitation, these sequences can be used to: (/') 
map their respective genes on a chromosome; and, thus, locate gene regions associated 
10 with genetic disease; (//") identify an individual from a minute biological sample (tissue 
typing); and (Hi) aid in forensic identification of a biological sample. Some of these 
applications are described in the subsections, below. 

Chromosome Mapping 

1 5 Once the sequence (or a portion of the sequence) of a gene has been isolated, this 

sequence can be used to map the location of the gene on a chromosome. This process is 
called chromosome mapping. Accordingly, portions or fragments of the NOVX sequences, 
SEQ ID NO: 2n-l, wherein n is an integer between 1 and 178, or fragments or derivatives 
thereof, can be used to map the location of the NOVX genes, respectively, on a 

20 chromosome. The mapping of the NOVX sequences to chromosomes is an important first 
step in correlating these sequences with genes associated with disease. 

Briefly, NOVX genes can be mapped to chromosomes by preparing PCR primers 
(preferably 15-25 bp in length) from the NOVX sequences. Computer analysis of the 
NOVX, sequences can be used to rapidly select primers that do not span more than one 

25 exon in the genomic DNA, thus complicating the amplification process. These primers can 
then be used for PCR screening of somatic cell hybrids containing individual human 
chromosomes. Only those hybrids containing the human gene corresponding to the NOVX 
sequences will yield an amplified fragment. 

Somatic cell hybrids are prepared by fusing somatic cells from different mammals 

30 (e.g., human and mouse cells). As hybrids of human and mouse cells grow and divide, 
they gradually lose human chromosomes in random order, but retain the mouse 
chromosomes. By using media in which mouse cells cannot grow, because they lack a 
particular enzyme, but in which human cells can, the one human chromosome that contains 
the gene encoding the needed enzyme will be retained. By using various media, panels of 

35 hybrid cell lines can be established. Each cell line in a panel contains either a single human 
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chromosome or a small number of human chromosomes, and a full set of mouse 
chromosomes, allowing easy mapping of individual genes to specific human chromosomes. 
See, e.g., D'Eustachio, et ah, 1983. Science 220: 919-924. Somatic cell hybrids containing 
only fragments of human chromosomes can also be produced by using human 
5 chromosomes with translocations and deletions. 

PCR mapping of somatic cell hybrids is a rapid procedure for assigning a particular 
sequence to a particular chromosome. Three or more sequences can be assigned per day 
using a single thermal cycler. Using the NOVX sequences to design oligonucleotide 
primers, sub-localization can be achieved with panels of fragments from specific 
1 0 chromosomes. 

Fluorescence in situ hybridization (FISH) of a DNA sequence to a metaphase chromosomal 
spread can further be used to provide a precise chromosomal location in one step. 
Chromosome spreads can be made using cells whose division has been blocked in 
metaphase by a chemical like colcemid that disrupts the mitotic spindle. The chromosomes 

1 5 can be treated briefly with trypsin, and then stained with Giemsa. A pattern of light and 
dark bands develops on each chromosome, so that the chromosomes can be identified 
individually. The FISH technique can be used with a DNA sequence as short as 500 or 600 
bases. However, clones larger than 1,000 bases have a higher likelihood of binding to a 
unique chromosomal location with sufficient signal intensity for simple detection. 

20 Preferably 1,000 bases, and more preferably 2,000 bases, will suffice to get good results at 
a reasonable amount of time. For a review of this technique, see, Verma, et al., HUMAN 
Chromosomes: A Manual of Basic Techniques (Pergamon Press, New York 1 988). 

Reagents for chromosome mapping can be used individually to mark a single 
chromosome or a single site on that chromosome, or panels of reagents can be used for 

25 marking multiple sites and/or multiple chromosomes. Reagents corresponding to 

noncoding regions of the genes actually are preferred for mapping purposes. Coding 
sequences are more likely to be conserved within gene families, thus increasing the chance 
of cross hybridizations during chromosomal mapping. 

Once a sequence has been mapped to a precise chromosomal location, the physical 

30 position of the sequence on the chromosome can be correlated with genetic map data. Such 
data are found, e.g., in McKusick, MENDELIAN INHERITANCE IN Man, available on-line 
through Johns Hopkins University Welch Medical Library). The relationship between 
genes and disease, mapped to the same chromosomal region, can then be identified through 
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linkage analysis (co-inheritance of physically adjacent genes), described in, e.g., Egeland, 
et al., 1987. Nature, 325: 783-787. 

Moreover, differences in the DNA sequences between individuals affected and 
unaffected with a disease associated with the NOVX gene, can be determined. If a 
5 mutation is observed in some or all of the affected individuals but not in any unaffected 
individuals, then the mutation is likely to be the causative agent of the particular disease. 
Comparison of affected and unaffected individuals generally involves first looking for 
structural alterations in the chromosomes, such as deletions or translocations that are 
visible from chromosome spreads or detectable using PCR based on that DNA sequence. 
1 0 Ultimately, complete sequencing of genes from several individuals can be performed to 
confirm the presence of a mutation and to distinguish mutations from polymorphisms. 

Tissue Typing 

The NOVX sequences of the invention can also be used to identify individuals from 
15 minute biological samples. In this technique, an individual's genomic DNA is digested 

with one or more restriction enzymes, and probed on a Southern blot to yield unique bands 
for identification. The sequences of the invention are useful as additional DNA markers 
for RFLP ("restriction fragment length polymorphisms," described in U.S. Patent No. 
5,272,057). 

20 Furthermore, the sequences of the invention can be used to provide an alternative technique 
that determines the actual base-by-base DNA sequence of selected portions of an 
individual's genome. Thus, the NOVX sequences described herein can be used to prepare 
two PCR primers from the 5'- and 3'-termini of the sequences. These primers can then be 
used to amplify an individual's DNA and subsequently sequence it. 

25 Panels of corresponding DNA sequences from individuals, prepared in this manner, 

can provide unique individual identifications, as each individual will have a unique set of 
such DNA sequences due to allelic differences. The sequences of the invention can be 
used to obtain such identification sequences from individuals and from tissue. The NOVX 
sequences of the invention uniquely represent portions of the human genome. Allelic 

30 variation occurs to some degree in the coding regions of these sequences, and to a greater 
degree in the noncoding regions. It is estimated that allelic variation between individual 
humans occurs with a frequency of about once per each 500 bases. Much of the allelic 
variation is due to single nucleotide polymorphisms (SNPs), which include restriction 
fragment length polymorphisms (RFLPs). 
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Each of the sequences described herein can, to some degree, be used as a standard against 
which DNA from an individual can be compared for identification purposes. Because 
greater numbers of polymorphisms occur in the noncoding regions, fewer sequences are 
necessary to differentiate individuals. The noncoding sequences can comfortably provide 
5 positive individual identification with a panel of perhaps 1 0 to 1 ,000 primers that each 

yield a noncoding amplified sequence of 100 bases. If predicted coding sequences, such as 
those in SEQ ID NO: 2n-l, wherein n is an integer between 1 and 178 are used, a more 
appropriate number of primers for positive individual identification would be 500-2,000. 

10 Predictive Medicine 

The invention also pertains to the field of predictive medicine in which diagnostic 
assays, prognostic assays, pharmacogenomics, and monitoring clinical trials are used for 
prognostic (predictive) purposes to thereby treat an individual prophylactically. 
Accordingly, one aspect of the invention relates to diagnostic assays for determining 

1 5 NOVX protein and/or nucleic acid expression as well as NOVX activity, in the context of a 
biological sample (e.g., blood, serum, cells, tissue) to thereby determine whether an 
individual is afflicted with a disease or disorder, or is at risk of developing a disorder, 
associated with aberrant NOVX expression or activity. The disorders include metabolic 
disorders, diabetes, obesity, infectious disease, anorexia, cancer-associated cachexia, 

20 cancer, neurodegenerative disorders, Alzheimer's Disease, Parkinson's Disorder, immune 
disorders, and hematopoietic disorders, and the various dyslipidemias, metabolic 
disturbances associated with obesity, the metabolic syndrome X and wasting disorders 
associated with chronic diseases and various cancers. The invention also provides for 
prognostic (or predictive) assays for determining whether an individual is at risk of 

25 developing a disorder associated with NOVX protein, nucleic acid expression or activity. 
For example, mutations in an NOVX gene can be assayed in a biological sample. Such 
assays can be used for prognostic or predictive purpose to thereby prophylactically treat an 
individual prior to the onset of a disorder characterized by or associated with NOVX 
protein, nucleic acid expression, or biological activity. 

30 Another aspect of the invention provides methods for determining NOVX protein, 

nucleic acid expression or activity in an individual to thereby select appropriate therapeutic 
or prophylactic agents for that individual (referred to herein as "pharmacogenomics"). 
Pharmacogenomics allows for the selection of agents (e.g., drugs) for therapeutic or 
prophylactic treatment of an individual based on the genotype of the individual (e.g., the 
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genotype of the individual examined to determine the ability of the individual to respond to 
a particular agent.) 

Yet another aspect of the invention pertains to monitoring the influence of agents (e.g., 
drugs, compounds) on the expression or activity of NOVX in clinical trials. 

These and other agents are described in further detail in the following sections. 

Diagnostic Assays 

An exemplary method for detecting the presence or absence of NOVX in a 
biological sample involves obtaining a biological sample from a test subject and contacting 
the biological sample with a compound or an agent capable of detecting NOVX protein or 
nucleic acid (e.g., mRNA, genomic DNA) that encodes NOVX protein such that the 
presence of NOVX is detected in the biological sample. An agent for detecting NOVX 
mRNA or genomic DNA is a labeled nucleic acid probe capable of hybridizing to NOVX 
mRNA or genomic DNA. The nucleic acid probe can be, for example, a full-length NOVX 
nucleic acid, such as the nucleic acid of SEQ ID NO: 2n-l, wherein n is an integer between 
1 and 178, or a portion thereof, such as an oligonucleotide of at least 15, 30, 50, 100, 250 
or 500 nucleotides in length and sufficient to specifically hybridize under stringent 
conditions to NOVX mRNA or genomic DNA. Other suitable probes for use in the 
diagnostic assays of the invention are described herein. 

An agent for detecting NOVX protein is an antibody capable of binding to NOVX 
protein, preferably an antibody with a detectable label. Antibodies can be polyclonal, or 
more preferably, monoclonal. An intact antibody, or a fragment thereof (e.g., Fab or 
F(ab') 2 ) can be used. The term "labeled", with regard to the probe or antibody, is intended 
to encompass direct labeling of the probe or antibody by coupling (i.e., physically linking) 
a detectable substance to the probe or antibody, as well as indirect labeling of the probe or 
antibody by reactivity with another reagent that is directly labeled. Examples of indirect 
labeling include detection of a primary antibody using a fluorescently-labeled secondary 
antibody and end-labeling of a DNA probe with biotin such that it can be detected with 
fluorescently-labeled streptavidin. The term "biological sample" is intended to include 
tissues, cells and biological fluids isolated from a subject, as well as tissues, cells and fluids 
present within a subject. That is, the detection method of the invention can be used to 
detect NOVX mRNA, protein, or genomic DNA in a biological sample in vitro as well as 
in vivo. For example, in vitro techniques for detection of NOVX mRNA include Northern 
hybridizations and in situ hybridizations. In vitro techniques for detection of NOVX 
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protein include enzyme linked immunosorbent assays (ELISAs), Western blots, 
immunoprecipitations, and immunofluorescence. In vitro techniques for detection of 
NOVX genomic DNA include Southern hybridizations. Furthermore, in vivo techniques 
for detection of NOVX protein include introducing into a subject a labeled anti-NOVX 
5 antibody. For example, the antibody can be labeled with a radioactive marker whose 
presence and location in a subject can be detected by standard imaging techniques. 

In one embodiment, the biological sample contains protein molecules from the test 
subject. Alternatively, the biological sample can contain mRNA molecules from the test 
subject or genomic DNA molecules from the test subject. A preferred biological sample is 

10 a peripheral blood leukocyte sample isolated by conventional means from a subject. 

In another embodiment, the methods further involve obtaining a control biological 
sample from a control subject, contacting the control sample with a compound or agent 
capable of detecting NOVX protein, mRNA, or genomic DNA, such that the presence of 
NOVX protein, mRNA or genomic DNA is detected in the biological sample, and 

15 comparing the presence of NOVX protein, mRNA or genomic DNA in the control sample 
with the presence of NOVX protein, mRNA or genomic DNA in the test sample. 

The invention also encompasses kits for detecting the presence of NOVX in a 
biological sample. For example, the kit can comprise: a labeled compound or agent 
capable of detecting NOVX protein or mRNA in a biological sample; means for 

20 determining the amount of NOVX in the sample; and means for comparing the amount of 
NOVX in the sample with a standard. The compound or agent can be packaged in a 
suitable container. The kit can further comprise instructions for using the kit to detect 
NOVX protein or nucleic acid. 

25 Prognostic Assays 

The diagnostic methods described herein can furthermore be utilized to identify 
subjects having or at risk of developing a disease or disorder associated with aberrant 
NOVX expression or activity. For example, the assays described herein, such as the 

30 preceding diagnostic assays or the following assays, can be utilized to identify a subject 
having or at risk of developing a disorder associated with NOVX protein, nucleic acid 
expression or activity. Alternatively, the prognostic assays can be utilized to identify a 
subject having or at risk for developing a disease or disorder. Thus, the invention provides 
a method for identifying a disease or disorder associated with aberrant NOVX expression 

35 or activity in which a test sample is obtained from a subject and NOVX protein or nucleic 
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acid (e.g., mRNA, genomic DNA) is detected, wherein the presence of NOVX protein or 
nucleic acid is diagnostic for a subject having or at risk of developing a disease or disorder 
associated with aberrant NOVX expression or activity. As used herein, a "test sample" 
refers to a biological sample obtained from a subject of interest. For example, a test sample 
5 can be a biological fluid (e.g., serum), cell sample, or tissue. 

Furthermore, the prognostic assays described herein can be used to determine 
whether a subject can be administered an agent (e.g., an agonist, antagonist, 
peptidomimetic, protein, peptide, nucleic acid, small molecule, or other drug candidate) to 
treat a disease or disorder associated with aberrant NOVX expression or activity. For 

1 0 example, such methods can be used to determine whether a subject can be effectively 

treated with an agent for a disorder. Thus, the invention provides methods for determining 
whether a subject can be effectively treated with an agent for a disorder associated with 
aberrant NOVX expression or activity in which a test sample is obtained and NOVX 
protein or nucleic acid is detected (e.g., wherein the presence of NOVX protein or nucleic 

15 acid is diagnostic for a subject that can be administered the agent to treat a disorder 
associated with aberrant NOVX expression or activity). 

The methods of the invention can also be used to detect genetic lesions in an 
NOVX gene, thereby determining if a subject with the lesioned gene is at risk for a 
disorder characterized by aberrant cell proliferation and/or differentiation. In various 

20 embodiments, the methods include detecting, in a sample of cells from the subject, the 
presence or absence of a genetic lesion characterized by at least one of an alteration 
affecting the integrity of a gene encoding an NOVX-protein, or the misexpression of the 
NOVX gene. For example, such genetic lesions can be detected by ascertaining the 
existence of at least one of: (/) a deletion of one or more nucleotides from an NOVX gene; 

25 (ii) an addition of one or more nucleotides to an NOVX gene; (Hi) a substitution of one or 
more nucleotides of an NOVX gene, (z'v) a chromosomal rearrangement of an NOVX gene; 
(v) an alteration in the level of a messenger RNA transcript of an NOVX gene, (vi) aberrant 
modification of an NOVX gene, such as of the methylation pattern of the genomic DNA, 
(vif) the presence of a non-wild-type splicing pattern of a messenger RNA transcript of an 

30 NOVX gene, (viif) a non-wild-type level of an NOVX protein, (ix) allelic loss of an NOVX 
gene, and (x) inappropriate post-translational modification of an NOVX protein. As 
described herein, there are a large number of assay techniques known in the art which can 
be used for detecting lesions in an NOVX gene. A preferred biological sample is a 
peripheral blood leukocyte sample isolated by conventional means from a subject. 



However, any biological sample containing nucleated cells may be used, including, for 
example, buccal mucosal cells. 

In certain embodiments, detection of the lesion involves the use of a probe/primer 
in a polymerase chain reaction (PCR) (see, e.g., U.S. Patent Nos. 4,683,195 and 
5 4,683,202), such as anchor PCR or RACE PCR, or, alternatively, in a ligation chain 

reaction (LCR) (see, e.g., Landegran, et al, 1988. Science 241: 1077-1080; and Nakazawa, 
et al., 1994. Proc. Natl. Acad. Sci. USA 91 : 360-364), the latter of which can be particularly 
useful for detecting point mutations in the NOVX-gene (see, Abravaya, et al, 1 995. Nucl. 
Acids Res. 23: 675-682). This method can include the steps of collecting a sample of cells 

10 from a patient, isolating nucleic acid (e.g., genomic, mRNA or both) from the cells of the 
sample, contacting the nucleic acid sample with one or more primers that specifically 
hybridize to an NOVX gene under conditions such that hybridization and amplification of 
the NOVX gene (if present) occurs, and detecting the presence or absence of an 
amplification product, or detecting the size of the amplification product and comparing the 

15 length to a control sample. It is anticipated that PCR and/or LCR may be desirable to use 
as a preliminary amplification step in conjunction with any of the techniques used for 
detecting mutations described herein. 

Alternative amplification methods include: self sustained sequence replication (see, 
Guatelli, et a!., 1990. Proc. Natl. Acad. Sci. USA 87: 1874-1878), transcriptional 

20 amplification system (see, Kwoh, et al, 1989. Proc. Natl. Acad. Sci. USA 86: 1 173-1 177); 
Q0 Replicase (see, Lizardi, et al, 1988. BioTechnology 6: 1 197), or any other nucleic acid 
amplification method, followed by the detection of the amplified molecules using 
techniques well known to those of skill in the art. These detection schemes are especially 
useful for the detection of nucleic acid molecules if such molecules are present in very low 

25 numbers. 

In an alternative embodiment, mutations in an NOVX gene from a sample cell can 
be identified by alterations in restriction enzyme cleavage patterns. For example, sample 
and control DNA is isolated, amplified (optionally), digested with one or more restriction 
endonucleases, and fragment length sizes are determined by gel electrophoresis and 
30 compared. Differences in fragment length sizes between sample and control DNA 

indicates mutations in the sample DNA. Moreover, the use of sequence specific ribozymes 
(see, e.g., U.S. Patent No. 5,493,531) can be used to score for the presence of specific 
mutations by development or loss of a ribozyme cleavage site. 
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In other embodiments, genetic mutations in NOVX can be identified by hybridizing 
a sample and control nucleic acids, e.g., DNA or RNA, to high-density arrays containing 
hundreds or thousands of oligonucleotides probes. See, e.g., Cronin, et al., 1996. Human 
Mutation 7: 244-255; Kozal, et al, 1996. Nat. Med 2: 753-759. For example, genetic 
mutations in NOVX can be identified in two dimensional arrays containing light-generated 
DNA probes as described in Cronin, et al., supra. Briefly, a first hybridization array of 
probes can be used to scan through long stretches of DNA in a sample and control to 
identify base changes between the sequences by making linear arrays of sequential 
overlapping probes. This step allows the identification of point mutations. This is 
followed by a second hybridization array that allows the characterization of specific 
mutations by using smaller, specialized probe arrays complementary to all variants or 
mutations detected. Each mutation array is composed of parallel probe sets, one 
complementary to the wild-type gene and the other complementary to the mutant gene. 

In yet another embodiment, any of a variety of sequencing reactions known in the 
art can be used to directly sequence the NOVX gene and detect mutations by comparing 
the sequence of the sample NOVX with the corresponding wild-type (control) sequence. 
Examples of sequencing reactions include those based on techniques developed by Maxim 
and Gilbert, 1977. Proc. Natl. Acad. Sci. USA 74: 560 or Sanger, 1977. Proc. Natl Acad. 
Sci. USA 74: 5463. It is also contemplated that any of a variety of automated sequencing 
procedures can be utilized when performing the diagnostic assays {see, e.g., Naeve, et al., 
1995. Biotechniques 19: 448), including sequencing by mass spectrometry (see, e.g., PCT 
International Publication No. WO 94/16101; Cohen, et al, 1996. Adv. Chromatography 36: 
127-162; and Griffin, et al, 1993. Appl. Biochem. Biotechnol. 38: 147-159). 

Other methods for detecting mutations in the NOVX gene include methods in 
which protection from cleavage agents is used to detect mismatched bases in RNA/RNA or 
RNA/DNA heteroduplexes. See, e.g., Myers, et al, 1985. Science 230: 1242. In general, 
the art technique of "mismatch cleavage" starts by providing heteroduplexes of formed by 
hybridizing (labeled) RNA or DNA containing the wild-type NOVX sequence with 
potentially mutant RNA or DNA obtained from a tissue sample. The double-stranded 
duplexes are treated with an agent that cleaves single-stranded regions of the duplex such 
as which will exist due to basepair mismatches between the control and sample strands. 
For instance, RNA/DNA duplexes can be treated with RNase and DNA/DNA hybrids 
treated with Si nuclease to enzymatically digesting the mismatched regions. In other 
embodiments, either DNA/DNA or RNA/DNA duplexes can be treated with 
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hydroxylamine or osmium tetroxide and with piperidine in order to digest mismatched 
regions. After digestion of the mismatched regions, the resulting material is then separated 
by size on denaturing polyacrylamide gels to determine the site of mutation. See, e.g., 
Cotton, etal, 1988. Proc. Natl. Acad. Sci. USA 85: 4397; Saleeba, et al, 1992. Methods 
Enzymol. 217: 286-295. In an embodiment, the control DNA or RNA can be labeled for 
detection. 

In still another embodiment, the mismatch cleavage reaction employs one or more 
proteins that recognize mismatched base pairs in double-stranded DNA (so called "DNA 
mismatch repair" enzymes) in defined systems for detecting and mapping point mutations 
in NOVX cDNAs obtained from samples of cells. For example, the mutY enzyme of E. 
coli cleaves A at G/A mismatches and the thymidine DNA glycosylase from HeLa cells 
cleaves T at G/T mismatches. See, e.g., Hsu, et al, 1994. Carcinogenesis 15: 1657-1662. 
According to an exemplary embodiment, a probe based on an NOVX sequence, e.g., a 
wild-type NOVX sequence, is hybridized to a cDNA or other DNA product from a test 
cell(s). The duplex is treated with a DNA mismatch repair enzyme, and the cleavage 
products, if any, can be detected from electrophoresis protocols or the like. See, e.g., U.S. 
Patent No. 5,459,039. 

In other embodiments, alterations in electrophoretic mobility will be used to 
identify mutations in NOVX genes. For example, single strand conformation 
polymorphism (SSCP) may be used to detect differences in electrophoretic mobility 
between mutant and wild type nucleic acids. See, e.g., Orita, et al, 1989. Proc. Natl. Acad. 
Sci. USA: 86: 2766; Cotton, 1993. Mutat. Res. 285: 125-144; Hayashi, 1992. Genet. Anal. 
Tech. Appl. 9: 73-79. Single-stranded DNA fragments of sample and control NOVX 
nucleic acids will be denatured and allowed to renature. The secondary structure of 
single-stranded nucleic acids varies according to sequence, the resulting alteration in 
electrophoretic mobility enables the detection of even a single base change. The DNA 
fragments may be labeled or detected with labeled probes. The sensitivity of the assay may 
be enhanced by using RNA (rather than DNA), in which the secondary structure is more 
sensitive to a change in sequence. In one embodiment, the subject method utilizes 
heteroduplex analysis to separate double stranded heteroduplex molecules on the basis of 
changes in electrophoretic mobility. See, e.g., Keen, etal., 1991. Trends Genet. 7: 5. 

In yet another embodiment, the movement of mutant or wild-type fragments in 
polyacrylamide gels containing a gradient of denaturant is assayed using denaturing 
gradient gel electrophoresis (DGGE). See, e.g., Myers, et al., 1985. Nature 313: 495. 
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When DGGE is used as the method of analysis, DNA will be modified to insure that it does 
not completely denature, for example by adding a GC clamp of approximately 40 bp of 
high-melting GC-rich DNA by PCR. In a further embodiment, a temperature gradient is 
used in place of a denaturing gradient to identify differences in the mobility of control and 
5 sample DNA. See, e.g., Rosenbaum and Reissner, 1987. Biophys. Chem. 265: 12753. 

Examples of other techniques for detecting point mutations include, but are not 
limited to, selective oligonucleotide hybridization, selective amplification, or selective 
primer extension. For example, oligonucleotide primers may be prepared in which the 
known mutation is placed centrally and then hybridized to target DNA under conditions 

1 0 that permit hybridization only if a perfect match is found. See, e.g., Saiki, et al, 1 986. 
Nature 324: 163; Saiki, etal., 1989. Proc. Natl. Acad. Sci. USA 86: 6230. Such allele 
specific oligonucleotides are hybridized to PCR amplified target DNA or a number of 
different mutations when the oligonucleotides are attached to the hybridizing membrane 
and hybridized with labeled target DNA. 

1 5 Alternatively, allele specific amplification technology that depends on selective 

PCR amplification may be used in conjunction with the instant invention. 
Oligonucleotides used as primers for specific amplification may carry the mutation of 
interest in the center of the molecule (so that amplification depends on differential 
hybridization; see, e.g., Gibbs, et al, 1989. Nucl. Acids Res. 17: 2437-2448) or at the 

20 extreme 3'-terminus of one primer where, under appropriate conditions, mismatch can 
prevent, or reduce polymerase extension (see, e.g., Prossner, 1993. Tibtech. 11: 238). In 
addition it may be desirable to introduce a novel restriction site in the region of the 
mutation to create cleavage-based detection. See, e.g., Gasparini, et al, 1992. Mol. Cell 
Probes 6: 1. It is anticipated that in certain embodiments amplification may also be 

25 performed using Taq ligase for amplification. See, e.g., Barany, 1991. Proc. Natl Acad. 
Sci. USA 88: 189. In such cases, ligation will occur only if there is a perfect match at the 
3'-terminus of the 5' sequence, making it possible to detect the presence of a known 
mutation at a specific site by looking for the presence or absence of amplification. 
The methods described herein may be performed, for example, by utilizing 

30 pre-packaged diagnostic kits comprising at least one probe nucleic acid or antibody reagent 
described herein, which may be conveniently used, e.g., in clinical settings to diagnose 
patients exhibiting symptoms or family history of a disease or illness involving an NOVX 
gene. 
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Furthermore, any cell type or tissue, preferably peripheral blood leukocytes, in which 
NOVX is expressed may be utilized in the prognostic assays described herein. However, 
any biological sample containing nucleated cells may be used, including, for example, 
buccal mucosal cells. 

Pharmacogenomics 

Agents, or modulators that have a stimulatory or inhibitory effect on NOVX 
activity (e.g., NOVX gene expression), as identified by a screening assay described herein 
can be administered to individuals to treat (prophylactically or therapeutically) disorders 
(The disorders include metabolic disorders, diabetes, obesity, infectious disease, anorexia, 
cancer-associated cachexia, cancer, neurodegenerative disorders, Alzheimer's Disease, 
Parkinson's Disorder, immune disorders, and hematopoietic disorders, and the various 
dyslipidemias, metabolic disturbances associated with obesity, the metabolic syndrome X 
and wasting disorders associated with chronic diseases and various cancers.) In 
conjunction with such treatment, the pharmacogenomics (i.e., the study of the relationship 
between an individual's genotype and that individual's response to a foreign compound or 
drug) of the individual may be considered. Differences in metabolism of therapeutics can 
lead to severe toxicity or therapeutic failure by altering the relation between dose and blood 
concentration of the pharmacologically active drug. Thus, the pharmacogenomics of the 
individual permits the selection of effective agents (e.g., drugs) for prophylactic or 
therapeutic treatments based on a consideration of the individual's genotype. Such 
pharmacogenomics can further be used to determine appropriate dosages and therapeutic 
regimens. Accordingly, the activity of NOVX protein, expression of NOVX nucleic acid, 
or mutation content of NOVX genes in an individual can be determined to thereby select 
appropriate agent(s) for therapeutic or prophylactic treatment of the individual. 

Pharmacogenomics deals with clinically significant hereditary variations in the 
response to drugs due to altered drug disposition and abnormal action in affected persons. 
See e.g., Eichelbaum, 1996. Clin. Exp. Pharmacol. Physiol, 23: 983-985; Linder, 1997. 
Clin. Chem., 43: 254-266. In general, two types of pharmacogenetic conditions can be 
differentiated. Genetic conditions transmitted as a single factor altering the way drugs act 
on the body (altered drug action) or genetic conditions transmitted as single factors altering 
the way the body acts on drugs (altered drug metabolism). These pharmacogenetic 
conditions can occur either as rare defects or as polymorphisms. For example, 
glucose-6-phosphate dehydrogenase (G6PD) deficiency is a common inherited 
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enzymopathy in which the main clinical complication is hemolysis after ingestion of 
oxidant drugs (anti-malarials, sulfonamides, analgesics, nitrofurans) and consumption of 
fava beans. 

As an illustrative embodiment, the activity of drug metabolizing enzymes is a major 
5 determinant of both the intensity and duration of drug action. The discovery of genetic 
polymorphisms of drug metabolizing enzymes (e.g., N-acetyltransferase 2 (NAT 2) and 
cytochrome PREGNANCY ZONE PROTEIN PRECURSOR enzymes CYP2D6 and 
CYP2C19) has provided an explanation as to why some patients do not obtain the expected 
drug effects or show exaggerated drug response and serious toxicity after taking the 

1 0 standard and safe dose of a drug. These polymorphisms are expressed in two phenotypes 
in the population, the extensive metabolizer (EM) and poor metabolizer (PM). The 
prevalence of PM is different among different populations. For example, the gene coding 
for CYP2D6 is highly polymorphic and several mutations have been identified in PM, 
which all lead to the absence of functional CYP2D6. Poor metabolizers of CYP2D6 and 

1 5 CYP2C19 quite frequently experience exaggerated drug response and side effects when 
they receive standard doses. If a metabolite is the active therapeutic moiety, PM show no 
therapeutic response, as demonstrated for the analgesic effect of codeine mediated by its 
CYP2D6-formed metabolite morphine. At the other extreme are the so called ultra-rapid 
metabolizers who do not respond to standard doses. Recently, the molecular basis of 

20 ultra-rapid metabolism has been identified to be due to CYP2D6 gene amplification. 

Thus, the activity of NOVX protein, expression of NOVX nucleic acid, or mutation content 
of NOVX genes in an individual can be determined to thereby select appropriate agent(s) 
for therapeutic or prophylactic treatment of the individual. In addition, pharmacogenetic 
studies can be used to apply genotyping of polymorphic alleles encoding 

25 drug-metabolizing enzymes to the identification of an individual's drug responsiveness 
phenotype. This knowledge, when applied to dosing or drug selection, can avoid adverse 
reactions or therapeutic failure and thus enhance therapeutic or prophylactic efficiency 
when treating a subject with an NOVX modulator, such as a modulator identified by one of 
the exemplary screening assays described herein. 

30 

Monitoring of Effects During Clinical Trials 

Monitoring the influence of agents (e.g., drugs, compounds) on the expression or 
activity of NOVX (e.g., the ability to modulate aberrant ceil proliferation and/or 
35 differentiation) can be applied not only in basic drug screening, but also in clinical trials. 
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For example, the effectiveness of an agent determined by a screening assay as described 
herein to increase NOVX gene expression, protein levels, or upregulate NOVX activity, 
can be monitored in clinical trails of subjects exhibiting decreased NOVX gene expression, 
protein levels, or downregulated NOVX activity. Alternatively, the effectiveness of an 
5 agent determined by a screening assay to decrease NOVX gene expression, protein levels, 
or downregulate NOVX activity, can be monitored in clinical trails of subjects exhibiting 
increased NOVX gene expression, protein levels, or upregulated NOVX activity. In such 
clinical trials, the expression or activity of NOVX and, preferably, other genes that have 
been implicated in, for example, a cellular proliferation or immune disorder can be used as 

10 a "read out" or markers of the immune responsiveness of a particular cell. 

By way of example, and not of limitation, genes, including NOVX, that are 
modulated in cells by treatment with an agent (e.g., compound, drug or small molecule) 
that modulates NOVX activity (e.g., identified in a screening assay as described herein) can 
be identified. Thus, to study the effect of agents on cellular proliferation disorders, for 

15 example, in a clinical trial, cells can be isolated and RNA prepared and analyzed for the 
levels of expression of NOVX and other genes implicated in the disorder. The levels of 
gene expression (i.e., a gene expression pattern) can be quantified by Northern blot analysis 
or RT-PCR, as described herein, or alternatively by measuring the amount of protein 
produced, by one of the methods as described herein, or by measuring the levels of activity 

20 of NOVX or other genes. In this manner, the gene expression pattern can serve as a 

marker, indicative of the physiological response of the cells to the agent. Accordingly, this 
response state may be determined before, and at various points during, treatment of the 
individual with the agent. 

In one embodiment, the invention provides a method for monitoring the 

25 effectiveness of treatment of a subject with an agent (e.g., an agonist, antagonist, protein, 
peptide, peptidomimetic, nucleic acid, small molecule, or other drug candidate identified 
by the screening assays described herein) comprising the steps of (/) obtaining a 
pre-administration sample from a subject prior to administration of the agent; (/'/) detecting 
the level of expression of an NOVX protein, mRNA, or genomic DNA in the 

30 preadministration sample; (Hi) obtaining one or more post-administration samples from the 
subject; (iv) detecting the level of expression or activity of the NOVX protein, mRNA, or 
genomic DNA in the post-administration samples; (v) comparing the level of expression or 
activity of the NOVX protein, mRNA, or genomic DNA in the pre-administration sample 
with the NOVX protein, mRNA, or genomic DNA in the post administration sample or 



samples; and (vz) altering the administration of the agent to the subject accordingly. For 
example, increased administration of the agent may be desirable to increase the expression 
or activity of NOVX to higher levels than detected, i.e., to increase the effectiveness of the 
agent. Alternatively, decreased administration of the agent may be desirable to decrease 
5 expression or activity of NOVX to lower levels than detected, i.e., to decrease the 
effectiveness of the agent. 

Methods of Treatment 

1 0 The invention provides for both prophylactic and therapeutic methods of treating a 

subject at risk of (or susceptible to) a disorder or having a disorder associated with aberrant 
NOVX expression or activity. The disorders include cardiomyopathy, atherosclerosis, 
hypertension, congenital heart defects, aortic stenosis, atrial septal defect (ASD), 
atrioventricular (A-V) canal defect, ductus arteriosus, pulmonary stenosis, subaortic 

1 5 stenosis, ventricular septal defect (VSD), valve diseases, tuberous sclerosis, scleroderma, 
obesity, transplantation, adrenoleukodystrophy, congenital adrenal hyperplasia, prostate 
cancer, neoplasm; adenocarcinoma, lymphoma, uterus cancer, fertility, hemophilia, 
hypercoagulation, idiopathic thrombocytopenic purpura, immunodeficiencies, graft versus 
host disease, AIDS, bronchial asthma, Crohn's disease; multiple sclerosis, treatment of 

20 Albright Hereditary Ostoeodystrophy, and other diseases, disorders and conditions of the 
like. 

These methods of treatment will be discussed more fully, below. 
Disease and Disorders 

25 

Diseases and disorders that are characterized by increased (relative to a subject not 
suffering from the disease or disorder) levels or biological activity may be treated with 
Therapeutics that antagonize {i.e., reduce or inhibit) activity. Therapeutics that antagonize 
activity may be administered in a therapeutic or prophylactic manner. Therapeutics that 

30 may be utilized include, but are not limited to: (i) an aforementioned peptide, or analogs, 
derivatives, fragments or homologs thereof; (if) antibodies to an aforementioned peptide; 
(Hi) nucleic acids encoding an aforementioned peptide; (rV) administration of antisense 
nucleic acid and nucleic acids that are "dysfunctional" (i.e., due to a heterologous insertion 
within the coding sequences of coding sequences to an aforementioned peptide) that are 

35 utilized to "knockout" endogenous function of an aforementioned peptide by homologous 
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recombination (see, e.g., Capecchi, 1989. Science 244: 1288-1292); or (v) modulators ( i.e., 
inhibitors, agonists and antagonists, including additional peptide mimetic of the invention 
or antibodies specific to a peptide of the invention) that alter the interaction between an 
aforementioned peptide and its binding partner. 
5 Diseases and disorders that are characterized by decreased (relative to a subject not 
suffering from the disease or disorder) levels or biological activity may be treated with 
Therapeutics that increase (i.e., are agonists to) activity. Therapeutics that upregulate 
activity may be administered in a therapeutic or prophylactic manner. Therapeutics that 
may be utilized include, but are not limited to, an aforementioned peptide, or analogs, 

1 0 derivatives, fragments or homologs thereof; or an agonist that increases bioavailability. 

Increased or decreased levels can be readily detected by quantifying peptide and/or 
RNA, by obtaining a patient tissue sample (e.g., from biopsy tissue) and assaying it in vitro 
for RNA or peptide levels, structure and/or activity of the expressed peptides (or mRNAs 
of an aforementioned peptide). Methods that are well-known within the art include, but are 

1 5 not limited to, immunoassays (e.g., by Western blot analysis, immunoprecipitation 
followed by sodium dodecyl sulfate (SDS) polyacrylamide gel electrophoresis, 
immunocytochemistry, etc.) and/or hybridization assays to detect expression of mRNAs 
(e.g., Northern assays, dot blots, in situ hybridization, and the like). 

20 Prophylactic Methods 

In one aspect, the invention provides a method for preventing, in a subject, a 
disease or condition associated with an aberrant NOVX expression or activity, by 
administering to the subject an agent that modulates NOVX expression or at least one 

25 NOVX activity. Subjects at risk for a disease that is caused or contributed to by aberrant 
NOVX expression or activity can be identified by, for example, any or a combination of 
diagnostic or prognostic assays as described herein. Administration of a prophylactic agent 
can occur prior to the manifestation of symptoms characteristic of the NOVX aberrancy, 
such that a disease or disorder is prevented or, alternatively, delayed in its progression. 

30 Depending upon the type of NOVX aberrancy, for example, an NOVX agonist or NOVX 
antagonist agent can be used for treating the subject. The appropriate agent can be 
determined based on screening assays described herein. The prophylactic methods of the 
invention are further discussed in the following subsections. 



93 



Therapeutic Methods 

Another aspect of the invention pertains to methods of modulating NOVX 
expression or activity for therapeutic purposes. The modulatory method of the invention 
involves contacting a cell with an agent that modulates one or more of the activities of 
5 NOVX protein activity associated with the cell. An agent that modulates NOVX protein 
activity can be an agent as described herein, such as a nucleic acid or a protein, a 
naturally-occurring cognate ligand of an NOVX protein, a peptide, an NOVX 
peptidomimetic, or other small molecule. In one embodiment, the agent stimulates one or 
more NOVX protein activity. Examples of such stimulatory agents include active NOVX 

10 protein and a nucleic acid molecule encoding NOVX that has been introduced into the cell. 
In another embodiment, the agent inhibits one or more NOVX protein activity. Examples 
of such inhibitory agents include antisense NOVX nucleic acid molecules and anti-NOVX 
antibodies. These modulatory methods can be performed in vitro (e.g., by culturing the cell 
with the agent) or, alternatively, in vivo (e.g., by administering the agent to a subject). As 

1 5 such, the invention provides methods of treating an individual afflicted with a disease or 
disorder characterized by aberrant expression or activity of an NOVX protein or nucleic 
acid molecule. In one embodiment, the method involves administering an agent (e.g., an 
agent identified by a screening assay described herein), or combination of agents that 
modulates (e.g., up-regulates or down-regulates) NOVX expression or activity. In another 

20 embodiment, the method involves administering an NOVX protein or nucleic acid 

molecule as therapy to compensate for reduced or aberrant NOVX expression or activity. 

Stimulation of NOVX activity is desirable in stations in which NOVX is 
abnormally downregulated and/or in which increased NOVX activity is likely to have a 
beneficial effect. One example of such a situation is where a subject has a disorder 

25 characterized by aberrant cell proliferation and/or differentiation (e.g., cancer or immune 
associated disorders). Another example of such a situation is where the subject has a 
gestational disease (e.g., preclampsia). 

Determination of the Biological Effect of the Therapeutic 

30 

In various embodiments of the invention, suitable in vitro or in vivo assays are 
performed to determine the effect of a specific Therapeutic and whether its administration 
is indicated for treatment of the affected tissue. 

In various specific embodiments, in vitro assays may be performed with representative 
35 cells of the type(s) involved in the patient's disorder, to determine if a given Therapeutic 
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exerts the desired effect upon the cell type(s). Compounds for use in therapy may be tested 
in suitable animal model systems including, but not limited to rats, mice, chicken, cows, 
monkeys, rabbits, and the like, prior to testing in human subjects. Similarly, for in vivo 
testing, any of the animal model system known in the art may be used prior to 
5 administration to human subjects. 

Prophylactic and Therapeutic Uses of the Compositions of the Invention 

The NOVX nucleic acids and proteins of the invention are useful in potential 
10 prophylactic and therapeutic applications implicated in a variety of disorders including, but 
P not limited to: metabolic disorders, diabetes, obesity, infectious disease, anorexia, cancer- 

~: associated cancer, neurodegenerative disorders, Alzheimer's Disease, Parkinson's 

ilj Disorder, immune disorders, hematopoietic disorders, and the various dyslipidemias, 

% metabolic disturbances associated with obesity, the metabolic syndrome X and wasting 

D 1 5 disorders associated with chronic diseases and various cancers. 

q As an example, a cDNA encoding the NOVX protein of the invention may be 

useful in gene therapy, and the protein may be useful when administered to a subject in 
need thereof. By way of non-limiting example, the compositions of the invention will have 
^ efficacy for treatment of patients suffering from: metabolic disorders, diabetes, obesity, 

20 infectious disease, anorexia, cancer-associated cachexia, cancer, neurodegenerative 

disorders, Alzheimer's Disease, Parkinson's Disorder, immune disorders, hematopoietic 
disorders, and the various dyslipidemias. 

Both the novel nucleic acid encoding the NOVX protein, and the NOVX protein of the 
invention, or fragments thereof, may also be useful in diagnostic applications, wherein the 
25 presence or amount of the nucleic acid or the protein are to be assessed. A further use 

could be as an anti-bacterial molecule (i.e., some peptides have been found to possess anti- 
bacterial properties). These materials are further useful in the generation of antibodies, 
which immunospecifically-bind to the novel substances of the invention for use in 
therapeutic or diagnostic methods. 

30 

Sequence Analyses 



The sequence of NOVX was derived by laboratory cloning of cDNA fragments, by 
in silico prediction of the sequence. cDNA fragments covering either the full length of the 
35 DNA sequence, or part of the sequence, or both, were cloned. In silico prediction was 
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based on sequences available in CuraGen's proprietary sequence databases or in the public 
human sequence databases, and provided either the full length DNA sequence, or some 
portion thereof. 



5 The laboratory cloning was performed using one or more of the methods 

summarized below: 

SeqCalling™Technology: cDNA was derived from various human samples 
representing multiple tissue types, normal and diseased states, physiological states, and 

1 0 developmental states from different donors. Samples were obtained as whole tissue, 

primary cells or tissue cultured primary cells or cell lines. Cells and cell lines may have 
been treated with biological or chemical agents that regulate gene expression, for example, 
growth factors, chemokines or steroids. The cDNA thus derived was then sequenced using 
CuraGen Corporation's SeqCalling technology which is disclosed in full in U. S. Ser. Nos. 

15 09/417,386 filed Oct. 13, 1999, and 09/614,505 filed July 1 1, 2000. Sequence traces were 
evaluated manually and edited for corrections if appropriate. cDNA sequences from all 
samples were assembled together, sometimes including public human sequences, using 
bioinformatics programs to produce a consensus sequence for each assembly. Each 
assembly is included in CuraGen Corporation's database. Sequences were included as 

20 components for assembly when the extent of identity with another component was at least 
95% over 50 bp. Each assembly represents a gene or portion thereof and includes 
information on variants, such as splice forms single nucleotide polymorphisms (SNPs), 
insertions, deletions and other sequence variations. 

Variant sequences are also included in this application. A variant sequence can 

25 include a single nucleotide polymorphism (SNP). A SNP can, in some instances, be 
referred to as a "cSNP" to denote that the nucleotide sequence containing the SNP 
originates as a cDNA. A SNP can arise in several ways. For example, a SNP may be due to 
a substitution of one nucleotide for another at the polymorphic site. Such a substitution can 
be either a transition or a transversion. A SNP can also arise from a deletion of a nucleotide 

30 or an insertion of a nucleotide, relative to a reference allele. In this case, the polymorphic 
site is a site at which one allele bears a gap with respect to a particular nucleotide in 
another allele. SNPs occurring within genes may result in an alteration of the amino acid 
encoded by the gene at the position of the SNP. Intragenic SNPs may also be silent, when a 
codon including a SNP encodes the same amino acid as a result of the redundancy of the 
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genetic code. SNPs occurring outside the region of a gene, or in an intron within a gene, do 
not result in changes in any amino acid sequence of a protein but may result in altered 
regulation of the expression pattern. Examples include alteration in temporal expression, 
physiological response regulation, cell type expression regulation, intensity of expression, 
and stability of transcribed message. 

Presented information includes that associated with genomic clones, public genes 
and ESTs sharing sequence identity with the disclosed sequence and CuraGen 
Corporation's Electronic Northern bioinformatic tool. 

Examples 

Example A: Sequence related information 



The NOV1 clone was analyzed, and the nucleotide and predicted polypeptide 
sequences are shown in Table 1 A. 



Table 1A. NOV1 Sequence Analysis 




SEQ ID NO: 1 [711 bp 


NOV la, 

CG5 8522-01 DNA Sequence 


TGCAGAATGAACCAAGGAGACTCAAACCCAGCAGCTACTCCGCATGCGGCAGAAGACA 
TTCAAGGAGATGACAGATGGATGTGTCAGCACAACAGATTTGTTTTGGACTGTAAAGA 
CAAACAGCCTGATGTACCATTTGCGGGAGGCTCCGTGGTGCAGTTACTGCAGCCATAT 
GAGATATGGCGAGAGCTTTTTTCCCCACTTCATGCACTGAATTTTGGAACTGGGGGAG 
ATACAACAAGACATGTTTTGTGGAGACTAAAGAGCGGAGAACTGGGGAATACTAAGCC 




GCAGGTGGTATGGCGGCCATCGTACAACTTATCAACACAAGGCAGCCACAGGCCAAAA 
TCATTGTATTTGATCTGTTACCTCAAGGTGAGAAACCCAACCCTTTGAGGCAAAAGAA 
CGCCAAGGTGAACCCACTCGTCAAGATTTCGCTGCTGAAACTTACCAACGTGCAGCTC 
CTGGATACTGACAGGGGTTTCGTGCACTC CGACCGTGCCAT CTCCTGCCACGACATGT 
TTGATTTTCTGCATTTGACAGGAGGTGGCTACTCAAAGGTCTGCAAACCCTTGAATGA 
ACTGATCATGCAGTTGTTGGAGGAAACACCTGAGGAGAAACAAACCACCATTGCCTGA 
CTGGCTCCCATGAGT 




ORF Start: ATG at 7 


ORF Stop: TGA at 694 




SEQ ID NO: 2 


229 aa JMW at 25656.2kD 


NOVla, 

CG58522-01 Protein Sequence 


MNQGDSNPAATPHAAEDIQGDDRWMCQHNRFVLDCKDKQPDVPFAGGSWQLLQPYEI 
WRELFSPLHALNFGTGGDTTRHVLWRLKSGELGNTKPKVIVFWLGRN13HENMAEEVAG 
GMAAIVQLINTRQPQAKIIVFDLLPQGEKPNPLRQKNAKVNPLVKISLLItLTNVOLLD 
TDRGFVHSDRAISCHDMFDFLHLTGGGYSKVCKPLNELIMQLLEETPEEKQTTIA 



Further analysis of the NOVla protein yielded the following properties shown in 
Table IB. 



Table IB. Protein Sequence Properties NOVla 


Psort 
analysis: 


0.6500 probability located in cytoplasm; 0.2340 probability located 
in lysosome (lumen); 0.1000 probability located in mitochondrial 
matrix space; 0.0000 probability located in endoplasmic reticulum 
(membrane) 
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SignalP No Known Signal Sequence Predicted 
analysis: 



A search of the NOV la protein against the Geneseq database, a proprietary 
database that contains sequences published in patents and patent publication, yielded 
several homologous proteins shown in Table 1C. 



Table 1C. Geneseq Results for NOVla 


Geneseq 
Identifier 


Protein/Organism/Length [Patent 
#, Date] 


NOVla 
Residues/ 

Match 
Residues 


Identities/ 
Similarities 
for the 
Matched 
Region 


Expect 
Value 


AAB49433 


Human beta platelet activating 
factor acetylhydrolase - Homo 
sapiens, 229 aa. [US6146868-A, 
14-JNUV-zuUUJ 


1..229 
1..229 


196/229 
(85%) 

209/229 
(90%) 


e-114 


AAB49432 


Rat beta platelet activating factor 
acetylhydrolase - Rattus 
norvegicus, 229 aa. [US6146868-A, 
14-NOV-2000] 


1..229 
1..229 


195/229 
(85%) 

208/229 
(90%) 


e-114 


AAB49434 


Murine beta platelet activating 
factor acetylhydrolase - Mus 
musculus, 229 aa. [US6146868-A, 
14-NOV-2000] 


1..229 
1..229 


192/229 
(83%) 

205/229 
(88%) 


e-111 


AAB49436 


Bovine gamma platelet activating 
factor acetylhydrolase - Bos taurus, 
232 aa. [US6146868-A, 14-NOV- 
2000] 


4..219 
3..218 


124/216 
(57%) 

165/216 
(75%) 


5e-74 


AAB49435 


Human gamma platelet activating 
factor acetylhydrolase - Homo 
sapiens, 231 aa. [US6146868-A, 
14-NOV-2000] 


4..219 
3..218 


124/216 
(57%) 

164/216 
(75%) 


2e-73 



In a BLAST search of public sequence databases, the NOVla protein was found to 
have homology to the proteins shown in the BLASTP data in Table 1 D. 
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Table ID. Public BLASTP Results for NO VI a 


Protein 
Accession 
Number 


Protein/Organism/Length 


NO VI a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Portion 


Expect 
Value 


Q29459 


Platelet-activating factor 
acetylhydrolase IB beta subunit (EC 
3.1.1.47) (PAF acetylhydrolase 30 
kDa subunit) (PAF-AH 30 kDa 
subunit) (PAF-AH beta subunit) 
(PAF AH beta subunit) - Homo 
sapiens (Human), and, 229 aa. 


1..229 
1..229 


196/229 (85%) 
209/229 (90%) 


e-114 


035264 


Platelet-activating factor 
acetylhydrolase IB beta subunit (EC 
3.1.1.47) (PAF acetylhydrolase 30 
kDa subunit) (PAF-AH 30 kDa 
subunit) (PAF-AH beta subunit) 
(PAFAH beta subunit) (Platelet- 
activating factor acetylhydrolase alpha 
2 subunit) (PAF-AH alpha 2) - Rattus 
norvegicus (Rat), 229 aa. 


L.229 
1..229 


195/229 (85%) 
208/229 (90%) 


e-113 


Q61206 


Platelet-activating factor 
acetylhydrolase IB beta subunit (EC 
3.1 .1 .47) (PAF acetylhydrolase 30 
kDa subunit) (PAF-AH 30 kDa 
subunit) (PAF-AH beta subunit) 
(PAFAH beta subunit) - Mus 
musculus (Mouse), 229 aa. 


1..229 
1..229 


192/229 (83%) 
205/229 (88%) 


e-111 


Q29460 


Platelet-activating factor 
acetylhydrolase IB gamma subunit 
(EC 3.1.1.47) (PAF acetylhydrolase 
29 kDa subunit) (PAF-AH 29 kDa 
subunit) (PAF-AH gamma subunit) 
(PAFAH gamma subunit) - Bos taurus 
(Bovine), 232 aa. 


4..219 
3..218 


125/216(57%) 
165/216(75%) 


8e-74 


Q15102 


Platelet-activating factor 
acetylhydrolase IB gamma subunit 
(EC 3.1.1.47) (PAF acetylhydrolase 
29 kDa subunit) (PAF-AH 29 kDa 
subunit) (PAF-AH gamma subunit) 
(PAFAH gamma subunit) - Homo 
sapiens (Human), 231 aa. 


4..219 
3..218 


124/216(57%) 
164/216(75%) 


7e-73 



PFam analysis predicts that the NOVla protein contains the domains shown in the 
Table IE. 
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Table IE. Domain Analysis of NO VI a 


Pfam Domain 


NOVla Match Region 


Identities/ 
Similarities 
for the Matched Region 


Expect 
Value 


PAF-AH: domain 1 
of 1 


7..221 


150/215 (70%) 
186/215(87%) 


6e-147 



Example 2. 

The NOV2 clone was analyzed, and the nucleotide and predicted polypeptide 
sequences are shown in Table 2A. 



Table 2A. NOV2 Sequence Analysis 




SEQIDNO:3 1 1457 bp 


NOV2a, 

u\jjojzu-ui ui\/\ sequence 


CGATTCCGATGGGTCCTTTGAAAGCTTTTCTCTTCTCCCCTTTTCTTCTOCRfiAfiTrA 
AAGTAGAGGGGTGAGGTTGGTCTTCTTGTTACTGACCCTGCATTTGGGAAACGTTGAT 
AAGGCAGATGATGAAGATGATGAGGATTTAACGGTGAACAAAACCTGGGTCTTGGCCC 
CAAAAATTCATGAAGGAGATATCACACAAATTCTGAATTCATTGCTTCAAGGCTATGA 
CAATAAACTTCGTCCAGATATAGGAGTGAGGCCCACAGTAATTGAAACTGATGTTTAT 
GTAAACAGCATTGGACCAGTTGATCCAATTAATATGGAATATACAATAGATATAATTT 
TTGCCCAAACCTGGTTTGACAGTCGTTTAAAATTCAATAGTACCATGAAAGTGCTTAT 
G CTTAACAGTAATATGGTTGGAAAAATTTGGATT CCTGACACTTTC TTCAGAAACTCA 
AGAAAATCTGATGCTCACTGGATAACAACTCCTAATCGTCTGCTTCGAATTTGGAATG 
ATGGACGAGTTCTGTATACTCTAAGGAGATTGACAATTAATGCAGAATGTTATCTTCA 
GCTTCATAACTTTCCCATGGATGAACATTCCTGTCCACTGGAATTTTCAAGCTTCTCT 
ATAGATGGATACCCTAAAAATGAAATTGAGTTATCAATGGAAGCGAAGTTCTGTGGAA 
GTGGGCGACACAAGATCCGGAGATTATATCAGTTTGCATTTGTAGGGTTACGGAACTC 

CTGAGCAGAAGAATGGGATATTTCACTATTCAGACCTACATTCCATGCATTCTGACAG 
TTGTTCTTTCTTGGGTGTCTTTTTGGATCAATAAAGATGCAGTGCCTGCAAGAACATC 
GTTGGGTATGACATCTATAGGTATCACTACAGTTCTGACTATGACAACCCTGAGTACA 
ATTGCCAGGAAGTCTTTACCTAAGGTTTCTTATGTGACTGCGATGGATCTCTTTGTTT 
CTGTTTGTTTCATTTTTGTTTTTGCAGCCTTGATGGAATATGGAACCTTGCATTATTT 
TACCAGCAACCAAAAAGGAAAGACTGCTACTAAAGACAGAAAGCTAAAAAATAAAGCC 
TCGACTCCTGGTCTCCATCCTGGATCCACTCTGATTCCAATGAATAATATTTCTGTGC 
CGCAAGAAGATGATTATGGGTATCAGTGTTTGGAGGGCAAAGATTGTGCCAGCTTCTT 
CTGTTGCTTTGAAGACTGCAGAACAGGATCTTGGAGGGAAGGAAGGATACACATACGC 
ATTGCCAAAATTGACTCTTATTCTAGAATATTTTTCCCAACCGCTTTTGCCCTGTTCA 
ACTTGGTTTATTGGGTTGGCTATCTTTACTTATAAAATCTACTTCATAAGCAAAAATC 
AAAAGAA 




ORF Start: ATG at 9 


ORF Stop: TAA at 1425 




SEQ ID NO: 4 


472 aa ImW at 54100.9kD 


NOV2a, 

CG58520-01 Protein Sequence 


MGPLKAFLFSPFLI.RSQSRGVRLVFLLLTLHLGNVDKADDEDDEDLTVNKTWVLAPKI 
HEGDITQILNSLLQGYDNKLRPDIGVRPTVIETDVYVNSIGPVDPINMEYTIDIIFAQ 
TWFDSRLKFNSTMKVLMLNSNMVGKIWIPDTFFRNSRKSDAHWITTPNRIjLRIWNDGR 

vlytlrrltinaecylqlhnfpmdehscplefssfsidgypkneielsmeakfcgsgr 
hkirrlyqfafvglrnsteithtisgdyvimtiffdlsrrmgyftiqtyipciltwl 
swvsfwinkdavpartslgmtsigittvltmttlstiarkslpkvsyvtamdlfvsvc 
fifwaalmeygtlhyftsnqkgktatkdrklknkastpglhpgstlipmnnisvpqe 
ddygyqclegkdcasffccfedcrtgswregrihiriakidsysriffptafalfnlv 
ywvgylyl 




SEQ ID NO: 5 1 1521 bp 


NOV2b, 

CG5 8 520-02 DNA Sequence 


CAACCAAGAGGCAAGAGGCGAGAGAAGGAAAAAAAAAAAAGCGATGAGTTCGCCAAAT 


ATATGGAGCACAGGAAGCTCAGTCTACTCGACTCCTGTATTTTCACAGAAAATGACGG 
TGTGGATTCTGCTCCTGCTGTCGCTCTACCCTGGCTTCACTAGCCAGAAATCTGATGA 
TGACTATGAAGATTATGCTTCTAACAAAACATGGGTCTTGACTCCAAAAGTTCCTGAG 
GGTGATGTCACTGTCATCTTAAACAACCTGCTGGAAGGATATGACAATAAACTTCGGC 
CTGATATAGGAGTGAAGCCAACGTTAATTCACACAGACATGTATGTGAATAGCATTGG 
TCCAGTGAACGCTATCAATATGGAATACACTATTGATATATTTTTTGCGCAAACGTGG 
TATGACAGACGTTTGAAATTTAACAGCACCATTAAAGTCCTCCGATTGAACAGCAACA 
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TGGTGGGGAAAATCTGGATTCCAGACACTTTCTTCAGAAATTCCAAAAAAGCTGATGC 
ACACTGGATCACCACCCCCAACAGGATGCTGAGAATTTGGAATGATGGTCGAGTGCTC 
TACACCCTAAGGTTGACAATTGATGCTGAGTGCCAATTACAATTGCACAACTTTCCAA 
TGGATGAACACTCCTGCCCCTTGGAGTTCTCCAGTTATGGCTATCCACGTGAAGAAAT 
TGTTTATCAATGGAAGCGAAGTTCTGTTGAAGTGGGCGACACAAGATCCTGGAGGCTT 

GAGATTATGTGGTCATGTCTGTCTACTTTGATCTGAGCAGAAGAATGGGATACTTTAC 
CATCCAGACCTATATCCCCTGCACACTCATTGTCGTCCTATCCTGGGTGTCTTTCTGG 
ATCAATAAGGATGCTGTTCCAGCCAGAACATCTTTAGGTATCACCACTGTCCTGACAA 
TGACCACCCTCAGCACCATTGCCCGGAAATCGCTCCCCAAGGTCTCCTATGTCACAGC 
GATGGATCTCTTTGTATCTGTTTGTTTCATCTTTGTCTTCTCTGCTCTGGTGGAGTAT 
GGCACCTTGCATTATTTTGTCAGCAACCGGAAACCAAGCAAGGACAAAGATAAAAAGA 
AGAAAAACCCTCTTCTTCGGATGTTTTCCTTCAAGGCCCCTACCATTGATATCCGCCC 
AAGATCAGCAACCATTCAAATGAATAATGCTACACACCTTCAAGAGAGAGATGAAGAG 
TACGGCTATGAGTGTCTGGACGGCAAGGACTGTGCCAGTTTTTTCTGCTGTTTTGAAG 
ATTGTCGAACAGGAGCTTGGAGACATGGGAGGATACATATCCGCATTGCCAAAATGGA 
CTCCTATGCTCGGATCTTCTTCCCCACTGCCTTCTGCCTGTTTAATCTGGTCTATTGG 
GTCTCCTACCTCTACCTGTGAGGAGGTATGGGTTTTACTGATATGGTTCTTATTCACT 


sagtc; iX ft 




ORF Start: ATG at 44 


ORF Stop: TG A at 1469 




SEQ ID NO: 6 


475 aa ImW at 55 1 84.9kD 


NOV2b, 

CG5 8520-02 Protein Sequence 


MSSPNIWSTGSSVYSTPVFSQKMTVWILLLLSLYPGFTSQKSDDDYEDYASNKTWVLT 
PKVPEGDVTVIIiNNLLEGYDNKLRPDIGVKPTLIHTDMYVNSIGPVNAINMEYTIDIF 
FAQTWYDRRLKFNSTIKVLRLNSNMVGKIWIPDTFFRNSKKADAHWITTPNRMLRIWN 
DGRVLYTLRLTIDAECQLQLHNFPMDEHSCPLEFSSYGYPREEIVYQWKRSSVEVGDT 
RSWRLYQFSFVGLRNTTEWKTTSGDYWMSVYFDLSRRMGYFTIQTYIPCTLIWLS 
WVSFWINKDAVPARTSLGITTVLTMTTLSTIARKSLPKVSYVTAMDLFVSVCFIFVFS 
ALVEYGTLHYFVSNRKPSKDKDKKKKNPLLRMFSFKAPTIDIRPRSATIQMNNATHLQ 
ERDEEYGYECLDGKDCASFFCCFEDCRTGAWRHGRIHIRIAKMDSYARIFFPTAFCLF 
NLVYWVSYLYL 




SEQ ID NO: 7 1 1455 bp 


NOV2c, 

CG58520-03 DNA Sequence 


TAGTGCAGCACACGTAAAAAAGCGATTCCGATGGGTCCTTTGAAAGCTTTTCTCTTCT 
CCCCTTTTCTTCTGCGGAGTCAAAGTAGAGGGGTGAGGTTGGTCTTCTTGTTACTGAC 
CCTGCATTTGGGAAACTGGGTTGATAAGGCAGATGATGAAGATGATGAGGATTTAACG 
GTGAACAAAACCTGGGTCTTGGCCCCAAAAATTCATGAAGGAGATATCACACAAATTC 
TGAATTCATTGCTTCAAGGCTATGACAATAAACTTCGTCCAGATATAGGAGTGAGGCC 
CACAGTAATTGAAACTGATGTTTATGTAAACAGCATTGGACCAGTTGATCCAATTAAT 
ATGGAATATACAATAGATATAATTTTTGCCCAAACCTGGTTTGACAGTCGTTTAAAAT 
TCAATAGTACCATGAAAGTGCTTATGCTTAACAGTAATATGGTTGGAAAAATTTGGAT 
TCCTGACACTTTCTTCAGAAACTCAAGAAAATCTGATGCTCACTGGATAACAACTCCT 
AATCGTCTGCTTCGAATTTGGAATGATGGACGAGTTCTGTATACTCTAAGGTTGACAA 
TTAATGCAGAATGTTATCTTCAGCTTCATAACTTTCCCATGGATGAACATTCCTGTCC 
ACTGGAATTTTCAAGCGATGGATACCCTAAAAATGAAATTGAGTATAAGTGGAAAAAG 
CCCTCCGTAGAAGTGGCTGATCCTAAATACTGGAGATTATATCAGTTTGCATTTGTAG 
GGTTACGGAACTCAACTGAAATCACTCACACGATCTCTGGTGATTATGTTATCATGAC 
AATTTTrTTTGACCTGAGCAGAAGAATGGGATATTTCACTATTCAGACCTACATTCCA 
TGCATTCTGACAGTTGTTCTTTCTTGGGTGTCTTTTTGGATCAATAAAGATGCAGTGC 
CTGCAAGAACATCGTTGGGTATCACTACAGTTCTGACTATGACAACCCTGAGTACAAT 
TGCCAGGAAGTCTTTACCTAAGGTTTCTTATGTGACTGCGATGGATCrCTTTGTTTCT 
GTTTGTTTCATTTTTGTTTTTGCAGCCTTGATGGAATATGGAACCTTGCATTATTTTA 
CCAGCAACCAAAAAGGAAAGACTGCTACTAAAGACAGAAAGCTAAAAAATAAAGCCTC 
GGTAACTCCTGGTCTCCATCCTGGATCCACTCTGATTCCAATGAATAArATTrCTGTG 
CCGCAAGAAGATGATrATGGGTATCAGTGTTTGGAGGGCAAAGATTGTGCCAGCTTCT 
TCTGTTGCTTTGAAGACTGCAGAACAGGATCTTGGAGGGAAGGAAGGATACACATACG 
CATTGCCAAAATTGACTCTTATTCTAGAATATTTTTCCCAACCGCTTTTGCCCTGTTC 
AACTrGGTTTATTGGGTTGGCTATCTTTACTTATAAAATCTACTTCATAAGCAAAAAT 
CAAAA 




ORF Start: ATG at 31 


ORF Stop: TAA at 1426 




SEQ ID NO: 8 


465 aa~lMW at 53597.3kD 


NOV2c, 

CG58520-03 Protein Sequence 


MGPLKAFLFSPFLLRSQSRGVPiWLLLTLHLGNWVDKADDEDDEDLTVNKTWVLAPK 
IHEGDITQILNSLLQGYDNKLRPDIGVRPTVIETDVYVNSIGPVDPINMEYTIDIIFA 
QTWFDSRLKFNSTMKVLMLNSNMVGKIWIPDTFFRNSRKSDAHWITTPNRLLRIWHDG 
RVLYTLRLTINAECYLQLHNFPMDEHSCPLEFSSDGYPKNEIEYKWKKPSVEVADPKY 
WRLYQFAFVGLRNSTEITHTISGDYVIMTIFFDLSRRMGYFTIQTYIPCILTWLSWV 
SFWINKDAVPARTSLGITTVLTMTTLSTIARKSLPKVSYVTAMDLFVSVCFIFVFAAL 
MEYGTLHYFTSNQKGKTATKDRKLKNKASVTPGLHPGSTLIPMNNISVPQEDDYGYQC 
LEGKDCASFFCCFEDCRTGSWREGRIHIRIAKIDSYSRIFFPTAFALFNLVYWVGYLY 
L 
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Sequence comparison of the above protein sequences yields the following sequence 
relationships shown in Table 2B. 



Table 2B. Comparison of NOV2a against NOV2b through NOV2c. 


Protein Sequence 


NOV2a Residues/ 
Match Residues 


Identities/ 
Similarities for the Matched Region 


NOV2b 


24..472 
27..475 


311/458 (67%) 
352/458 (75%) 


NOV2c 


1..472 
1..465 


414/474 (87%) 
415/474(87%) 



Further analysis of the NOV2a protein yielded the following properties shown in 
Table 2C. 



Table 2C. Protein Sequence Properties NOV2a 


Psort 
analysis: 


0.6400 probability located in plasma membrane; 0.4600 probability located 
in Golgi body; 0.3700 probability located in endoplasmic reticulum 
(membrane); 0.1000 probability located in endoplasmic reticulum (lumen) 


SignalP 
analysis: 


Likely cleavage site between residues 38 and 39 



A search of the NOV2a protein against the Geneseq database, a proprietary 
database that contains sequences published in patents and patent publication, yielded 
several homologous proteins shown in Table 2D. 



Table 2D. Geneseq Results for NOV2a 


Geneseq 
Identifier 


Protein/Organism/Length 
[Patent #, Date] 


NOV2a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities 
for the 
Matched 
Region 


Expect 
Value 


AAM41007 


Human polypeptide SEQ ID NO 
5938 - Homo sapiens, 489 aa. 
[ WO200 1 533 12-A 1 , 26-JUL- 
2001] 


24..472 
49..489 


334/451 
(74%) 

379/451 
(83%) 


0.0 


AAM39221 


Human polypeptide SEQ ID NO 
2366 - Homo sapiens, 467 aa. 
[WO200153312-A1, 26-JUL- 
2001] 


24..472 
27..467 


334/451 
(74%) 

379/451 
(83%) 


0.0 
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AAR83968 


GABA-A receptor gamma-3 
subunit - Homo sapiens, 467 aa. 
[W09529234-A1, 02-NOV- 
1995] 


5..467 


iUU/4 11 

(63%) 
356/472 
(74%) 


e-169 


AAW59048 


GABA-A receptor epsilon sub- 
unit related protein - Mammalia, 
506 aa. [DE19644501-A1, 30- 
APR-1998] 


62..472 
70..506 


1 93/448 
(43%) 

274/448 
(61%) 


e-102 


AAW61045 


Human GABA receptor epsilon 
subunit - Homo sapiens, 506 aa. 
[WQ9823742-A1, 04-JUN-1998] 


62..472 
70..506 


193/448 
(43%) 

274/448 
(61%) 


e-102 



o 

.« In a BLAST search of public sequence databases, the NOV2a protein was found to 

W have homology to the proteins shown in the BLASTP data in Table 2E. 



a 



Table 2E. Public BLASTP Results for NOV2a 


Protein 
Accession 
Number 


Protein/Organism/Length 


NOV2a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities 
for the 
Matched 
Portion 


Expect 
Value 


P23574 


Gamma-aminobutyric-acid 
receptor gamma- 1 subunit 
precursor (GABA(A) receptor) - 
Rattus norvegicus (Rat), 465 aa. 


1..472 
1..465 


426/475 
(89%) 

440/475 
(91%) 


0.0 


Q9R0Y8 


Gamma-aminobutyric-acid 
receptor gamma- 1 subunit 
precursor (GABA(A) receptor) - 
Mus musculus (Mouse), 465 aa. 


1..472 
1..465 


420/477 
(88%) 

434/477 
(90%) 


0.0 


JH0824 


gamma-aminobutyric acid A 
receptor gamma 1 chain 
precursor - chicken, 464 aa. 


16..472 
12..464 


390/463 
(84%) 

416/463 
(89%) 


0.0 


JH0316 


gamma-aminobutyric acid A 
receptor gamma 2 chain 
alternatively spliced precursor - 
mouse, 466 aa. 


24..472 
26..466 


336/451 
(74%) 

380/451 
(83%) 


0.0 


P18508 


Gamma-aminobutyric-acid 
receptor gamma-2 subunit 
precursor (GABA(A) receptor) - 
Rattus norvegicus (Rat), 466 aa. 


24..472 
26..466 


335/451 
(74%) 

379/451 
(83%) 


0.0 
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PFam analysis predicts that the NOV2a protein contains the domains shown in the 
Table 2F. 



Table 2F. Domain Analysis of NOV2a 


Pfam Domain 


NOV2a Match 
Region 


Identities/ 
Similarities 
for the Matched 
Region 


Expect 
Value 


Neur_chan_LBD: 
domain 1 of 1 


63„273 


66/271 (24%) 
162/271 (60%) 


2.7e-56 


Cys-protease-3C: 
domain 1 of 1 


363..369 


4/7 (57%) 
6/7 (86%) 


5.2 


Neurchanmemb: 
domain 1 of 1 


280..466 


44/297 (15%) 
164/297 (55%) 


1.2e-60 



Example 3. 

The NOV3 clone was analyzed, and the nucleotide and predicted polypeptide 
sequences are shown in Table 3A. 



Table 3A. NOV3 Sequence Analysis 




SEQ ID NO: 9 


1440 bp 


NOV3a, 

CG585 18-01 DNA Sequence 


GAAGAGATGGTCCTGGCTTTCCAGTTAGTCTCCTTCACCTACATCTGGATCATATTGA 
AACCAAATGTTTGTGCTGCTTCTAACATCAAGATGACACACCAGCGGTGCTCCTCTTC 
AATGAAACAAACCTGGAAACAAGAAACTAGAATGAAGAAAGATGACAGTACCAAAGCG 
CGGCCTCAGAAATATGAGCAACTTCTCCATATAGAGGACAACGATTTCGCAArGAGAC 
CTGGATTTGGAGGTGAGTATTATCCTCTCAAAATTGGGTCTCCAGTGCCAGTAGGTAT 
AGATGTCCATGTTGAAAGCATTGACAGCATTTCAGAGACTAACATGGTAAGTTTCTTC 
ATGGGATATGACTTTACAATGACTTTTTATCTCAGGCATTACTGGAAAGACGAGAGGC 
TCTCCTTTCCTAGCACAGCAAACAAAAGCATGACATTTGATCATAGATTGACCAGAAA 
GATCTGGGTGCCTGATATCTTTTTTGTCCACTCTAAAAGATCCTTCATCCATGATACA 
ACTATGGAGAATATCATGCTGCGCGTACACCCTGATGGAAACGTCCTCCTAAGTCTCA 
GGAGGATAACGGTTTCGGCCATGTGCTTTATGGATTTCAGCAGGTTTCCTCTTGACAC 
TCAAAATTGTTCTCTTGAACTGGAAAGCGCCTACAATGAGGATGACCTAATGCTATAC 
TGGAAACACGGAAACAAGTCCTTAAATACTGAAGAACATATGTCCCTTTCTCAGTTCT 
TCATTGAAGACTTCAGTGCATCTAGTGGATTAGCTTTCTATAGCAGCACAGGTTGGTA 
CAATAGGCTTTTCATCAACTTTGTGCTAAGGAGGCATGTTTTCTTCTTTGTGCTGCAA 
ACCTATTTCCCAGCCATATTGATGGTGATGCTTTCATGGGTTTCATTTTGGATTGACC 
GAAGAGCTGTTCCTGCAAGAGTTTCCCTGGGTGGAATCACCACAGTGCTGACCATGTC 
CACAATCATCACTGCTGTGAGCGCCTCCATGCCCCAGGTGTCCTACCTCAAGGCTGTG 




CTGTGAACTACCTCACCACAGTGGAAGAGCGGAAACAATTCAAGAAGACAGGAAAGGT 
ACAGATTTCTAGGATGTACAATATTGATGCAGTTCAAGCTATGGCCTTTGATGGTTGT 
TACCATGACAGCGAGATTGACATGGACCAGACTTCCCTCTCTCTAAACTCAGAAGACT 
TCATGAGAAGAAAATCGATATGCAGCCCCAGCACCGATTCATCTCGGATAAAGAGAAG 
AAAArCCCTAGGAGGACATGTTGGTAGAATCATTCTGGAAAACAACCATGTCATTGAC 
ACCTATTCTAGGATTTTATTCCCCATTGTGTATATCrTTATTTAATTT 




ORF Start: ATG at 7 


OPvF Stop: TAA at 1435 




SEQ ID NO: 10 


476 aa |MW at 55285 .2kD 


NOV3a, 

CG585 18-01 Protein Sequence 


MVLAFQLVSFTY I WI I LKPNVCAASNIKMTHQRCSSSMKQTWKQETRWKKDDSTKARP 
QKYEQLLHIEDNDFAMRPGFGGEYYPLKIGSPVPVGIDVHVESIDSISETNMVSFFMG 
YDFTMTFYLRHYWKDERLSFPSTANKSMTFDHRLTRKIWVPDIFFVHSKRSFIHDTTM 
ENIMLRVHPDGNVLLSLRRITVSAMCFMDFSRFPLDTQNCSLELESAYNEDDLMLYWK 
HGNKSLNTEEHMSLSQFFIEDFSASSGLAFYSSTGWYNRLFINFVLRRHVFFFVLQTY 



104 



FPAILMVNLSWVSFWIDRRAVPARVSLGGITTVLTMSTIITAVSASMPQVSYLKAVDV 
YLWVSSLFVFLSVIEYAAVNYLTTVEERKQFKKTGKVQISRMYNIDAVQAMAFDGCYH 
DSEIDMDQTSLSLNSEDFMRRKSICSPSTDSSRIKRRKSLGGHVGRIILENNHVIDTY 
SRILFPIVYIFI 



Further analysis of the NOV3a protein yielded the following properties shown in 
Table 3B. 



Table 3B. Protein Sequence Properties NOV3a 


PSort 
analysis: 


0.6850 probability located in endoplasmic reticulum (membrane); 
0.6400 probability located in plasma membrane; 0.4600 probability 
located in Golgi body; 0.2400 probability located in nucleus 


SignalP 
analysis: 


Likely cleavage site between residues 25 and 26 



A search of the NOV3a protein against the Geneseq database, a proprietary 
database that contains sequences published in patents and patent publication, yielded 
several homologous proteins shown in Table 3C. 



Table 3C. Geneseq Results for NOV3a 


Geneseq 
Identifier 


Protein/ Organism/Length 
[Patent #, Date] 


NOV3a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities 
for the 
Matched 
Region 


Expect 
Value 


AAU04467 


Human gamma-amino butyric 
acid (GAB A) receptor protein #1 - 
Homo sapiens, 467 aa. 
[WO200153489-A1, 26-JUL- 
2001] 


1..474 
1..456 


454/475 
(95%) 

454/475 
(95%) 


0.0 


AAU04470 


Human gamma-amino butyric 
acid (GABA) receptor protein #4 - 
Homo sapiens, 420 aa. 
[WO200153489-A1, 26-JUL- 
2001] 


48..474 
1..409 


408/428 
(95%) 

408/428 
(95%) 


0.0 


AAU04468 


Human gamma-amino butyric 
acid (GABA) receptor protein #2 - 
Homo sapiens, 392 aa. 
[WO200153489-A1, 26-JUL- 
2001] 


1..393 
1..377 


370/394 
(93%) 

370/394 
(93%) 


0.0 


AAU04471 


Human gamma-amino butyric 
acid (GABA) receptor protein #5 - 


48..393 
1..330 


324/347 
(93%) 


e-180 
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[WO200153489-A1, 26-JUL- 
2001] 




(93%) 




AAU04469 


Human gamma-amino butyric 
acid (GAB A) receptor protein #3 - 
Homo sapiens, 1 80 aa. 
[WO200153489-A1, 26-JUL- 
2001] 


1..192 
1..177 


176/192 
(91%) 

176/192 
(91%) 


2e-96 



In a BLAST search of public sequence databases, the NOV3a protein was found to 
have homology to the proteins shown in the BLASTP data in Table 3D. 



Table 3D. Public BLASTP Results for NOV3a 


Protein 
Accession 
Number 


Protein/Organism/Length 


NOV3a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities 
for the 
Matched 
Portion 


Expect 
Value 


P50573 


Gamma-aminobutyric-acid receptor 
rho-3 subunit precursor (GABA(A) 
receptor) - Rattus norvegicus (Rat), 
464 aa. 


1..474 
1..453 


383/476 
(80%) 

407/476 
(85%) 


0.0 


Q9YGQ2 


GAMMA-AMINOBUTYRIC- 
ACID RECEPTOR RHO-3 
SUBUNIT - Morone americana 
(White perch), 470 aa. 


1..474 
4..459 


293/485 
(60%) 

363/485 
(74%) 


e-153 


P50572 


Gamma-aminobutyric-acid receptor 
rho-1 subunit precursor (GAB A( A) 
receptor) - Rattus norvegicus (Rat), 
474 aa. 


49..474 
58..463 


270/427 
(63%) 

317/427 
(74%) 


e-144 


P56475 


Gamma-aminobutyric-acid receptor 
rho-1 subunit precursor (GABA(A) 
receptor) - Mus musculus (Mouse), 
474 aa. 


49..474 
58..463 


270/427 
(63%) 

317/427 
(74%) 


e-143 


P24046 


Gamma-aminobutyric-acid receptor 
rho-1 subunit precursor (GABA(A) 
receptor) - Homo sapiens (Human), 
473 aa. 


49..474 
57..462 


268/427 
(62%) 

317/427 
(73%) 


e-143 



PFam analysis predicts that the NOV3a protein contains the domains shown in the 
Table 3E. 
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Table 3E. Domain Analysis of NOV3a 


Pfam Domain 


NOV3a Match Region 


Identities/ 
Similarities 
for the Matched Region 


Expect 
Value 


Neur chan LBD: domain 1 
of 1 


88..282 


70/250 (28%) 
165/250 (66%) 


1.2e- 
54 


Neur chan memb: domain 
1 of 1 


289..475 


44/292(15%) 
141/292 (48%) 


7.6e- 
28 



The NOV4 clone was analyzed, and the nucleotide and predicted polypeptide 
sequences are shown in Table 4A. 



Table 4A. NOV4 Sequence Analysis 




SEQIDNO:ll J 1587 bp 


MOV A a 

CG585 16-01 DNA Sequence 


GAACAGAAATGAATAAAAGTCGCTGGCAGAGTAGAAGACGACATGGGAGAAGAAGCr-A 
CCAGCAGAACCCTTGGTTCAGACTCCGTGATTCTGAAGACAGGTCTGACTCCCGGGCA 
GCACAGCCCGCTCACGATTCCGGCCACGGTGATGACGAGTCTCCGTCAACCTCGTCTG 
GCACAGCTGGGACCTCCTCTGTGCCAGAGCTACCTGGGTTTTACTTTGACCCTGAAAA 
GAAACGCTACTTCCGCTTGCTCCCTGGACATAACAACTGCAACCCCCTGACGAAAGAG 
AGCATCCGGCAGAAGGAGATGGAGAGCAAGAGACTGCGGCTGCTCCAGGAAGAAGACA 
GACGGAAAAAGATTGCCAGGATGGGATTTAATGCATCTTCCATGCTACGAAAAAGCCA 
GCTGGGTTTTCTCAACGTCACCAATTACTGCCATTTAGCCCACGAGCTGCGTCTCAGC 
TGCATGGAGAGGAAAAAGGTCCAGATTCGAAGCATGGATCCCTCCGCCTTGGCAAGCG 
ACCGATTTAACCTCATACTGGCAGATACCAACAGTGACCGGCTCTTCACAGTGAACGA 
TGTTACAGTTGGAGGCTCCAAGTATGGTATCATCAACCTGCAAAGTCTGAAGACCCCT 
ACGCTCAAGGTGTTCATGCCACGAAAACCTCCGATTCTCACCAACCGGAAGGTGAACA 
CTTCGGTGTGCTGGGCCTCGCTGAATCACTTGGATTCCCACATTCTGCTATGCCTCAT 
GGGACTCGCAGAGACTCCAGGCTGTGCCACCCTGCTCCCAGCATCACTGTTCGTCAAT 
AGTCCCCACCCAGGAATAGACCGGCCTGGCATGCTCTGCAGTTTCCGGATCCCTGGGG 
GTGCCTGGTCCTGTGCCTGGTCCCTGAATATCCAAGCAAATAACTGCTTCAGTACAGG 
CTTGTCTCGGCGGGTCCTGTTGACCAACGTGGTGACGGGACACCGGCAGTCCTTTGGG 
ACCAACAGTGATGTCTTGGCCCAGCAGTTTGCTCTCATGGCTCCTCTGCTGTTTAATG 
GCTGCCGCTCTGGGGAAATCTTTGCCATTGATCTGCGTTGTGGAAATCAAGGCAAGGG 
ATGGAAGGCCACCCGCCTGTTTCATGATTCAGCAGTGACCTCTGTGCGGATCCTCCAA 
GATGAGCAATACCTGATGGCTTCAGACATGGCTGGAAAGATCAAGCTGTGGGACCTGA 
GGACCACGAAGTGCGTAAGGCAGTACGAAGGCCACGTGAATGAGTACGCCTACCTGCC 

AGAATCTGGAGCCTCCACGATGCCCGCCTACTGAGAACCATACCCTCCCCGTACCCTG 




GCGCGCCGGGGCTGCrCATGGCTGTCGGGCAGGACCTTTACTGTTACTCCTACAGCTA 
ATTCTGCAGGGCACAGCCCAGAGCCATGTGGATTTGACTTACGGGAGTAAAGCGTAAC 
TTTTTACTGCATCTAATGAGG 




ORF Start: ATG at 9 


ORF Stop: TAA at 1563 




SEQIDNO: 12 


518 aa IMW at 57769.3kD 


NOV4a, 

CG585 16-01 Protein Sequence 


MNKSRWQSRRRHGRESHQQNPWFRLRDSEDRSDSRAAQPAHDSGHGDDESPSTSSGTA 
GTSSVPELPGFYFDPEKKRYFRLLPGHNNCNPLTKESIRQKEMESKRLRLLQEEDRRK 
KIARMGFNASSMLRKSQLGFLNVTNYCHIAHELRLSCMERKKVQIRSMDPSALASDRF 
NL I LADTNSDRLFTVNDVTVGGSKYG 1 1 NLQS LKTPTLKV FMPRKPPI LTNRKVNTSV 
CWASLNHLDSHILLCLMGLAETPGCATLLPASLFVNSPHPGIDRPGMLCSFRIPGGAW 
SCAWSI^IQANNCFSTGLSRRVLLTNVVTGHRQSFGTNSDVLAQQFALMAPLLFNGCR 
SGE I FAI DLRCGNQGKGWKATRLFHDSAVTSVRI LQDEQYLMASDMAGKI KLWDLRTT 
KCVRQYEGHVHEYAYLPLHVHEEEGILVAVGQDCYTRIWSLHDARLLRTIPSPYPASK 
ADIPSVAFSSRLGGSRGRAGAAHGCRAGPLLLLiiQLILQGTAQSHVDLTYGSKA 



Further analysis of the NOV4a protein yielded the following properties shown in 
Table 4B. 

107 



Table 4B. Protein Sequence Properties NOV4a 


Psort 
analysis: 


0.9600 probability located in nucleus; 0.4776 probability located in 
mitochondrial matrix space; 0.3000 probability located in microbody 
(peroxisome); 0.1837 probability located in mitochondrial inner membrane 


SignalP 
analysis: 


No Known Signal Sequence Predicted 



A search of the NOV4a protein against the Geneseq database, a proprietary 
database that contains sequences published in patents and patent publication, yielded 
several homologous proteins shown in Table 4C. 



Table 4C. Geneseq Results for NOV 4a 


Geneseq 
Identifier 


Protein/Organism/Length 
[Patent #, Date] 


NOV4a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Region 


Expect 
Value 


ABB 11 794 


Human secreted protein 
homologue, SEQ ID NO:2164 - 
Homo sapiens, 500 aa. 
[WO200157188-A2, 09-AUG- 
2001] 


1..484 
5..485 


470/484 (97%) 
471/484(97%) 


0.0 


AAM79804 


Human protein SEQ ID NO 3450 
- Homo sapiens, 500 aa. 
[WO200 1571 90-A2, 09-AUG- 
2001] 


1..484 
5..485 


470/484 (97%) 
471/484 (97%) 


0.0 


AAM41122 


Human polypeptide SEQ ID NO 
6053 - Homo sapiens, 500 aa. 
[WO200153312-A1, 26-JUL- 
2001] 


1..484 
5..485 


470/484 (97%) 
471/484 (97%) 


0.0 


AAG67256 


Amino acid sequence of a human 
liver-associated gene - Homo 
sapiens, 489 aa. [WO200109318- 
A1,08-FEB-2001] 


1..484 
1..474 


459/484 (94%) 
462/484 (94%) 


0.0 


AAB94587 


Human protein sequence SEQ ID 
NO: 15389 - Homo sapiens, 489 
aa. [EP1074617-A2, 07-FEB- 
2001] 


1..484 
1..474 


459/484 (94%) 
462/484 (94%) 


0.0 



In a BLAST search of public sequence databases, the NOV4a protein was found to 
have homology to the proteins shown in the BLASTP data in Table 4D. 
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Table 4D. Public BLASTP Results for NOV4a 


Protein 
Accession 
Number 


Protein/Organism/Length 


NOV4a 
Residues/ 
Match 
Residues 


Identities/ 
Similarities 
for the 
Matched 
Portion 


Expect 
Value 


A A HI SQ70 


HVDflTUUTTr AT ^ c n VY\ K 
ri I V\) l tic, 1 j J.I JtvUA 

PROTEIN - Homo sapiens 
(Human), 495 aa. 


L.484 
1..480 


4/U/4S4 (y/%) 
471/484 (97%) 


0.0 


Q96K22 


OVARC 1 00 1 79 1 - Homo sapiens 
(Human), 489 aa. 


1..484 
1..474 


459/484 (94%) 
462/484 (94%) 


0.0 


Q9Y4P5 


PROTEIN - Homo sapiens 
(Human), 430 aa (fragment). 


2..428 


4ZU/4j1 (y / /o) 

421/431 (97%) 


0.0 


Q99LF7 


HYPOTHETICAL 58.1 KDA 
PROTEIN - Mus musculus 
(Mouse), 519 aa. 


1..484 
1..481 


378/485 (77%) 
423/485 (86%) 


0.0 


Q9UFI0 


HYPOTHETICAL 26.0 KDA 
PROTEIN - Homo sapiens 
(Human), 234 aa (fragment). 


269..483 
4..217 


175/215(81%) 
193/215 (89%) 


4e-99 



PFam analysis predicts that the NOV4a protein contains the domains shown in the 
Table 4E. 



Table 4E. Domain Analysis of NOV4a 


Pfam Domain 


NOV4a Match Region 


Identities/ 
Similarities 
for the Matched Region 


Expect Value 


WD40: domain 1 
of 3 


281..316 


2/37 (5%) 
26/37 (70%) 


5.8e+02 


WD40: domain 2 
of 3 


367..402 


10/37(27%) 
27/37 (73%) 


6.1 


WD40: domain 3 
of 3 


408..446 


10/39(26%) 
23/39 (59%) 


13 



Example 5. 



The NOV5 clone was analyzed, and the nucleotide and predicted polypeptide 
sequences are shown in Table 5A. 
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Table 5A. NOV5 Sequence Analysis 




SEQ ID NO: 13 


1081 bp 


NOV5a, 

CG58473-01 DNA Sequence 


AGGATGGCCCAGAAGGAGAACAGTTATCCCTGGCCCTATGGCAAGCAGACGGCTCCAG 
CCGGCCTGAGTACCCTGCTCCCGCGAGTCCTCCCGAGGATCCCCACCGAAGCTGCGCG 
TGAGCTCCCGAGCTGCGCAGACCCACAGCCCGCAGCGGCCCCTGGCCATGAGGTGGTA 
GAGAACAGTTGTGGGAAGCGCAGCATCTTAACGCGGCCCTTCCTGGTCGACGACCTTG 
AGACTGGGCGTCCCCTGGGCAAAGACAAGTTTGTACATGTGTACTTGGCTCGAAAGAA 
GACAAGCCATTTCATCGTGGCCCTCAAGGCCTTCAAGTCTCAGATAGAGGAGGGCGTG 
GAGCACCAGATGCGCAGGCAGATGGAAATCCAGGCCCCCTTTCAGCATCCCAACATAT 
TGAGTCTCTACAACTATTTTTATGACCTGAGAAAAATCTACTGGATTCTAGAGTACGC 
CCCCGCCACCCCTACCCCCGAGGAGCTGTACCAGGAGCTGCGAAAGAGCCGCACCTTT 
GACAAGAAGCCAACAGCCACCATCACGGGGGAGGTGGCAGATGCTCTGATGTACTGCC 
ACGGGAAGAAGGTGACTCCCAGAGACATGAAGCCAGATAATCTACTCTCAGGGCTTGA 
GGGCGAGCTGAAAGTTGCCGACTTCGGCTGCCCTGTGCACGCCCCCTCACTGAGGAGG 
AAGACAAGACAAATGTGTGGCACCCTGGACTACCTGTCCCCAGAGACAATTGAGGGGC 
GCGCGCACACCGAGAAGGTGGATTTGTGGTACATCGGAGCACTCGGCTATGAGCCGCT 
GGTGGGGAACCCCACACACAATGAGGCCTATGGGCGAATCGTCAAGGTGGCCCTAAAA 
TTCCCCCTTCTGTGCCCAGGAGAGCCCCAGGACCrCATCTCCAAGCTGCTTAGGCATA 




TTCTCGGAGGGTTTTGCCTCCCTCTGCCCATCAGTCTGTCCCCTGGTGGTCCCTGACA 
rTCACTCGGGGGCGT ( I 




ORF Start: ATG at 4 


ORF Stop: TAA at 1069 




SEQ ID NO: 14 


355 aa |MW at 40012.7kD 


NOV5a, 

CG58473-01 Protein Sequence 


MAQKENSYPWPYGKQTAPAGLSTLLPRVLPRIPTEAARELPSCADPQPAAAPGHEWE 
NSCGKRSILTRPFLVDDLETGRPLGKDKFVHVYLARKKTSHFIVALKAFKSQIEEGVE 
HOMRRQMEIQAPFQHPNILSLYNYFYDLRKIYWILEYAPATPTPEELYQELRKSRTFD 
KKPTATITGEVADALMYCHGKKVTPRDMKPDNLLSGLEGELKVADFGCPVHAPSLRRK 
TRQMCGTLDYLSPETIEGRAHTEKVDLWYIGALGYEPLVGNPTHNEAYGRIVKVALKF 
PLLCPGEPQDLISKLLRHNPSERLPLAQVSAHPGILAHSRRVLPPSAHQSVPWWSLTF 
TRGRLCL 



Further analysis of the NOV5a protein yielded the following properties shown in 
Table 5B. 



Table 5B. Protein Sequence Properties NOV5a 


Psort 
analysis: 


0.4500 probability located in cytoplasm; 0.3000 probability located in 
microbody (peroxisome); 0.1897 probability located in lysosome (lumen); 
0.1000 probability located in mitochondrial matrix space 


SignalP 
analysis: 


No Known Signal Sequence Predicted 



A search of the NOV5a protein against the Geneseq database, a proprietary 
database that contains sequences published in patents and patent publication, yielded 
several homologous proteins shown in Table 5C. 



Table 5C. Geneseq Results for NOV5a 


Geneseq 
Identifier 


Protein/Organism/Length 
[Patent #, Date] 


NOV5a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Region 


Expect 
Value 


AAG67615 


Amino acid sequence of a human 


1..341 
1..343 


247/349 (70%) 
274/349 (77%) 


e-129 
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[WO200109316-A1, 08-FEB- 
2001] 








AAG67436 


Amino acid sequence of a human 
polypeptide - Homo sapiens, 344 
aa. [WO200109345-A1, 08-FEB- 
2001] 


1..341 
1..343 


247/349 (70%) 
274/349 (77%) 


e-129 


AAY22475 


Human AUR1 protein sequence - 
Homo sapiens, 344 aa. 
[W09937788-A2, 29-JUL-1999] 


1..341 
1..343 


247/349 (70%) 
274/349 (77%) 


e-129 


AAW18083 


Human Aurora- 1 - Homo sapiens, 
344 aa. [WO9722702-A1, 26- 
JUN-1997] 


1..341 
1..343 


247/349 (70%) 
274/349 (77%) 


e-129 


AAY27052 


Human protein kinase (HPKM)-l 
(clone ID 2940) - Homo sapiens, 
347 aa. [W09938981-A2, 05- 
AUG-1999] 


1..341 
1..346 


246/352 (69%) 
274/352 (76%) 


e-127 



5 In a BLAST search of public sequence databases, the NOV5a protein was found to 

J!] have homology to the proteins shown in the BLASTP data in Table 5D. 
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Table 5D. Public BLASTP Results for NOV5a 


Protein 
Accession 
Number 


Protein/Organism/Length 


NOV5a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Portion 


Expect 
Value 


060446 


AURORA-RELATED KINASE 2 
(SERINE/THREONINE KINASE 
12) - Homo sapiens (Human), 344 
aa. 


1..341 
1..343 


247/349 (70%) 
274/349 (77%) 


e-128 


Q96GD4 


UNKNOWN (PROTEIN FOR 
MGC:1 1031) - Homo sapiens 
(Human), 344 aa. 


1..341 
1..343 


247/349 (70%) 
274/349 (77%) 


e-128 


Q96DV5 


UNKNOWN (PROTEIN FOR 
MGC:4243) - Homo sapiens 
(Human), 345 aa. 


1..341 
1..344 


247/350 (70%) 
274/350 (77%) 


e-126 


Q9UQ46 


AIK2 - Homo sapiens (Human), 
343 aa. 


I. .341 
1..342 


245/348 (70%) 
272/348 (77%) 


e-126 


014630 


PROTEIN KINASE - Homo 
sapiens (Human), 347 aa. 


1..341 
1..346 


245/352 (69%) 
272/352 (76%) 


e-125 
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PFam analysis predicts that the NOV5a protein contains the domains shown in the 
Table 5E. 



Table 5E. Domain Analysis of NOV5a 


Pfam Domain 


NOV5a Match Region 


Identities/ 
Similarities 
for the Matched Region 


Expect 
Value 


Pkinase: domain 1 of 
1 


76.325 


81/293 (28%) 
1 84/293 (63%) 


6.5e-36 



Example 6. 

The NOV6 clone was analyzed, and the nucleotide and predicted polypeptide 
sequences are shown in Table 6A. 



Table 6A. NOV6 Sequence Analysis 




SEQ ID NO: 15 


1524 bp 


NOV6a, 

CG58470-01 DNA Sequence 


CACTACGTTTCTGGAATGAGCTTGAAGAAGCCCGACAGGTAGAACTTTATGCAGAGCT 
CCAGGCCATCGACTTTCAGGAACTGAACTTCTTTTTCCAAAAGGCCATTGAAGGATTT 

TACTGGGCAGTGCTGCAGGGAAGCTAGATCAGCTCCAGGCCTGGGAAAGCAAAGTTTT 
CCAGATTTCTGAGAACAAAGTCACAGTTGTTCTAGCTGGTGGGCAGGGGACTAGACTC 




AGATTCAAGCAGAGCATATCCTGAAGCTACAACAGTTAGCTGAAAAATATTATGGCAA 
CAAATGCATTATTCCATATTACGTCATGACCAGCGAGTTCACTCTGGGGCCCACGGCC 
GAGTTCTTCAGGGAGCACAACTTCTTCCACCTGGACCCCGCCAACGTGGTCATGTTTG 
AGCAGCGCCTGCTGCCTGCTGTGACCTTTGATGGCAAGGTTATCCTGGAGCGGAAAGA 




AAGATCCTGGAGGACATGGAGCGCCGGGGAGTGGAGTTTGTGCACGTGTACTGTGTGG 




CGCAGACTGTGGCGCCAAGGTGGTGGAAAAGGCATACCCCGAGGAGCCCGTGGGCGTG 
GTGTGCCAGGTGGACGGTGTCCCCCAGGTGGTGGAGTACAGCGAGATCAGTCCTGAGA 
CCGCACAGCTACGTGTCTCCGACGGGAGCCTGCTGTACAATGCAGGCAACATCTGCAA 
CCACTTCTTCACCCGAGGCTTCCTTAAGGCGGTCACCAGGGAGTTTGAGCCTTTGCTG 
AAGCCACACGTGGCTGTGAAGAAGGTCCCGTATGTGGATGAGGAGGGGAATCTGGTAA 
AGCCGCTAAAACCGAACGGGATAAAGATGGAGAAGTTTGTGTTTGATGTGTTCCGGTT 
TGCTAAGAACTTTGCTGCCTTGGAAGTGCTGCGGGAGGAGGAATTTTCCCCACTGAAG 
AACGCAGAGCCAGCCGACAGGGACAGTCCCCGCACCGCTCGCCAGGCCCTGCTCACCC 
AGCACTACCGGTGGGCTCTGCGGGCCGGGGCCCGCTTCCTGGATGCCCATGGGGCCCG 
GCTCCCAGAGCTGCCCAGCTTGCCCCCAAATGGAGACCCTCCGGCCATCTGTGAGATA 
TCGCCCTTGGTGTCTTACTCTGGAGAGGGTTTAGAAGTGTACCTGCAAGGCCGGGAGT 
TCCAGTCCCCGCTCATCCTGGATGAAGACCAGGCCAGGGAGCTGGTGAAAAATGGTAT 
ATGAACCTGATACCAA 




ORF Start: ATG at 7 


ORF Stop: TGA at 1510 




SEQ ID NO: 16 


501 aa |MWat56461.0kD 


NC)V6a. 

CG5 8470-01 Protein Sequence 


MNTNDLKLRLSKAEQEHPLRFWNELEEARQVELYAELQAIDFQELNFFFQKAIEGFNQ 
SSHQEKVDAGMEPVPREVLGSAAGKLDQLQAWESKVFQISENKVTWLAGGQGTRLVA 
YPKGMYDVGLPSHKTLFQIQAEHILKLQQLAEKYYGNKCI I PYYVMTSEFTLGPTAEF 
FREHNFFHLDPANVVMFEQRLLPAVTFDGKVIIjERKDKVAMAPDGNGGLYCAIjEDHKI 
LEDMERRGVEFVHVYCVDNILVRLADPVFIGFCVLQGADCGAKWEKAYPEEPVGWC 
QVDGVPQWEYSEISPETAQLRVSDGSLLYNAGNICNHFFTRGFLKAVTREFEPLLKP 
HVAVKKVPYVDEEGNLVKPLKPNGI KMEKFVFDVFRFAKNFAALEVLREEEFS PLKNA 
EPADRDSPRTARQALLTQHYRWALRAGARFLDAHGARLPELPSLPPNGDPPAICEISP 
L'. oYSGE/-LE7YLC'j?:Et v-Jt LI Li L C'V AKELVKNG I 



Further analysis of the NOV6a protein yielded the following properties shown in 
Table 6B. 
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Table 6B. Protein Sequence Properties NOV6a 


PSort 
analysis: 


0.4500 probability located in cytoplasm; 0.3490 probability located in 
microbody (peroxisome); 0.1000 probability located in mitochondrial 
matrix space; 0.1000 probability located in lysosome (lumen) 


SignalP 
analysis: 


No Known Signal Sequence Predicted 



A search of the NOV 6a protein against the Geneseq database, a proprietary 
database that contains sequences published in patents and patent publication, yielded 
several homologous proteins shown in Table 6C. 



Table 6C. Geneseq Results for NOV6a 


Geneseq 
Identifier 


Protein/Organism/Length [Patent 
#, Date] 


NOV6a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Region 


Expect 
Value 


AAB56960 


Human prostate cancer antigen 
protein sequence SEQ ID NO: 1 538 - 
Homo sapiens, 524 aa. 
[WO200055174-A1, 21-SEP-2000] 


1..501 
3..524 


353/522 (67%) 
413/522 (78%) 


0.0 


AAG32392 


Arabidopsis thaliana protein 
fragment SEQ ID NO: 39067 - 
Arabidopsis thaliana, 502 aa. 
[EP1033405-A2, 06-SEP-2000] 


9..485 
36..497 


194/489 (39%) 
275/489 (55%) 


3e-84 


AAG40236 


Arabidopsis thaliana protein 
fragment SEQ ID NO: 49896 - 
Arabidopsis thaliana, 477 aa. 
[EP1033405-A2, 06-SEP-2000] 


9..485 
12..472 


193/488(39%) 
272/488 (55%) 


3e-82 


AAG40235 


Arabidopsis thaliana protein 
fragment SEQ ID NO: 49895 - 
Arabidopsis thaliana, 500 aa. 
[EP1033405-A2, 06-SEP-2000] 


9..485 
35..495 


193/488 (39%) 
272/488 (55%) 


3e-82 


AAG40234 


Arabidopsis thaliana protein 
fragment SEQ ID NO: 49894 - 
Arabidopsis thaliana, 505 aa. 
[EP1033405-A2, 06-SEP-2000] 


9..485 
40..500 


193/488 (39%) 
272/488 (55%) 


3e-82 



In a BLAST search of public sequence databases, the NOV6a protein was found to 
have homology to the proteins shown in the BLASTP data in Table 6D. 
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Table 6D. Public BLASTP Results for NOV6a 


Protein 
Accession 
Number 


Protein/Organism/Length 


NOV6a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities 
for the 
Matched 
Portion 


Expect 
Value 


Q96GM2 


UDP-N-ACTEYLGLUCOSAMINE 
PYROPHOSPHORYLASE 1 - Homo 
sapiens (Human), 505 aa. 


1..501 
1..505 


351/505 (69%) 
412/505 (81%) 


0.0 


Ql 6222 


UDP-N-acetylhexosamine 
pyrophosphorylase (Antigen X) 
(AGX) (Sperm- associated antigen 2) 
[Includes: UDP-N- 
acetylgalactosamine 
pyrophosphorylase (EC 2.7.7.-) 
(AGX-1); UDP-N-acetylglucosamine 
pyrophosphorylase (EC 2.7.7.23) 
(AGX-2)] - Homo sapiens (Human), 
522 aa. 


1..501 
1..522 


352/522 (67%) 
412/522 (78%) 


0.0 


Q91YN5 


HYPOTHETICAL 58.6 KDA 
PROTEIN - Mus musculus (Mouse), 
522 aa. 


1..501 
1..522 


342/522 (65%) 
407/522 (77%) 


0.0 


AAH 17547 


HYPOTHETICAL 58.5 KDA 
PROTEIN - Mus musculus (Mouse), 
521 aa. 


L.501 
L.521 


341/521 (65%) 
407/521 (77%) 


0.0 


Q9Y0Z0 


BCDNA:LD24639 PROTEIN - 
Drosophila melanogaster (Fruit fly), 
520 aa. 


6..492 
44..513 


236/491 (48%) 
330/491 (67%) 


e-124 



PFam analysis predicts that the NOV6a protein contains the domains shown in the 
Table 6E. 



Table 6E. Domain Analysis of NOV6a 


Pfam Domain 


NOV6a Match Region 


Identities/ 
Similarities 
for the Matched Region 


Expect Value 


UDPGP: domain 1 
of 1 


40..434 


108/428 (25%) 
324/428 (76%) 


8.4e-lll 



Example 7. 



The NOV7 clone was analyzed, and the nucleotide and predicted polypeptide 
sequences are shown in Table 7A. 
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Table 7A. NOV7 Sequence Analysis 




SEQ ID NO: 17 |46 1 hp 


NOV 7a. 

CG585 93-01 DNA Sequence 


ACGCAGAGATGCAGATCTTTGTGAAGACCCTCACGGGCAAGACCATCACCCTTGAGGT 
CAAGCCCACCGACACCATTGAGAArGTCAAAACCAAAATTCAGGACAAGGAGGGTATC 

TCTCAGGCTACAACATCCAGAAAGAGTCCACCCTAAACCTGGTGCTGCGCCTGCGAGG 
TGGCATTACTGAGCCTTCCCTCCGCCAGCTCGTCCAGAAATACAACTGCGACGAGATG 
ATCTGCTGCAAGTGCTATGCTTGCCTGCACCCCGGTGCTATCAACTGCCACAAGAAGA 
AATGCGGCCACACCAACAACCTGTACCCCAGGAAGAAGGTCAAATAAGGCTCTTCCTT 
CCTTGAAGGGCAGCAGCCTTCTGCCCAGGCCCCATGGCCCTGGGGCCTCAATAAA 




ORF Start: ATG at 9 


ORF Stop: TAA at 393 




SEQ ID NO: 18 


128 aa |MW at 14540.9kD 


NOV7a, 

CG58593-01 Protein Sequence 


MQIFVKTLTGKTITLEVKPTDTIENVKTKIQDKEGIPPDQQRLIFAGKRLEDGHTLSG 
YNI QKESTIJSILVLRLRGGITEPSLRQLVQKYNCDEMI CCKCYACLHPGAINCHKKKCG 
HTNNLYPRKKVK 



Further analysis of the NOV7a protein yielded the following properties shown in 
Table 7B. 



Table 7B. Protein Sequence Properties NOV7a 


PSort 
analysis: 


0.9800 probability located in nucleus; 0.1000 probability located in 
mitochondrial matrix space; 0.1000 probability located in lysosome 
(lumen); 0.0000 probability located in endoplasmic reticulum (membrane) 


SignalP 
analysis: 


No Known Signal Sequence Predicted 



A search of the NOV7a protein against the Geneseq database, a proprietary 
database that contains sequences published in patents and patent publication, yielded 
several homologous proteins shown in Table 7C. 



Table 7C. Geneseq Results for NOV7a 


Geneseq 
Identifier 


Protein/Organism/Length [Patent 
#, Date] 


NOV7a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities 
for the 
Matched 
Region 


Expect 
Value 


AAB52080 


Gene 16 human secreted protein 
homologous amino acid sequence 
#129 - Sus scrofa, 128 aa. 
[WO200061 596-A1 , 1 9-OCT-2000] 


1..128 
1-128 


111/128 (86%) 
118/128 (91%) 


7e-61 


AAG43861 


Arabidopsis thaliana protein fragment 
SEQ ID NO: 54871 - Arabidopsis 
thaliana, 128 aa. [EP1033405-A2, 06- 
SEP-2000] 


1..128 
1..128 


101/128 (78%) 
113/128 (87%) 


9e-55 
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AAvjJOl oo 


Arabidopsis thaliana protein fragment 
SEQ ID NO: 44314 - Arabidopsis 
thaliana, 249 aa. [EP1033405-A2, 06- 
SEP-2000] 


122..249 


1 fil /1 08 f78°/A 

113/128(87%) 




AAG36187 


Arsbidopsis th3li£tn<i protein fragment 
SEQ ID NO: 44313 - Arabidopsis 
thaliana, 264 aa. [EP1033405-A2, 06- 
SEP-2000] 


1..128 
1 37..264 


101/128 (78%) 
113/128 (87%) 


9e-55 


AAG36186 


Arabidopsis thaliana protein fragment 
SEQ ID NO: 44312 - Arabidopsis 
thaliana, 322 aa. [EP1033405-A2, 06- 
SEP-2000] 


1..128 
195..322 


101/128(78%) 
113/128 (87%) 


9e-55 



In a BLAST search of public sequence databases, the NOV7a protein was found to 
have homology to the proteins shown in the BLASTP data in Table 7D. 



Table 7D. Public BLASTP Results for NOV7a 


Protein 

Accession 
Number 


Protein/Organism/Length 


NOV7a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Portion 


Expect 
Value 


Q9BX98 


UBIQUITIN A-52 RESIDUE 
RIBOSOMAL PROTEIN FUSION 
PRODUCT 1 - Homo sapiens 
(Human), 141 aa (fragment). 


1..128 
14..141 


111/128 (86%) 
118/128 (91%) 


3e-60 


Q9UPK7 


UBIQUITIN-52 AMINO ACID 
FUSION PROTEIN - Homo sapiens 
(Human), 128 aa. 


1-128 
1..128 


111/128 (86%) 
118/128 (91%) 


3e-60 


Q9PT09 


UBIQUITIN - Oncorhynchus 
mykiss (Rainbow trout) (Salmo 
gairdneri), 128 aa. 


1..128 
1..128 


110/128 (85%) 
118/128 (91%) 


6e-60 


042388 


UBIQUITIN-RIBOSOMAL 
PROTEIN FUSION PROTEIN - 
Gallus gallus (Chicken), 128 aa. 


1..128 
1..128 


110/128 (85%) 
117/128 (90%) 


7e-60 


Q9XSV1 


UBIQUITIN-RIBOSOMAL 
PROTEIN L40 FUSION PROTEIN 
- Canis familiaris (Dog), 128 aa. 


1..128 
1..128 


110/128 (85%) 
117/128 (90%) 


le-59 



PFam analysis predicts that the NOV7a protein contains the domains shown in the 
Table 7E. 
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Table 7E. Domain Analysis of NOV7a 


Pfam Domain 


NOV7a Match 
Region 


Identities/ 
Similarities 
for the Matched Region 


Expect 
Value 


ubiquitin: domain 1 of 1 


I..74 


54/83 (65%) 
72/83 (87%) 


1.9e-38 


Ribosomal_L40e: 
domain 1 of 1 


77..128 


30/52 (58%) 
42/52 (81%) 


7.3e-20 



Example 8. 

The NOV8 clone was analyzed, and the nucleotide and predicted polypeptide 
sequences are shown in Table 8A. 



Table 8A. NOV8 Sequence Analysis 




SEQIDNO:19 |2296bp 


NOV8a, 

CG57871-01 DNA Sequence 


CGGCGGCGGCGGCAGTAGAAATGATGGAAGAATTGCATAGCCTGGACCCACGACGGCA 
GAAATTATTGGAGGCCAGGTTTACTGGAGTAGGTGTTAGTAAGGGACCACTTAATAGT 
GAGTCTTCCAACCAGAGCTTGTGCAGCGTCGGATCCTTGAGTGATAAAGAAGTAGAGA 
CTCCCAAGAAAAAGCAGAATGACCAGCGAAATCGGAAAAGAAAAGCTGAACCATATGA 
AAGTAGCCAAGGGAAAGGCACTCCTAGGGGACATAAAATTAGTGATTACTTTGAGTTT 
GCTGGGGGAAGCGGGCCGGGAACCAGCCCTGGCAGAAGTGTTCCACCAGTTGCACGAT 
CCTCACTGCAACATTCTTTATCCAATCCCTTACCGCGACGAGTAGAACAGCCCCTCTA 
TGGTTTAGATGGCAGTGCTGCAAAGGAGGCAACGGAGGAGCAGTCTGCTCTGCCAACC 
CTCATGTCAGTGATGCTAGCAAAACCTCGGCTTGACACAGAGCAGCTGGCGCAAAGGG 
GAGCTGGCCTCTGCTTCACTTTTGTTTCAGCTCAGCAAAACAGTCCCTCATCTACGGG 
ATCTGGCAACACAGAGCATTCCTGCAGCTCCCAAAAACAGATCTCCATCCAGCACAGA 
CAGACCCAGTCCGACCTCACAATAGAAAAAATATCrGCACTAGAAAACAGTAAGAATT 
CrGACTTAGAGAAGAAGGAGGGAAGAATAGATGATrTATTAAGAGCCATCTGTGATTT 
GAGACGGCAGATTGATGAACAGCAAAAGATGCTAGAGAAATACAAGGAACGATTAAAT 
AGATGTGTGACAATGAGCAAGAAACTCCTTATAGAAAAGTCAAAACAAGAGAAGATGG 
CGTGTAGAGATAAGAGCATGCAAGACCGCTTGAGACTGGGCCACTTTACTACGTCTGA 
CCACGGAGCCAAATTTACTGAGCAGTGGACAGATGGTTATGCTTTTCAGAATCTTATC 
AAGCAACAGGAAAGGATAAATTCACAGAGGGAAGAGATAGAAAGACAACGGAAAATGT 
TAGCAAAGCGGAAACCTCCTGCCATGGGTCAGGCCCCTCCTGCAACCAATGAGCAGAA 
ACAGTGGAAAAGCAAGACCAATGGAGCTGAAAATGAAACGTTAACGTTAAAAGAATAC 
CATGAACAAGAAGAAATCTTCAAACTCAGATTAGGTCATCTTAAAAAGGAGGAAGCAG 
AGATCCAGGCAGAGCTGGAGAGGCTAGAAAGGGTTAGAAAACTACATATCAGGGAAGT 
AAAAAGGATACATAATGAAGATAATTCACAATTTAAATATCATCCAACGCTAAATGAC 




TTGATCTAACAGAGCAAAGATACGTAGCTGTGAAAATTCACCAGTTAAATAAAAACTG 
GAGAGATGAGAAAAAGGAGAATTACCACAAGCATGCATGTAGGGAATACCGGATTCAT 
AAAGAGCTGGACCATCCCAGAATAGTTAAGCTGTATGATTACTTTTCACTGGATACTG 
ACTCGTTTTGTACAGTATTAGAATACTGTGAGGGAAATGATCTGGACTTCTACCTGAA 
ACAGCACAAATTAATGTCAGAGAAAGAGGCCCGGTCCATTATCATGCAGATTGTGAAT 
GCTTTAAAGTACTTAAATGAAATAAAACCTCCCATCATACACTATGACCTCAAACCAG 
GTAATATTCTTTTAGAAAATGGTACAGCGTGTGGAGAGATAAAAATTACAGATTTTGG 
TCTTTCGAAGATCArGGATGATGATAGCTACAATTCAGTGGATGGCATGGAGCTAACA 




GTGTCTTTATGGAAGGAAGCCTTTTGGCCATAACCAGTCTCAGCAAGACATCCTACAA 
GAGAATACGATTCTTAAAGCTACTGAAGTGCAGTTCCCGCCAAAGCCGGTAGTAACAC 
CTGAAGCAAAGGCGTTGATTCGACGATGCTTGGCCTACCGAAAGGAGGACCGCATTGA 
TGTCCAGCAGCTGGCCTGTGATCCCTACTTGTTGCCTCACATCCGAAAGTCAGTCTCT 
ACGAGTAGCCCTGCTGGAGCTGCTATTGCATCAACCTCTGGGGCGTCCAATAACAGTT 
CTTCTAATTGAGACTGACTCCAAGGCCACAAACT 




ORF Start: ATG at 24 


ORF Stop: TGA at 2271 




SEQ ID NO: 20 


749 aa]MWat85415.8kD 


NOV8a, 

CG57871-01 Protein Sequence 


MEELHSLDPRRQKLLEARFTGVGVSKGPLNSESSNQSLCSVGSLSDKEVETPKKKQND 
QRNRKRKAEPYESSQGKGTPRGHKISDYFEFAGGSGPGTSPGRSVPPVARSSLQHSLS 
NPLPRRVEQPLYGLDGSAAKEATEEQSALPTLMSVMLAKPRLDTEQLAQRGAGLCFTF 
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VSAQQNSPSSTGSGKTEHSCSSQKQISIQHRQTQSDLTIEKISALENSKNSDLEKKEG 
RIDDLLRAICDLRRQIDEQQKMLEKYKERLNRCVTMSKKLLIEKSKQEFCMACRDKSMQ 
DRLRLGHFTTSDHGAKFTEQWTDGYAFQNLI KQQER I NSQREEI ERQRKMLAKRKPPA 
MGQAPPATNEQKQWKSKTKGAENETLTLKEYHEQEE I FKLRLGHLKKEEAEI QAELER 
LERVRKLHIREVKRIHNEDNSQFKYHPTLNDRYLLLHLLGRGGFSEVYKAFDLTEQRY 
VAVKIHQLNKKWRDEKKENYHKHACREYRIHKELDHPRIVKLYDYFSLDTDSFCTVLE 
YCEGNDLDFYLKQHKLMSEKEARSI IMQIVNALKYLNEIKPPI IHYDLKPGNILLENG 
TACGEIKITDFGLSKIMDDDSYNSVDGMELTSQGAGTYWYLPPECFWGKEPPKISNK 
VDVWSVGVIFYQCLYGRKPFGHNQSQQDI LQENT I LKATEVQFP PKP WTPEAKALI R 
RCIAYRKEDRIDVQQLACDPYLLPHIRKSVSTSSPAGAAIASTSGASNNSSSN 



Further analysis of the NOV8a protein yielded the following properties shown in 
Table 8B. 



Table 8B. Protein Sequence Properties NOV8a 


PSort 
analysis: 


0.9600 probability located in nucleus; 0.1000 probability located in 
mitochondrial matrix space; 0.1000 probability located in lysosome (lumen); 
0.0000 probability located in endoplasmic reticulum (membrane) 


SignalP 
analysis: 


No Known Signal Sequence Predicted 



A search of the NOV8a protein against the Geneseq database, a proprietary 
database that contains sequences published in patents and patent publication, yielded 
several homologous proteins shown in Table 8C. 



* Table 8C. Geneseq Results for NO V8a 


Geneseq 
Identifier 


Protein/Organism/Length 
[Patent #, Date] 


NOV8a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Region 


Expect 
Value 


AAM39278 


Human polypeptide SEQ ID NO 
2423 - Homo sapiens, 718 aa. 
[WO200153312-A1, 26-JUL- 
2001] 


1..749 
2..718 


703/749 (93%) 
707/749 (93%) 


0.0 


AAM41064 


Human polypeptide SEQ ID NO 
5995 - Homo sapiens, 809 aa. 
[WO200153312-A1, 26-JUL- 
2001] 


1..749 
92..809 


695/750 (92%) 
701/750 (92%) 


0.0 


AAR76062 


Protein kinase PKU beta - Homo 
sapiens, 540 aa. [JP07132093-A, 
23-MAY-1995] 


210..749 
1..540 


525/540 (97%) 
527/540 (97%) 


0.0 


AAR76061 


Protein kinase PKU alpha - Homo 
sapiens, 787 aa. [JP07132093-A, 
23-MAY-1995] 


1 ..744 
49..783 


537/794 (67%) 
592/794 (73%) 


0.0 


ABB20910 


Protein #2909 encoded by probe 


346.J49 
1..404 


404/404 (100%) 
404/404(100%) 


0.0 
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expression - Homo sapiens, 404 
aa. [WO200157274-A2, 09-AUG- 
2001] 









In a BLAST search of public sequence databases, the NOV8a protein was found to 
have homology to the proteins shown in the BLASTP data in Table 8D. 



Table 8D. Public BLASTP Results for NOV8a 


Protein 
Accession 
Number 


Protein/Organism/Length 


NOV8a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Portion 


Expect 
Value 


Q9UKI7 


TOUSLED-LIKE KINASE 2 - 
Homo sapiens (Human), 749 aa. 


1..749 
1..749 


731/749 (97%) 
736/749 (97%) 


0.0 


055047 


TOUSLED-LIKE KINASE - Mus 
musculus (Mouse), 717 aa. 


1..749 
1..717 


699/749 (93%) 
705/749 (93%) 


0.0 


Q9Y4F7 


PKU-ALPHA - Homo sapiens 
(Human), 719 aa (fragment). 


1..749 
3..719 


700/749 (93%) 
705/749 (93%) 


0.0 


Q9D5Y5 


TOUSLED-LIKE KINASE 2 
(ARABIDOPSIS) - Mus musculus 
(Mouse), 696 aa. 


1..656 
1..656 


629/656 (95%) 
640/656 (96%) 


0.0 


Q90ZY7 


PKU-ALPHA PROTEIN KINASE 
- Brachydanio rerio (Zebrafish) 
(Zebra danio), 697 aa. 


1..749 
2..697 


580/753 (77%) 
626/753 (83%) 


0.0 



PFam analysis predicts that the NOV8a protein contains the domains shown in the 
Table 8E. 



Table 8E. Domain Analysis of NOV8a 


Pfam Domain 


NOV8a Match Region 


Identities/ 
Similarities 
for the Matched Region 


Expect Value 


A2M: domain 1 of 1 


501..523 


10/23 (43%) 
20/23 (87%) 


4.6 


Pkinase: domain 1 of 
1 


439..718 


96/316(30%) 
213/316(67%) 


5.4e-70 
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Example 9. 

The NOV9 clone was analyzed, and the nucleotide and predicted polypeptide 
sequences are shown in Table 9A. 



Table 9A. NOV9 Sequence Analysis 




SEQIDNO:21 


2060 bp 


NOV9a, 

CG58590-01 DNA Sequence 


GTTTTCATAGATAACCATGACAACATCCCATATGAATGGGCATGTTACAGAGGAATCA 


GACAGCGAAGTAAAAAATGTTGATCTTGCATCACCAGAGGAACATCAGAAGCACCGAG 
AGATGGCTGTTGACTGCCCTGGAGATTTGGGCACCAGGATGATGCCAATACGTCGAAG 
TGCACAGTTGGAGCGTATTCGGCAACAACAGGAGGACATGAGGCGTAGGAGAGAGGAA 
GAAGGGAAAAAGCAAGAACTTGACCTTAATTCTTCCATGAGACTTAAGAAACTAGCCC 
AAATTCCTCCAAAGACCGGAATAGATAACCCTATGTTTGATACAGAGGAAGGAATTGT 
CTTAGAAAGTCCTCATTATGCTGTGAAAATATTAGAAATAGAAGACTTGTTTTCTTCA 
CTTAAACATATCCAACATACTTTGGTAGATTCTCAGAGCCAGGAGGATATTTCACTGC 
TTTTACAACTTGTTCAAAATAAGGATTTCCAGAATGCATTTAAGATACACAATGCCAT 
CACAGTACACATGAACAAGGCCAGTCCTCCATTTCCTCTTATCTCCAACGCACAAGAT 
CTTGCTCAAGAGGTACAAACTGTTTTGAAGCCAGTTCATCATAAGGAAGGACAAGAAC 
TAACTGCTTTGCTGAATACTCCACATATTCAGGCACTTTTACTGGCCCACGATAAGGT 
TGCTGAGCAGGAAATGCAGCTAGAGCCCATTACAGATGAGAGAGTrTATGAAAGTATT 
GGCCAGTATGGAGGAGAAACTGTAAAAATAGTTCGTATAGAAAAGGCTCGTGATATTC 
CGTTGGGTGCTACAGTTCGTAATGAAATGGACTCTGTCATCATTAGCCGGATAGTAAA 
AGGGGGTGCTGCAGAGAAAAGTGGTCTGTTGCATGAAGGAGATGAAGTrCTAGAGATr 

TGCATGGTACTTTGACTTTTGTCCTGATTCCCAGTCAACAGATCAAGCCGCCTCCTGC 
CAAGGAAACAGTAATCCATGTAAAAGCTCATTTTGACTATGACCCCTCAGATGACCCT 
TATGTTCCATGTCGAGAGTTAGGTCTGTCTTTTCAAAAAGGTGATATACTTCATGTGA 
TCAGTCAAGAAGATCCAAACTGGTGGCAGGCCTACAGGGAAGGGGACGAAGATAATCA 
ACCTCTAGCCGGGCTTGTTCCAGGGAAAAGCTTTCAGCAGCAAAGGGAAGCCATGAAA 
CAAACCATAGAAGAAGATAAGGAGCCAGAAAAATCAGGTAAACTGTGGTGTGCAAAGA 
AGAATAAAAAGAAGAGGAAAAAGGTTTTATATAATGCCAATAAAAATGATGATTATGA 
CAACGAGGAGATCTTAACCTATGAGGAAATGTCACTTTATCATCAGCCAGCAAATAGG 
AAGAGACCTATCATCTTGATTGGTCCACAGAACTGTGGCCAGAATGAATTGCGTCAGA 
GGCTCATGAACAAAGAAAAGGACCGCTTTGCATCTGCAGTTCCTCGTACAACCCGGAG 
TAGGCGAGACCAAGAAGTAGCCGGTAGAGATTACCACTTTGTTTCGCGGCAAGCATTC 
GAGGCAGACATAGCAGCTGGAAAGTTCATTGAGCATGGTGAATTTGAGAAGAATTTGT 
ATGGAACTAGCATAGATTCTGTACGGCAAGTGATCAACTCTGGCAAAATATGTCTTTT 
AAGTCTTCGTACACAGTCATTGAAGACTCTCCGGAArTCAGATTTGAAACCATATATT 




AGAATCCAAAGCCTGAAGAGTTGAGAGAAATCATTGAGAAGACAAGAGAGATGGAGCA 
GAACAATGGCCACTACTTTGATACGGCAATTGTGAATTCCGATCTTGATAAAGCCTAT 
CAGGAATTGCTTAGGTTAATTAACAAACTTGATACTGAACCTCAGTGGGTACCATCCA 
CTTGGCTGAGGTGAAAGAAACATCCATTCT 




ORFStart:ATGatl7 


ORF Stop: TGA at 2042 




SEQ ID NO: 22 


675 aa |MW at 7731 1.8kD 


NOV9a, 

CG58590-01 Protein Sequence 


MTTSHMNGHVTEESDSEVKNVDLASPEEHQKHREMAVDCPGDLGTRMMPIRRSAQLER 
IRQQQEDMRRRREEEGKKQELDI.NSSMRLKKLAQIPPKTGIDNPMFDTEEGIVLESPH 
YAVKILE I EDLFSSLKH IQHTLVDSQSQEDI SLLLQLVQNKDFQNAFKIHNAITVHMN 
KASPPFPLISNAQDLAQEVQTVLKPVHHKEGQELTALLNTPHIQALLLAHDKVAEQEM 
QLEPITDERVYES IGQYGGETVKIVRI EKARDI PLGATVRNEMDSVI ISRIVKGGAAE 
KSGLLHEGDEVLEINGIEIRGKDVNEVFDLLSDMHGTLTFVtlPSQQIKPPPAKETVI 
HVKAHFDYDPSDDPYVPCRELGLSFQKGDILHVISQEDPNWWQAYREGDEDNQPLAGL 
VPGKSFQQQREAMKQTIEEDKEPEKSGKLWCAKKNKKKRKKVLYNANKNDDYDNEEIL 
TYEEMSLYHQPANRKRPI I LIGPQNCGQNELRQRLMNKEKDRFASAVPRTTRSRRDQE 
VAGRDYHFVSRQAFEADIAAGKFIEHGEFEKNLYGTSIDSVRQVINSGKICLLSLRTQ 
SLKTLRNSDLKPYIIFIAPPSQERLRALLAKEGKNPKPEELREIIEKTREMEQNNGHY 
FDTAIVNSDI*DKAYQELLRLINKLDTEPQWVPSTWLR 




SEQ ID NO: 23 


2030 bp 


NOV9b, 

CG58590-02 DNA Sequence 


CCATGACAACATCCCATATGAATGGGCATGTTACAGAGGAATCAGACAGCGAAGTAAA 
AAATGTTGATCTTGCATCACCAGAGGAACATCAGAAGCACCGAGAGATGGCTGTTGAC 
TGCCCTGGAGATTTGGGCACCAGGATGATGCCAATACGTCGAAGTGCACAGTTGGAGC 
GTATTCGGCAACAACAGGAGGACATGAGGCGTAGGAGAGAGGAAGAAGGGAAAAAGCA 
AGAACTTGACCTTAATTCTTCCATGAGACTTAAGAAACTAGCCCAAATTCCTCCAAAG 
ACCGGAATAGATAACCCTATGTTTGATACAGAGGAAGGAATTGTCTTAGAAAGTCCTC 
ATTATGCTGTGAAAATATTAGAAATAGAAGACTTGTTTTCTTCACTTAAACATATCCA 
ACATACTTTGGTAGATTCTCAGAGCCAGGAGGATATTTCACTGCTTTTACAACTTGTT 
CAAAATAAGGATTTCCAGAATGCATTTAAGATACACAATGCCATCACAGTACATATGA 
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ACAAGGCCAGTCCTCCATTTCCTCTTATCTCCAACGCACAAGATCTTGCTCAAGAGGT 
ACAAACTGTTTTGAAGCCAGTTCATCATAAGGAAGGACAAGAACTAACTGCTTTGCTG 
AATACTCCACATATTCAGGCACTTTTACTGGCCCACGATAAGGTTGCTGAGCAGGAAA 
TGCAGCTAGAGCCCATTACAGATGAGAGAGTTTATGAAAGTATTGGCCAGTATGGAGG 
AGAAACTGTAAAAATAGTTCGTATAGAAAAGGCTCGTGATATTCCGTTGGGTGCTACA 

AGAAAAGTGGTCTGTTGCATGAAGGAGATGAAGTTCTAGAGATTAATGGCATTGAAAT 
TCGGGGGAAAGATGTCAATGAGGTTTTTGACCTGTTGTCTGATATGCATGGTACTTTG 
ACTTTTGTCCTGATTCCCAGTCAACAGATCAAGCCGCCTCCTGCCAAGGAAACAGTAA 
TCCATGTAAAAGCTCATTTTGACTATGACCCCTCAGATGACCCTTATGTTCCATGTCG 
AGAGTTAGGTCTGTCTTTTCAAAAAGGTGATATACTTCATGTGATCAGTCAAGAAGAT 
CCAAACTGGTGGCAGGCCTACAGGGAAGGGGACGAAGATAATCAACCTCTAGCCGGGC 
TTGTTCCAGGGAAAAGCTTTCAGCAGCAAAGGGAAGCCATGAAACAAACCATAGAAGA 
AGATAAGGAGCCAGAAAAATCAGGAAAACTGTGGTGTGCAAAGAAGAATAAAAAGAAG 
AGGAAAAAGGTTTTATATAATGCCAATAAAAATGATGATTATGACAACGAGGAGATCT 
TAACCTATGAGGAAATGTCACTTTATCATCAGCCAGCAAATAGGAAGAGACCTATCAT 
CTTGATTGGTCCACAGAACTGTGGCCAGAATGAATTGCGTCAGAGGCTCATGAACAAA 
GAAAAGGACCGCTTTGCATCTGCAGTTCCTCATACAACCCGGAGTAGGCGAGACCAAG 
AAGTAGCCGGTAGAGATTACCACTTTGTTTCGCGGCAAGCATTCGAGGCAGACATAGC 
AGCTGGAAAGTTCATTGAGCATGGTGAATTTGAGAAGAATTTGTATGGAACTAGCATA 
GATTCTGTACGGCAAGrGATCAACTCTGGCAAAATATGTCTTTTAAGTCTTCGTACAC 
AGTCATTGAAGACTCTCCGGAATTCAGATTTGAAACCATATATTATCTTCATTGCACC 
CCCTTCACAAGAAAGACTTCGGGCATTATTGGCCAAAGAAGGCAAGAATCCAAAGCCT 
GAAGAGTTGAGAGAAATCATTGAGAAGACAAGAGAGATGGAGCAGAACAATGGCCACT 
ACTTTGATACGGCAATTGTGAATTCCGATCTTGATAAAGCCTATCAGGAATTGCTTAG 
GTTAATTAACAAACTTGATACTGAACCTCAGTGGGT.H-' ATC( AC rTG :TGAGGTGA 




ORJF Start: ATG at 3 


ORF Stop: TGA at 2028 




SEQ ID NO: 24 


675 aa |MW at 77292.8kD 


NOV9b, 

CG5 8590-02 Protein Sequence 


MTTSHMNGHVTEESDSEVKNVDLASPEEHQKHREMAVDCPGDLGTRMMPIRRSAQLER 
IRQQQEDMRRRREEEGKKQELDLNSSMRLKKLAQIPPKTGIDNPMFDTEEGIVLESPH 
YAVKILEIEDLFSSLKHIQHTLVDSQSQEDISLLLQLVQNKDFQNAFKIHNAITVHMN 
KASPPFPLISNAQDIAQEVQTVLKPVHHKEGQELTALLIITPHIQALLLAHDKVAEQEM 
QLEP ITDERV YESI GQYGGETVK IVRIEKARDI PLGATVRNEMDSV 1 1 SR I VKGGAAE 
KSGLLHEGDE VLE INGIEI RGKDVNE VFDLLSDMHGTLTFVLI PSQQI KP PPAKETVI 
HVKAHFDYDPSDDPYVPCRELGLSFQKGDILHVISQEDPNWWQAYREGDEDNQPLAGL 
VPGKSFQQQREAMKQTIEEDKEPEKSGKLWCAKKNKKKRKKVLYNANKNDDYDNEEIL 
TYEEMSLYHQPANRKRPI I LIGPQNCGQNELRQRLMNKEKDRFASAVPHTTRSRRDQE 
VAGRDYHFVSRQAFEADIAAGKFIEHGEFEKNLYGTSIDSVRQVINSGKICLLSLRTQ 
SLKTLRNSDLKPYI IFIAPPSQERLRALLAKEGKNPKPEELREI IEKTREMEQNNGHY 
FDTAIVNSDLDKAYQELLRLINKLDTEPQWVPSTWLR 



Sequence comparison of the above protein sequences yields the following sequence 
relationships shown in Table 9B. 



Table 9B. Comparison of NOV9a against NOV9b. 


Protein 
Sequence 


NOV9a Residues/ 
Match Residues 


Identities/ 
Similarities for the Matched Region 


NOV9b 


1..675 
1..675 


636/675 (94%) 
636/675 (94%) 



Further analysis of the NOV9a protein yielded the following properties shown in 
Table 9C. 



Table 9C. Protein Sequence Properties NOV9a 


PSort 
analysis: 


0.7000 probability located in nucleus; 0.1000 probability located in 
mitochondrial matrix space; 0.1000 probability located in lysosome (lumen); 
0.0000 probability located in endoplasmic reticulum (membrane) 
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SignalP No Known Signal Sequence Predicted 
analysis: 



A search of the NOV9a protein against the Geneseq database, a proprietary 
database that contains sequences published in patents and patent publication, yielded 
several homologous proteins shown in Table 9D. 



Table 9D. Geneseq Results for NOV9a 


Geneseq 
Identifier 


Protein/Organism/Length [Patent 
#, Date] 


NOV9a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Region 


Expect 
Value 


AAB94180 


Human protein sequence SEQ ID 
NO: 14494 - Homo sapiens, 503 aa. 
[EP1074617-A2, 07-FEB-2001] 


1 73..675 
1..503 


501/503 (99%) 
501/503 (99%) 


0.0 


AAB41921 


Human ORFX ORF1685 
polypeptide sequence SEQ ID 
NO:3370 - Homo sapiens, 269 aa. 
[WO200058473-A2, 05-OCT-2000] 


406..675 
1..269 


261/270 (96%) 
264/270 (97%) 


e-147 


AAU07123 


Human novel human protein, NHP 
#23 - Homo sapiens, 576 aa. 
[WO200161016-A2, 23-AUG-2001] 


143..674 
31. .574 


224/564 (39%) 
339/564 (59%) 


e-109 


AAU07119 


Human novel human protein, NHP 
#19 - Homo sapiens, 560 aa. 
[WO200161016-A2, 23-AUG-2001] 


143..654 
31..554 


213/544 (39%) 
327/544 (59%) 


e-102 


AAU07115 


Human novel human protein, NHP 
#15 - Homo sapiens, 520 aa. 
[WO200161016-A2, 23-AUG-2001] 


143. .606 
31. .495 


196/481 (40%) 
300/481 (61%) 


5e-97 



In a BLAST search of public sequence databases, the NOV9a protein was found to 
have homology to the proteins shown in the BLASTP data in Table 9E. 



Table 9E. Public BLASTP Results for NOV9a 


Protein 
Accession 
Number 


Protein/Organism/Length 


NOV9a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities 
for the 
Matched 
Portion 


Expect 
Value 


Q9JLB2 


PALS1 - Mus musculus (Mouse), 
675 aa. 


1..675 
1..675 


652/675 (96%) 
665/675 (97%) 


0.0 
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Q9H9Q0 


CDNA FL J 1 26 1 5 FI S CLONE 
NT2RM4001629, WEAKLY 
SIMILAR TO MAGUK P55 
SUBFAMILY MEMBER 3 - Homo 
sapiens (Human), 503 aa. 


173 675 
1..503 


501/503 (99%) 
501/503 (99%) 


0.0 


AAL40935 


STARDUST PROTEIN MAGUK 1 
ISOFORM - Drosophila 
melanogaster (Fruit fly), 1289 aa. 


252..674 
829..1282 


252/460 (54%) 
327/460 (70%) 


e-140 


Q9W3H6 


CGI 617 PROTEIN - Drosophila 
melanogaster (Fruit fly), 794 aa. 


252..674 
294..787 


252/500 (50%) 
327/500 (65%) 


e-132 


Q9W7F1 


P5 5 -RELATED MAGUK 
PROTEIN DLG3 - Brachydanio 
rerio (Zebrafish) (Zebra danio), 576 
aa. 


142..673 
30..573 


209/556 (37%) 
335/556 (59%) 


e-105 



PFam analysis predicts that the NOV9a protein contains the domains shown in the 
Table 9F. 



Table 9F. Domain Analysis of NOV9a 


Pfam Domain 


NOV9a Match 
Region 


Identities/ 
Similarities 
for the Matched 
Region 


Expect 
Value 


L27: domain 1 of 1 


1 86..238 


19/56 (34%) 
39/56 (70%) 


0.049 


PDZ: domain 1 of 1 


256.335 


21/83 (25%) 
58/83 (70%) 


9.7e-12 


SH3: domain 1 of 1 


348..415 


19/68 (28%) 
46/68 (68%) 


0.026 


Guanylate kin: domain 1 
of 1 


515..624 


54/113(48%) 
87/113 (77%) 


6.2e-38 


Peptidase S15: domain I 
of 1 


642..658 


6/17(35%) 
13/17(76%) 


8.2 


Caulimo mov: domain 1 
of 1 


420..673 


59/335 (18%) 
156/335 (47%) 


6.1 



Example 10. 



The NOV 10 clone was analyzed, and the nucleotide and predicted polypeptide 
sequences are shown in Table 1 OA. 
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Table 10A. NOV10 Sequence Analysis 




SEQ ID NO: 25 1 576 bp 


NOVlOa, 

CG58572-01 DNA Sequence 


ACCTTACTAGAAAAATGAAACCTGATGAAACTCCTATGTTTGACCCAAGTCTACTCAA 
AGAAGTGGACTGGAGTCAGAATACAGCTACATTTTCTCCAGCCATTTCCCCAACACAT 


TTAAGGTATTGGGTCAGCTAACAGAGACTGGAGTTGTCAGCCCTGAACAATrTATGGA 
ATCTTTTGAGCATATGAAGAAATCTGGGGATTATTATGTTACAGTTGTAGAAGATGTG 
ACTCTAGGACAGATTGTTGCTACGGCAACTCTGATTATAGAACATAAATTCATCCATT 
CCTGTGCTAAGAGAGGAAGAGTAGAAGATGTTGTTGTTAGTGATGAATGCAGAGGAAA 
GCAGCTTGGCAAATTGTTATTATCAACCCTTACTTTGCTAAGCAAGAAACTGAACTGT 




ATACTGTATCTGAAGAAAACTACATGTGTCGGAGGTTTCTAAAGTAAAAATCTT 




ORF Start: ATG at 15 


ORF Stop: TAA at 567 




SEQ ID NO: 26 


184aa |MW at 20749.9kD 


NOV 10a, 

CG5 8572-01 Protein Sequence 


MKPDETPMFDPSLLKEVDWSQOTATFSPAISPTHPGEGLVLRPLCTADLNRGFFKVLG 
QLTETGWSPEQFMESFEHMKKSGDYYVTWEDVTIX3QIVATATLIIEHKFIHSCAKR 
GRVEDVWSDECRGKQLGKLLLSTLTLLSKKLNCYKITLECLPQNVGFYKKFGYTVSE 
ENYMCRRFLK 




SEQ ID NO: 27 [560 bp 


NOV 10b, 

v_.vj-)oj /z-uz LJrytx kSequeiice 


ATGAAACCTGATGAAACTCCTATGTTTGACCCAAGTCTACTCAAAGAAGTGGACTGGA 
GTCAGAATACAGCTACATTTTCTCCAGCCATTTCCCCAACACATCCTGGAGAAGGCTT 
GGTTTTGGGGCCTCTTTGTACTGCTGACTTAAATAGAGGTTTTTTTAAGGTATTGGGT 
CAGCTAACAGAGACTGGAGTTGTCAGCCCTGAACAATTTATGAAATCTTTTGAGCATA 

TGTTGCTACGGCAACTCTGATTATAGAACATAAATTCATCCATTCCTGTGCTAAGAGA 
GGAAGAGTAGAAGATGTTGTTGTTAGTGATGAATGCAGAGGAAAGCAGCTTGGCAAAT 
TGTTATTATCAACCCTTACTTTGCTAAGCAAGAAACTGAACTGTTACAAGATTACCCT 
TGAATGTCTACCACAAAATGTTGGTTTCTATAAAAAGTTTGGATATACTGTATCTGAA 
GAAAACTACATGTGTCGGAGGTTTCTAAAGTAAAAATC 




ORF Start: ATG at 1 


ORF Stop: TAA at 553 




SEQ ID NO: 28 


184 aa |MW at 20649.8kD 


NOVlOb, 

CG58572-02 Protein Sequence 


MKPDETPMFDPSLLKEVDWSQNTATFSPAISPTHPGEGLVLGPLCTADLNRGFFKVLG 
QLTETGWSPEQFMKSFEHMKKSGDYYVTWEDVTLGQIVATATLIIEHKFIHSCAKR 




ENYMCRRFLK 



Sequence comparison of the above protein sequences yields the following sequence 
relationships shown in Table 10B. 



Table 10B. Comparison of NOVlOa against NO VI Ob. 


Protein 
Sequence 


NOVlOa Residues/ 
Match Residues 


Identities/ 
Similarities for the Matched Region 


NOVlOb 


1..184 
1..184 


163/184 (88%) 
164/184 (88%) 



Further analysis of the NOVlOa protein yielded the following properties shown in 
Table IOC. 



Table IOC. Protein Sequence Properties NOVlOa 


PSort 
analysis: 


0.4500 probability located in cytoplasm; 0.1206 probability located in 
microbody (peroxisome); 0.1000 probability located in mitochondrial 
matrix space; 0.1000 probability located in lysosome (lumen) 
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SignalP 

analysis: 



No Known Signal Sequence Predicted 



A search of the NOV 10a protein against the Geneseq database, a proprietary 
database that contains sequences published in patents and patent publication, yielded 
several homologous proteins shown in Table 10D. 



Table 10D. Geneseq Results for NOVlOa 


Geneseq 
Identifier 


Protein/Organism/Length 
[Patent #, Date] 


NOVlOa 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Region 


Expect 
Value 


AAG67123 


Amino acid sequence of human 
50287 transferase - Homo sapiens, 
184 aa. [WO200164904-A2, 07- 
SEP-2001] 


1..184 
1..184 


183/184 (99%) 
184/184 (99%) 


e-105 


AAB73505 


Human transferase HTFS-12, SEQ 
ID NO: 12 - Homo sapiens, 1 84 aa. 
[WO200132888-A2, 10-MAY- 
2001] 


1..184 
1..184 


183/184 (99%) 
184/184 (99%) 


e-105 


AAB63700 


Human gastric cancer associated 
antigen protein sequence SEQ ID 
NO: 1062 - Homo sapiens, 200 aa. 
[WO200073801-A2, 07-DEC- 
2000] 


1..184 
17..200 


183/184 (99%) 
184/184(99%) 


e-105 


AAU07779 


Human novel transferase protein, 
NHP #22 - Homo sapiens, 1 84 aa. 
[WO200164903-A2, 07-SEP-2001] 


1..184 
1..184 


182/184 (98%) 
183/184 (98%) 


e-104 


AAM79992 


Human protein SEQ ID NO 3638 - 
Homo sapiens, 206 aa. 
[WO200157190-A2, 09-AUG- 
2001] 


1..184 
23..206 


181/184(98%) 
183/184 (99%) 


e-104 



In a BLAST search of public sequence databases, the NOVlOa protein was found to 
have homology to the proteins shown in the BLASTP data in Table 10E. 
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Table 10E. Public BLASTP Results for NOVlOa 


Protein 
Accession 
Number 


Protein/Organism/Length 


NOVlOa 
Residues/ 

Match 
Residues 


Identities/ 
Similarities 
for the 
Matched 
Portion 


Expect 
Value 


Q96EK6 


SIMILAR TO GLUCOSAMINE- 
PHOSPHATE N- 

ACETYLTRANSFERASE - Homo 
sapiens (Human), 184 aa. 


1..184 
1..184 


183/184 (99%) 
184/184 (99%) 


e-104 


Q9JK38 


EMEG32 PROTEIN 
(GLUCOSAMINE-PHOSPHATE N- 
ACETYLTRANSFERASE) - Mus 
musculus (Mouse), 1 84 aa. 


1..184 
1..184 


180/184 (97%) 
182/184 (98%) 


e-102 


Q9VA10 


Probable glucosamine -phosphate N- 
acetyltransferase (EC 2.3.1.4) 
(Phosphoglucosamine transacetylase) 
(Phosphoglucosamine acetylase) - 
Drosophila melanogaster (Fruit fly), 
219 aa. 


4.. 176 
6..179 


84/174 (48%) 
123/174(70%) 


2e-43 


Q17427 


Probable glucosamine-phosphate N- 
acetyltransferase (EC 2.3.1.4) 
(Phosphoglucosamine transacetylase) 
(Phosphoglucosamine acetylase) - 
Caenorhabditis elegans, 165 aa. 


32.. 182 
15..165 


65/152 (42%) 
98/152 (63%) 


le-28 


045811 


T23G1 1.2 PROTEIN - 
Caenorhabditis elegans, 347 aa. 


42.. 184 
201. .340 


63/143 (44%) 
88/143 (61%) 


3e-26 



PFam analysis predicts that the NOV1 0a protein contains the domains shown in the 



Table 10F. Domain Analysis of NOVlOa 


Pfam Domain 


NOVlOa Match 
Region 


Identities/ 
Similarities 
for the Matched 
Region 


Expect 
Value 


Acetyltransf: domain 1 
of 1 


89..171 


22/87 (25%) 
62/87 (71%) 


6.5e-13 



Example 11. 



The NOV1 1 clone was analyzed, and the nucleotide and predicted polypeptide 
sequences are shown in Table 1 1 A. 
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Table 11 A. NOV11 Sequence Analysis 




SEQIDNO:29 1 709 bp 


NOV 11 a, 

CG5 85 64-01 DNA Sequence 


CCCGCGGGCCAGCACCATGGAGGACGTGAAGCTGGAGTTCCCTTCCCTTCCACAGTGC 
AAGGAAGACGCCGAGGAGTGGACCTACCCTATGAGACGAGAGATGCAGGAAATTTTAC 
CTGGATTGTTCTTAGGCCCATATTCATCTGCTATGAAAAGCAAGCTACCTGTACTACA 
GAAACATGGAATAACCCATATAATATGCATACGACAAAATATTGAAGCAAACTTTATT 
AAACCAAACTTTCAGCAGTTATTTAGGTATTTAGTCCTGGATATTGCAGATAATCCAG 
TTGAAAATATAATACGTTTTTTCCCTATGACTAAGGAATTTATTGATGGGAGCTTACA 
AATGGGAGGTAAAGTTCTTGTGCATGGAAATGCAGGGATCTCCAGAAGTGCAGCCTTT 
GTTATTGCATACATTATGGAAACATTTGGAATGAAGTACAGGGATGCTTTTGCTTATG 
TTCAAGAAAGAAGATTTTGTATTAATCCTAATGCTGGATTTGTCCATCAACTTCAGGA 

GAAAGGTCATTATCTGTTCATTCTGGTACCACAGGTAGTTTGAAGAGAACACATGAAG 
AAGAGGATGATTTTGGAACCATGCAAGTGGCGACTGCACAGAATGGCTGACTTGAAGA 
G C AACAT CAT AGA 




ORF Start: ATG at 17 


ORF Stop: TGA at 686 




SEQ ID NO: 30 


223 aa |MW at 25492 .2kD 


NOVlla, 

CG58564-01 Protein Sequence 


MEDVKLEFPSLPQCKEDAEEWTYPMRREMQEILPGLFLGPYSSAMKSKLPVLQKHGIT 
HIICIRQNIEANFIKPNFQQLFRYLVLDIADNPVENIIRFFPMTKEFIDGSLQMGGKV 
LVHGNAGISRSAAFVIAYIMETFGMKYRDAFAWQERRFCINPNAGFVHQLQEYEAIY 
LAKLTIQMMSPLQIERSLSVHSGTTGSLKRTHEEEDDFGTMQVATAQNG 




SEQ ID NO: 31 1 724 bp 


NOV lib, 

CUjojo4-Uz JJJNA bequence 


ACTCTCCCACCCCACCCACCAGAATGGCGGGCCAGCACCATGGAGGACGTGAAGCTGG 


ACGAGAGATGCAGGAAATTTTATCTGGATTGTTCTTAGGCCCATATTCATCTGCTATG 
AAAAGCAAGCTACCTGTACTACAGAAACATGGAATAACCCATATAATATGCATACGAC 
AAAATATTGAAGCAAACTTTATTAAACCAAACTTTCAGCAGTTATTTAGATATTTAGT 
CCTGGATATTGCAGATAATCCAGTTGAAAATATAATACGTTTTTTCCCTATGACTAAG 
GAATTTATTGATGGGAGCTTACAAATGGGAGGAAAAGTTCTTGTGCATGGAAATGCAG 
GGATCTCCAGAAGTGCAGCCTTTGTTATTGCATACATTATGGAAACATTTGGAATGAA 
GTACAGAGATGCTTTTGCTTATGTTCAAGAAAGAAGATTTTGTATTAATCCTAATGCT 
GGATTTGTCCATCAACTTCAGGAATATGAAGCCATCTACCTAGCAAAATTAACAATAC 
AGATGATGTCACCACTCCAGATAGAAAGGTCATTATCTGTTCATTCTGGTACCACAGG 
CAGTTTGAAGAGAACACATGAAGAAGAGGATGATTTTGGAACCATGCAAGTGGCGACT 
GCACAGAATGG CTGACTTGAAGAGCAAC 




ORF Start: ATG at 40 


ORF Stop: TGA at 709 




SEQ ID NO: 32 


223 aa IMW at 25482.1kD 


NOV lib, 

CG58564-02 Protein Sequence 


MEDVKLEFPSLPQCKEDAEEWTYPMRREMQEILSGLFLGPYSSAMKSKLPVLQKHGIT 
HI I CIRQNIEANFIKPNFQQLFRYLVLDIADNPVENI I RFFPMTKEFIDGSLQMGGKV 
LVHGNAGISRSAAFVIAYIMETFGMKYRDAFAYVQERRFCINPNAGFVHQL.QEYEAIY 
LAKLTIQMMSPLQIERS] VH< TTGSI RTHEEEDE 5TM \ NG 




SEQ ID NO: 33 1 545 bp 


NOV 11c, 

CG58564-03 DNA Sequence 


ACTCTCCCACCCCACCCACCAGCCCGCGGGCCAGCACCATGGAGGACGTGAAGCTGGA 


GTTCCCTTCCCTTCCACAGTGCAAGGAAGACGCCGAGGAGTGGACCTACCCTATGAGA 
CGAGAGATGCAGGAAATTTTACCTGGATTGTTCTTAGGCCCATATTCATCTGCTATGA 
AAAGCAAGCTACCTGTACTACAGAAACATTTGGAATGAAGTACAGAGATGCTTTTGCT 
TATGTTCAAGAAAGAAGATTTTGTATTAATCCTAATGCTGGATTTGTCCATCAACTTC 


AGGAATATGAAGCCATCTACCTAGCAAAATTAACAATACAGATGATGTCACCACTCCA 


GATAGAAAGGTCATTATCTGTTCATTCTGGTACCACAGGCAGTTTGAAGAGAACACAT 


GAGGAAGAGGATGATTTTGGAACCATGCAAGTGGCGACTGCACAGAATGGCTGACTTG 


AAGAGCAACATCATAGAGTGTGAATTTCTATTTGGGAAGGAGAAAATACAAGAGAAAA 


TTATAATGTAAAATGGTAAAAAA 




ORF Start: ATG at 39 


ORF Stop: TGA at 210 




SEQ ID NO: 34 


57 aa |MW at 6695.7kD 


NOV 11c, 

CG58564-03 Protein Sequence 


MEDVKLEFPSLPQCKEDAEEWTYPMRREMQEILPGLFLGPYSSAMKSKLPVLQKHLE 




SEQ ID NO: 35 j 663 bp 


NOV lid, 

CG58564-04 DNA Sequence 


ACTCTCCCACCCCACCCACCAGCCCGCGGGCCAGCACCATGGAGGACGTGAAGCTGGA 


CGAGAGATGCAGGAAATTTTACCTGGATTGTTCTTAGGCCCATATTCATCTGCTATGA 
AAAGCAAGCTACCTGTACTACAGAAACATGGAATAACCCATATAATATGCATACGACA 
AAATATTGAAGCAAACTTTATTAAACCAAACTTTCAGCAGTTATTTAGACTAAGGAAT 
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TTATTGATGGGAGCTTACAAATGGGAGGAAAAGTTCTTGTGCATGGAAATGCAGGGAT 
CTCCAGAAGTGCAGCCTTTGTTATTGCATACATTATGGAAACATTTGGAATGAAGTAC 
AGAGATGCTTTTGCTTATGTTCAAGAAAGAAGATTTTGTATTAATCCTAATGCTGGAT 


TTGTCCATCAACTTCAGGAATATGAAGCCATCTACCTAGCAAAATTAACAATACAGAT 


GATGTCACCACTCCAGATAGAAAGGTCATTATCTGTTCATTCTGGTACCACAGGCAGT 


TTGAAGAGAACACATGAAGAAGAGGATGATTTTGGAACCATGCAAGTGGCGACTGCAC 


AGAATGGCTGACTTGAAGAGCAACT 




ORF Start: ATG at 39 


ORF Stop: TGA at 399 




SEQIDNO: 36 


120 aa |MW at 14245.6kD 


NOV lid, 

CG58564-04 Protein Sequence 


MEDVKLEFPSLPQCKEDAEEWTYPMRREMQEILPGLFLGPYSSAMKSKLPVLQKHGIT 
HIICIRONIEANFIKPNFQQLFRLRNLLMGAYKWEEKFLCMEMQGSPEVQPLLLHTLW 
KHLE 



Sequence comparison of the above protein sequences yields the following sequence 
relationships shown in Table 1 IB. 



Table 11B. Comparison of NOVlla against NOVllb through NOVlld. 


Protein Sequence 


NOVlla Residues/ 
Match Residues 


Identities/ 
Similarities for the Matched Region 


NOVllb 


1..223 
1..223 


222/223 (99%) 
222/223 (99%) 


NOVllc 


I..55 
1..55 


55/55(100%) 
55/55 (100%) 


NOVlld 


I..81 
1..81 


81/81 (100%) 
81/81 (100%) 



Further analysis of the NOV1 la protein yielded the following properties shown in 
Table 11C. 



Table 11C. Protein Sequence Properties NOVlla 


PSort 
analysis: 


0.4698 probability located in microbody (peroxisome); 0.4500 probability 
located in cytoplasm; 0.1958 probability located in lysosome (lumen); 0.1000 
probability located in mitochondrial matrix space 


SignalP 
analysis: 


No Known Signal Sequence Predicted 



A search of the NOV1 la protein against the Geneseq database, a proprietary 
database that contains sequences published in patents and patent publication, yielded 
several homologous proteins shown in Table 1 ID. 
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Table 11D. Geneseq Results for NO VI la 


Geneseq 
Identifier 


Protein/Organism/Length 
[Patent #, Date] 


NOVlla 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Region 


Expect 
Value 


AAU09017 


Human dual specificity 
phosphatase 38692 - Homo 
sapiens, 223 aa. [WO2001 73059- 
A2, 04-OCT-2001] 


1 ..223 
1..223 


223/223 (100%) 
223/223 (100%) 


e-128 


AAE08552 


Human phosphatase protein - 
Homo sapiens, 223 aa. 
[WO200160992-A2, 23-AUG- 
2001] 


1..223 
1..223 


223/223 (100%) 
223/223 (100%) 


e-128 


AAM41520 


Human polypeptide SEQ ID NO 
645 1 - Homo sapiens, 236 aa. 
[WO200153312-A1, 26-JUL- 
2001] 


1..223 
14..236 


223/223 (100%) 
223/223 (100%) 


e-128 


AAM39734 


Human polypeptide SEQ ID NO 
2879 - Homo sapiens, 223 aa. 
[WO200153312-A1, 26-JUL- 
2001] 


1..223 
1..223 


223/223 (100%) 
223/223(100%) 


e-128 


AAU23521 


Novel human enzyme polypeptide 
#607 - Homo sapiens, 1 90 aa. 
[WO200155301-A2, 02-AUG- 
2001] 


25 ..171 
7..145 


55/147 (37%) 
80/147 (54%) 


le-18 



In a BLAST search of public sequence databases, the NOV1 la protein was found to 
have homology to the proteins shown in the BLASTP data in Table 1 IE. 



Table HE. Public BLASTP Results for NOVlla 


Protein 
Accession 
Number 


Protein/Organism/Length 


NOVlla 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Portion 


Expect 
Value 


CAD10219 


SEQUENCE 4 FROM PATENT 
WO01 73059 - Homo sapiens 
(Human), 223 aa. 


1..223 
1..223 


223/223(100%) 
223/223 (100%) 


e-127 


Q9DCF8 


0610039A20RIK PROTEIN - 
Mus musculus (Mouse), 223 aa. 


1..223 
1..223 


215/223 (96%) 
221/223 (98%) 


e-124 


Q60970 


PROTEIN TYROSINE 
PHOSPHATASE-LIKE - Mus 
musculus (Mouse), 223 aa. 


1..223 
1..223 


214/223 (95%) 
221/223 (98%) 


e-124 
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Q60969 


PROTEIN TYROSINE 
PHOSPHATASE-LIKE - Mus 
musculus (Mouse), 205 aa. 


1.168 
1..168 


163/168 (97%) 
167/168 (99%) 


2e-93 


Q99850 


TYROSINE PHOSPHATASE- 
LIKE PROTEIN HOMOLOG 
HSTYXB - Homo sapiens 
(Human), 66 aa (fragment). 


1 16..181 
1..66 


66/66(100%) 
66/66(100%) 


3e-31 



PFam analysis predicts that the NOV1 1 a protein contains the domains shown in the 
Table 11 F. 



Table 11F. Domain Analysis of NO VI la 


Pfam Domain 


NO VI la Match 
Region 


Identities/ 
Similarities 
for the Matched 
Region 


Expect 
Value 


DSPc: domain 1 of 1 


28.. 173 


64/172 (37%) 
127/172 (74%) 


2.2e-63 


Y_phosphatase: domain 1 
of 1 


35..179 


35/279(13%) 
93/279 (33%) 


1.7 



Example 12. 

The NOV 12 clone was analyzed, and the nucleotide and predicted polypeptide 
sequences are shown in Table 12A. 



Table 12A. NOV12 Sequence Analysis 




SEQIDNO:37 1 3696 bp 


NOV12a, 

CG578 19-01 DNA Sequence 


GTGTAAAAATACTGTCCATTTAATGTTTTCTGGGACTTTAGGTAAGAATATGAAAACT 
CAACCACCCTTGAGCAGGATGAACCGGGAGGAATTGGAGGACAGTTTCTTTCGACTTC 
GCGAAGATCACATGTTGGTGAAGGAGCTTTCTTGGAAGCAACAGGATGAGATCAAAAG 
GCTGAGGACCACCTTGCTGCGGTTGACCGCTGCTGGCCGGGACCTGCGGGTCGCGGAG 
GAGGCGGCGCCGCTCTCGGAGACCGCAAGGCGCGGGCAGAAGGCGGGATGGCGGCAGC 
GCCTCTCCATGCACCAGCGCCCCCAGATGCACCGACTGCAAGGGCATTTCCACTGCGT 
CGGCCCTGCCAGCCCCCGCCGCGCCCAGCCTCGCGTCCAAGTGGGACACAGACAGCTC 
CACACAGCCGGTGCACCGGTGCCGGAGAAACCCAAGAGGGGTAGGGACAGGCTGAGCT 
ACACAGCCCCTCCATCGTTTAAGGAGCATGCGACAAATGAAAACAGAGGTGAAGTAGC 
CAGTAAACCCAGTGAACTGGCCCACATCATGGCCAGCAATACCATGCAAGTGGAAGAG 
CCACCCAAGTCTCCTGAGAAAATGTGGCCTAAAGATGAAAATTTTGAACAGAGAAGCT 
CATTGGAGTGTGCTCAGAAGGCTGCAGAGCTTCGGGCTTCCATTAAAGAGAAGGTAGA 
GCTGATTCGACTTAAGAAGCTCTTACATGAAAGAAATGCTTCATTGGTTATGACAAAA 
GCACAATTAACAGAAGTTCAAGAGGTGAGTTGCCATCTTTTGACCCAGAATCAGGGAA 
TCCTGAGTGCAGCCCATGAGGCCCTCCTCAAGCAAGTGAATGAGCTCAGGGCAGAGCT 
GAAGGAAGAAAGCAAGAAGGCTGTGAGCTTGAAGAGCCAACTGGAAGATGTGTCTATC 
TTGCAGATGACTCTGAAGGAGTTTCAGGAGAGAGTTGAAGATTTGGAAAAAGAACGAA 
AATTGCTGAATGACAATTATGACAAACTCrTAGAAAGCAGTGACAGCTCCAGTCAGCC 
CCACTGGAGCAACGAGCTCATAGCGGAACAGCTACAGCAGCAAGTCTCTCAGCTGCAG 
GATCAGCTGGATGCTGAGCTGGAGGACAAGAGAAAAGTTTTACTTGAGCTGTCCAGGG 
AGAAAGCCCAAAATGAGGATCTGAAGCTTGAAGTCACCAACATACTrCAGAAGCATAA 
ACAGGAAGTAGAGCTCCTCCAAAATGCAGCCACAATTTCCCAACCTCCTGACAGGCAA 
TCTGAACCAGCCACTCACCCAGCTGTATTGCAAGAGAACACTCAGATCCAGCCAAGTG 
AACCCAAAAACCAAGAAGAAAAGAAACTGTCCCAGGTGCTAAATGAGTTGCAAGTATC 
ACACGCAGAGACCACATTGGAACTAGAAAAGACCAGGGACATGCTTATTCTGCAGCGC 
AAAATCAACGTGTGTTATCAGGAGGAACTGGAGGCAATGATGACAAAAGCTGACAATG 
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ATAATAGAGATCACAAAGAAAAGCTGGAGAGGTTGACTCGACTACTAGACCTCAAGAA 
TAACCGTATCAAGCAGCTGGAAGAACAGCTCAAAGATGTTGCTTATGGCACCCGACCG 
TTGTCGTTATGTTTGGAAACACTGCCAGCCCATGGAGATGAGGATAAAGTGGATATTT 
CTCTGCTGCATCAGGGTGAGAATCTTTTTGAACTGCACATCCACCAGGCCTTCCTGAC 
ATCTGCCGCCCTAGCTCAGGCTGGAGATACCCAACCTACCACTTTCTGCACCTATTCC 
TTCTATGACTTTGAAACCCACTGTACCCCArTATCTGTGGGGCCACAGCCCCTCTATG 
ACTTCACCTCCCAGTATGTGATGGAGACAGATTCGCTTTTCTTACACTACCTTCAAGA 
GGCTTCAGCCCGGCTTGACATACACCAGGCCATGGCCAGTGAACACAGCACTCTTGCT 
GCAGGATGGATTTGCTTTGACAGGGTGCTAGAGACTGTGGAGAAAGTCCATGGCTTGG 
CCACACTGATTGGTGCTGGTGGAGAAGAGTTCGGGGTTCTAGAGTACTGGATGAGGCT 
GCGTTTCCCCATAAAACCCAGCCTACAGGCGTGCAATAAACGAAAGAAAGCCCAGGTC 
TACCTGTCAACCGATGTGCTTGGAGGCCGGAAGGCCCAGGAAGAGGAGGTGAGATCGG 
AGTCTTGGGAACCTCAGAACGAGCTGTGGATTGAAATCACCAAGTGCTGTGGCCTCCG 
GAGTCGATGGCTGGGAACTCAACCCAGTCCATATGCTGTGTACCGCTTCTTCACCTTT 
TCTGACCATGACACTGCCATCATTCCAGCCAGTAACAACCCCTACTTTAGAGACCAGG 
CTCGATTCCCAGTGCTTGTGACCTCTGACCTGGACCATTATCTGAGACGGGAGGCCTT 
GTCTATACATGTTTTTGATGATGAAGACTTAGAGCCTGGCTCGTATCTTGGCCGAGCC 
CGAGTGCCTTTACTGCCTCTTGCAAAAAATGAATCTATCAAAGGTGATTTrAACCTCA 
CTGACCCTGCAGAGAAACCCAACGGATCTATTCAAGTGCAACTGGATTGGAAGTTTCC 
CTACATACCCCCTGAGAGCTTCCTGAAACCAGAAGCTCAGACTAAGGGGAAGGATACC 
AAGGACAGTTCAAAGATCTCATCTGAAGAGGAAAAGGCTTCATTTCCTTCCCAGGATC 
AGATGGCATCTCCTGAGGTTCCCATTGAAGCTGGCCAGTATCGATCTAAGAGAAAACC 
TCCTCATGGGGGAGAAAGAAAGGAGAAGGAGCACCAGGTTGTGAGCTACTCAAGAAGA 
AAACATGGCAAAAGAATAGGTGTTCAAGGAAAGAATAGAATGGAGTATCTTAGCCTTA 
ACATCTTAAATGGAAATACACTGAAGCAGGTGAATTACACTGAGTGGAAGTTCTCAGA 
GACTAACAGCTTCATAGGTGATGGCTTTAAAAATCAGCACGAGGAAGAGGAAATGACA 
TTATCCCATTCAGCACTGAAACAGAAGGAACCTCTACATCCTGTAAATGACAAAGAAT 
CCTCTGAACAAGGTTCTGAAGTCAGTGAAGCACAAACTACCGACAGTGATGATGTCAT 
AGTGCCACCCATGTCTCAGAAATATCCTAAGGCAGATTCAGAGAAGATGTGCATTGAA 
ATTGTCTCCCTGGCCTTCTACCCAGAGGCAGAAGTGATGTCTGATGAGAACATAAAAC 
AGGTGTATGTGGAGTACAAATTCTACGACCTACCCTTGTCGGAGACAGAGACTCCAGT 
GTCCCTAAGGAAGCCTAGGGCAGGAGAAGAAATCCACTTTCACTTTAGCAAGGTAATA 
GACCTGGACCCACAGGAGCAGCAAGGCCGAAGGCGGTTTCTGrTCGACATGCTGAATG 
GACAAGATCCTGATCAAGGACAGTTAAAGTTTACAGTGGTAAGTGATCCTCTGGATGA 
AGAAAAGAAAGAATGTGAAGAAGTGGGATATGCATATCTTCAACTGTGGCAGATCCTG 
GAGTCAGGAAGAGATATTCTAGAGCAAGAGCTAGACGTTGTTAGCCCTGAAGATCTGG 
CTACCCCAATAGGAAGGCTGAAGGTTTCCCTTCAAGCAGCTGCTGTCCTCCATGCTAT 
I CAAG T ACTGAAGATTTGTTTTCATGAAGGAACAA 




ORF Start: ATG at 
23 


ORF Stop: TGA at 3686 




SEQ ID NO: 38 


1221 aa |MW at 139825.2kD 


NOV12a, 

CG57819-01 Protein 
Sequence 


MFSGTLGKNMKTQPPLSRMNREELEDSFFRLREDHMLVKELSWKQQDEIKRLRTTLLR 
LTAAGRDLRVAEEAAPLSETARRGQKAGWRQRLSMHQRPQMHRLQGHFHCVGPASPRR 
AQPRVQVGHRQLHTAGAPVPEKPKRGRDRLSYTAPPSFKEHATNENRGEVASKPSELA 
HIMASNTMQVEEPPKSPEKMWPKDENFEQRSSLECAQKAAELRASIKEKVELIRLKKL 
LHERNASLVMTKAQLTEVQEVSCHLLTQNQG I LS AAHEALLKQVNELRAELKEESKKA 
VSLKSQLEDVSILQMTLKEFQERVEDLEKERKLLNDNYDKLLESSDSSSQPHWSNELI 
AEQLQQQVSQLQDQLDAELEDKRKVLLELSREKAQNEDLKLEVTNILQKHKQEVELLQ 
NAATISQPPDRQSEPATHPAVLQENTQIQPSEPKNQEEKKLSQVLNELQVSHAETTLE 
LEKTRDMLILQRKINVCYQEELEAMMTKADNDNRDHKEKLERLTRLLDLKNNRIKQLE 
EQLKDVAYGTRPLSLCLETLPAHGDEDKVDISLLHQGENLFELHIHQAFLTSAALAQA 
GDTQPTTFCTYSFYDFETHCTPLSVGPQPLYDFTSQYVMETDSLFLHYLQEASARLDI 
HQAMASEHSTLAAGWICFDRVLETVEKVHGLATLIGAGGEEFGVLEYWMRLRFPIKPS 
LQACNKRKKAQVYLSTDVLGGRKAQEEEVRSESWEPQNELWIEITKCCGLRSRWLGTQ 
PSPYAVYRFFTFSDHDTAIIPASNNPYFRDQARFPVLVTSDLDHYLRREALSIHVFDD 
EDLEPGSYLGRARVPLLPLAKNESIKGDFNLTDPAEKPNGSIQVQLDWKFPYIPPESF 
LKPEAQTKGKDTKDSSKISSEEEKASFPSQDQMASPEVPIEAGQYRSKRKPPHGGERK 
EKEHQWSYSRRKHGKRIGVQGKNRMEYLSLNILNGNTLKQVNYTEWKFSETNSFIGD 
GFKNQHEEEEMTLSHSALKQKEPLHPVNDKESSEQGSEVSEAQTTDSDDVIVPPMSQK 
YPKADSEKMCIEIVSLAFYPEAEVMSDENIKQVYVEYKFYDLPLSETETPVSLRKPRA 
GEEIHFHFSKVIDLDPQEQQGRRRFLFDMLNGQDPDQGQLKFTWSDPLDEEKKECEE 
VGYAYLQLWQILESGRDILEQELDWSPEDLATPIGRLKVSLQAAAVLHAIYKEMTED 
LFS 



Further analysis of the NOV 12a protein yielded the following properties shown in 
Table 12B. 



131 



Table 12B. Protein Sequence Properties NOV12a 


PSort 
analysis: 


0.9600 probability located in nucleus; 0.3000 probability located in 
microbody (peroxisome); 0.1000 probability located in mitochondrial matrix 
space; 0.1000 probability located in lysosome (lumen) 


SignalP 

analysis: 


No Known Signal Sequence Predicted 



A search of the NOV 12a protein against the Geneseq database, a proprietary 
database that contains sequences published in patents and patent publication, yielded 
several homologous proteins shown in Table 12C. 



Table 12C. Geneseq Results for NOV12a 


Geneseq 
Identifier 


Protein/Organism/Length 
[Patent #, Date] 


NOV12a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Region 


Expect 
Value 


AAM78558 


Human protein SEQ ID NO 1220 - 
Homo sapiens, 1 179 aa. 
[WO200157190-A2, 09-AUG- 
2001] 


63..1219 
47.. 11 77 


400/1193 (33%) 
640/1193 (53%) 


e-172 


AAM79542 


Human protein SEQ ID NO 31 88 - 
Homo sapiens, 1 160 aa. 
[WO200157190-A2, 09-AUG- 
2001] 


63..1219 
28..1158 


400/1193 (33%) 
640/1193 (53%) 


e-172 


AAM41414 


Human polypeptide SEQ ID NO 
6345 - Homo sapiens, 1 160 aa. 
[WO200153312-A1, 26-JUL-2001] 


63..1219 
28..1158 


400/1193 (33%) 
640/1193 (53%) 


e-172 


AAM39628 


Human polypeptide SEQ ID NO 
2773 - Homo sapiens, 1 128 aa. 
[WO200153312-A1, 26-JUL-2001] 


118..1219 
47.. 1126 


390/1138(34%) 
623/1138(54%) 


e-171 


AAG75661 


Human colon cancer antigen 
protein SEQ ID NO:6425 - Homo 
sapiens, 118 aa. [WO200 122920- 
A2, 05-APR-2001] 


445. .523 
33..111 


40/79 (50%) 
56/79 (70%) 


le-13 



In a BLAST search of public sequence databases, the NOV12a protein was found to 
have homology to the proteins shown in the BLASTP data in Table 12D. 
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Table 12D. Public BLASTP Results for NOV! 2a 


Protein 
Accession 
Number 


Protein/Organism/Length 


NOV12a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for the 
Matched Portion 


Expect 
Value 


Q96KN7 


RPGR-INTERACTING 
PROTEIN 1 - Homo sapiens 
(Human), 1286 aa. 


7.. 1221 
29.. 1286 


1203/1258 (95%) 
1207/1258(95%) 


0.0 


Q96QA8 


RPGR-INTERACTING 
PROTEIN 1 - Homo sapiens 
(Human), 1286 aa. 


7..1221 
29..1286 


1203/1258 (95%) 
1207/1258 (95%) 


0.0 


Q9GLM3 


RPGR-INTERACTING 
PROTEIN- 1 - Bos taurus 
(Bovine), 1221 aa. 


1..1221 
L.1221 


922/1234 (74%) 
1031/1234 (82%) 


0.0 


Q9NR40 


RPGR-PNTERACTING 
PROTEIN - Homo sapiens 
(Human), 902 aa. 


331..1221 
1..902 


883/902 (97%) 
888/902 (97%) 


0.0 


Q9HBK6 


RPGR-INTERACTING 
PROTEIN-1 - Homo sapiens 
(Human), 762 aa. 


471..1221 

1..762 


742/763 (97%) 
746/763 (97%) 


0.0 



PFam analysis predicts that the NOV 12a protein contains the domains shown in the 
Table 12E. 



Table 12E. Domain Analysis of NOV12a 


Pfam Domain 


NOV12a Match 
Region 


Identities/ 
Similarities 
for the Matched 
Region 


Expect 
Value 


PFEMP: domain 1 of 
1 


293..413 


23/176 (13%) 
82/176(47%) 


7.9 


C2: domain 1 of 1 


736..825 


14/101 (14%) 
54/101 (53%) 


1.4 



Example 13. 



The NOV 13 clone was analyzed, and the nucleotide and predicted polypeptide 
sequences are shown in Table 13A. 
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Table 13A. NOV13 Sequence Analysis 




SEQIDNO:39 [678 hp 


NOV13a, 

CG57789-01 DNA Sequence 


TGGGGCGGGAGGCATGGTCTCCACCTACCGGGTGGCCGTGCTGGGGGCGCGAGGTGTG 
GGCAAGAGTGCCATCGTGCGCCAGTTCTTGTACAACGAGTTCAGCGAGGTCTGCGTCC 
CCACCACCGCCCGCCGCCTTTACCTGCCTGCTGTCGTCATGAACGGCCACGTGCACGA 
CCTCCAGATCCTCGACTTTCCACCCATCAGCGCCTTCCCTGTCAATACGCTCCAGGAG 
TGGGCAGACACCTGCTGCAGGGGACTCCGGAGTGTCCACGCCTACATCCTGGTCTACG 
ACATCTGCTGCTTTGACAGCTTTGAGTACGTCAAGACCATCCGCCAGCAGATCCTGGA 
GACGAGGGTGATCGGAACCTCAGAGACGCCCATCATCATCGTGGGCAACAAGCGGGAC 
CTGCAGCGCGGACGCGTGATCCCGCGCTGGAACGTGTCGCACCTGGTACGCAAGACCT 
GGAAGTGCGGCTACGTGGAATGCTCGGCCAAGTACAACTGGCACATCCTGCTGCTCTT 
CAGCGAGCTGCTCAAGAGCGTCGGCTGCGCCCGTTGCAAGCACGTGCACGCTGCCCTG 
CGCTTCCAGGGCGCGCTGCGCCGCAACCGCTGCGCCATCATGTGACGCCTGCGCGCCC 
CTCGGGCTGCACCGGCACTGGCCGAGCGGAGGGCGGGGCC 




ORF Start: ATG at 14 


ORF Stop: TGA at 623 




SEQIDNO: 40 


203 aa |MW at 23229.0kD 


NOV13a, 

CG57789-01 Protein Sequence 


MVSTYRVAVLGARGVGKSAIVRQFLYNEFSEVCVPTTARRLyLPAWMNGHVHDLQIL 
DFPPISAFPVNTLQEWADTCCRGLRSVHAYILVYDICCFDSFEYVKTIRQQILETRVI 
GTSETPIIIVGNKRDLQRGRVIPRWNVSHLVRKTWKCGYVECSAKYNWHIIjLLFSELL 
KSVGCARCKHVHAALRFQGALRRNRCAIM 




SEQIDNO: 41 1 682 bp 


NOV13b, 

CG57789-02 DNA Sequence 


GAGTGCCATCGTGCGCCAGTTCTTGTACAACGAGTTCAGCGAGGTCTGCGTCCCCACC 
ACCGCCCGCCGCCTTTACCTGCCTGCTGTCGTCATGAACGGCCACGTGCACGACCTCC 
AGATCCTCGACTTTCCACCCATCAGCGCCTTCCCTGTCAATACGCTCCAGGAGTGGGC 
AGACACCTGCTGCAGGGGACTCCGGAGTGTCCACGCCTACATCCTGGTCTACGACATC 
TGCTGCTTTGACAGCTTTGAGTACGTCAAGACCATCCGCCAGCAGATCCTGGAGACGA 
GGGTGATCGGAACCTCAGAGACGCCCATCATCATCGTGGGCAACAAGCGGGACCTGCA 
GCGCGGACGCGTGATCCCGCGCTGGAACGTGTCGCACCTGGTACGCAAGACCTGGAAG 
TGCGGCTACGTGGAATGCTCGGCCAAGTACAACTGGCACATCCTGCTGCTCTTCAGCG 
AGCTGCTCAAGAGCGTCGGCTGCGCCCGTTGCAAGCACGTGCACGCTGCCCTGCGCTT 
CCAGGGCGCGCTGCGCCGCAACCGCTGCGCCATCATGTGACGCCTGCGCGCCCCTCGG 
GCTGCACCGGCACTGGCCGAGCGGAGGGCACTGGCCGAGCGGAG 




ORF Start: ATG at 9 


ORF Stop: TGA at 618 




SEQ ID NO: 42 


203 aa llvlW at 23229.0kD 


NOV13b, 

CG57789-02 Protein Sequence 


MVSTYRVAVLGARGVGKSAIVRQFLYNEFSEVCVPTTARRLYLPAWMNGHVHDLQIL 
DFPPISAFPVNTLQEWADTCCRGLRSVHAY I LVYDI CCFDSFE YVKT I RQQ I LETRVI 
GTSETPIIIVGNKRDLQRGRVIPRWNVSHLVRKTWKCGYVECSAKYNWHILLLFSELL 
KSVGCARCKHVHAALRFQGALRRNRCAIM 



Sequence comparison of the above protein sequences yields the following sequence 
relationships shown in Table 13B. 



Table 13B. Comparison of NOV13a against NOV13b. 


Protein 
Sequence 


NOV13a Residues/ 
Match Residues 


Identities/ 
Similarities for the Matched Region 


NOV 13b 


1..203 
1..203 


203/203 (100%) 
203/203 (100%) 



Further analysis of the NOV13a protein yielded the following properties shown in 
Table 13C. 
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Table 13C. Protein Sequence Properties NOV13a 


PSort 

analysis: 


0.6500 probability located in plasma membrane; 0.5064 probability located 
in mitochondrial matrix space; 0.3844 probability located in microbody 
(peroxisome); 0.2556 probability located in lysosome (lumen) 


SignalP 
analysis: 


No Known Signal Sequence Predicted 



A search of the NOV13a protein against the Geneseq database, a proprietary 
database that contains sequences published in patents and patent publication, yielded 
several homologous proteins shown in Table 13D. 



Table 13D. Geneseq Results for NOV13a 


Geneseq 
Identifier 


Protein/Orgsnisni/Lcngtfa [Patent 
#, Date] 


NOV13a 
Resi d u es/ 

Match 
Residues 


Identities/ 

the Matched 
Region 


Value 


AAB42840 


Human ORFX ORF2604 
polypeptide sequence SEQ ID 
NO:5208 - Homo sapiens, 136 aa. 
[WO200058473-A2, 05-OCT- 
2000] 


1..136 
1..136 


136/136(100%) 
136/136(100%) 


2e-75 


AAM41682 


Human polypeptide SEQ ID NO 
6613 - Homo sapiens, 206 aa. 
[WO200153312-A1, 26-JUL-2001] 


5..174 
15..173 


66/171 (38%) 
89/171 (51%) 


4e-18 


AAM39896 


Human polypeptide SEQ ID NO 
3041 - Homo sapiens, 199 aa. 
[WO200153312-A1, 26-JUL-2001] 


5..174 
8..166 


66/171 (38%) 
89/171 (51%) 


4e-18 


AAY99656 


Human GTPase associated protein- 
7 - Homo sapiens, 281 aa. 
[WO200031263-A2, 02-JUN-2000] 


5..173 
25..191 


59/179 (32%) 
87/179 (47%) 


3e-14 


AAR05075 


RAP1 A Gene product 
incorporating at least one peptide 
associated with ras oncogene - 
Synthetic, 184 aa. [WO9000179-A, 
ll-JAN-1990] 


5..177 
4..165 


57/175 (32%) 
90/175 (50%) 


5e-14 



In a BLAST search of public sequence databases, the NOV 1 3a protein was found to 
have homology to the proteins shown in the BLASTP data in Table 13E. 
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Table 13E. Public BLASTP Results for NOV13a 


Protein 
Accession 
Number 


Protein/Organism/Length 


NOV13a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Portion 


Expect 
Value 


Q96S79 


RAS-LIKE PROTEIN/VTS58635 - 
Homo sapiens (Human), 203 aa. 


1..203 
1..203 


203/203(100%) 
203/203(100%) 


e-118 




Ras-like protein RRP22 (RAS- 
related protein on chromosome 22) 
- Homo sapiens (Human), 203 aa. 


1..203 
1 ..203 


105/204 (51%) 
134/204(65%) 


3e-50 


Q95KD9 


PROTEIN - Macaca fascicularis 
(Crab eating macaque) 
(Cynomolgus monkey), 199 aa. 


8.. 166 


OO/l /l [60/0) 

89/171 (51%) 


le-17 


Q96HU8 


SIMILAR TO CG8500 GENE 
PRODUCT - Homo sapiens 
(Human), 199 aa. 


5..174 
8..166 


66/171 (38%) 
89/171 (51%) 


le-17 


Q9NF75 


EG:BACR37P7.8 PROTEIN - 
Drosophiia melanogaster (Fruit 
fly), 306 aa. 


5..174 
48..210 


61/174(35%) 
88/174 (50%) 


4e-16 



PFam analysis predicts that the NOV 13a protein contains the domains shown in the 
Table 13F. 



Table 13F. Domain Analysis of NOV13a 


Pfam Domain 


NOV13a Match 
Region 


Identities/ 
Similarities 
for the Matched 
Region 


Expect 
Value 


Semialdhyde dh: domain 1 
of 1 


4..14 


4/11 (36%) 
11/11 (100%) 


0.75 


ras: domain 1 of 1 


6..203 


56/224 (25%) 
125/224 (56%) 


1.2e-12 



Example 14. 



The NOV 14 clone was analyzed, and the nucleotide and predicted polypeptide 
sequences are shown in Table 14A. 
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Table 14A. NOV14 Sequence Analysis 




SEQIDNO:43 1 1790 bp 


NOV14a, 

CG57758-01 DNA Sequence 


TCTCCCTCCCGCGCGATGGCCTCGGCGCTGAGCTATGTCTCCAAGTTCAAGTCCTTCG 
TGATCTTGTTCGTCACCCCGCTCCTGCTGCTGCCACTCGTCATTCTGATGCCCGCCAA 
GGTCAGTTGTGCCTACGTCATCATCCTCATGGCCATTTACTGGTGCACAGAAGTCATC 
CCTCTGGCTGTCACCTCTCTCATGCCTGTCTrGCTTTTCCCACTCTTCCAGATTCTGG 
ACTCCAGGCAGGTGTGTGTCCAGTACATGAAGGACACCAACATGCTGTTCCTGGGCGG 
CCTCATCGTGGCCGTGGCTGTGGAGCGCTGGAACCTGCACAAGAGGATCGCCCTGCGC 
ACGCTCCTCTGGGTGGGGGCCAAGCCTGCACGGCTGATGCTGGGCTTCATGGGCGTCA 
CAGCCCTCCTGTCCATGTGGATCAGTAACACGGCAACCACGGCCATGATGGTGCCCAT 
CGTGGAGGCCATATTGCAGCAGATGGAAGCCACAAGCGCAGCCACCGAGGCCGGCCTG 
GAGCTGGTGGACAAGGGCAAGGCCAAGGAGCTGCCAGGGAGTCAAGTGATTTTTGAAG 
GCCCCACTCTGGGGCAGCAGGAAGACCAAGAGCGGAAGAGGTTGTGTAAGGCCATGAC 
CCTGTGCATCTGCTACGCGGCCAGCATCGGGGGCACCGCCACCCTGACCGGGACGGGA 
CCCAACGTGGTGCTCCTGGGCCAGATGAACGAGTTGTTTCCTGACAGCAAGGACCTCG 
TGAACTTTGCTTCCTGGTTTGCATTTGCCTTTCCCAACATGCTGGTGATGCTGCTGTT 
CGCCTGGCTGTGGCTCCAGTTTGTTTACATGTTCTCCAGTTTTAAAAAGTCCTGGGGC 
TGCGGGCTAGAGAGCAAGAAAAACGAGAAGGCTGCCCTCAAGGTGCTGCAGGAGGAGT 
ACCGGAAGCTGGGGCCCTTGTCCTTCGCGGAGATCAACGTGCTGATCTGCTTCTTCCT 
GCTGGTCATCCTGTGGTTCTCCCGAGACCCCGGCTTCATGCCCGGCTGGCTGACTGTT 
GCCTGGGTGGAGGGTGAGACAAAGTATGTCTCCGATGCCACTGTGGCCATCTTTGTGG 
CCACCCTGCTATTCATTGTGCCTTCACAGAAGCCCAAGTTTAACTTCCGCAGCCAGAC 
TGAGGAAGGTAAGTCTCCTGTTCTGATCGCCCCCCCTCCCCTGCTGGATTGGAAGGTA 
ACCCAGGAGAAAGTGCCCTGGGGCATCGTGCTGCTACTAGGGGGCGGATTTGCTCTGG 
CTAAAGGATCCGAGGCCTCGGGGCTGTCCGTGTGGATGGGGAAGCAGATGGAGCCCTT 




ACTGAGTGCACAAGCAACGTGGCCACCACCACCTTGTTCCTGCCCATCTTTGCCTCCA 
TGTCTCGCTCCATCGGCCTCAATCCGCTGTACATCATGCTGCCCTGTACCCTGAGTGC 
CTCCTTTGCCTTCATGTTGCCTGTGGCCACCCCTCCAAATGCCATCGTGTTCACCTAT 
GGGCACCTCAAGGTTGCTGACATGGTGAAAACAGGAGTCATAATGAACATAATTGGAG 
TCTTCTGTGTGTTTTTGGCTGTCAACACCTGGGGACGGGCCATATTTGACTTGGATCA 
TTTCCCTGACTGGGCTAATGTGACACATATTGAGACTTAGGAAGAGCCACAAGACCAC 
ACACACAC-CCCTTACCCTCCTCAGGACTACCGAACCTTCTGGCACACCTT 




ORF Start: ATG at 16 


ORF Stop: TAG at 1720 




SEQ ID NO: 44 


568 aa |MW at 62592.9kD 


NOV14a, 

CG57758-01 Protein Sequence 


MASALSYVSKFKSFVILFVTPLLLLPLVILMPAKVSCAYVIILMAIYWCTEVIPIjAVT 
SLMPVLLFPLFQILDSRQVCVQYMKDTNMLFLGGLIVAVAVERWNLHKRIALRTLLWV 
GAKPARLMLGFMGVTALLSMWISNTATTAMMVPIVEAILQQMEATSAATEAGLELVDK 
GKAKELPGSQVIFEGPTIjGQQEDQERKRLCKAMTLCICYAASIGGTATLTGTGPIJWL 
LGQMNELFPDSKDLVNFASWFAFAFPNMLVMLLFAWLWLQFVYMFSSFKKSWGCGLES 
KKNEKAALKVLQEEYRKLGPLSFAEINVLICFFLLVILWFSRDPGFMPGWLTVAW7EG 
ETKYVSDATVAIFVATLLFIVPSQKPKFNFRSQTEEGKSPVLIAPPPLLDWKVTQEKV 
PWGIVLLLGGGFALAKGSEASGLSVWMGKQMEPLHAVPPAAITLILSLLVAVFTECTS 
NVATTTLFLPIFASMSRSIGLNPLYIMLPCTLSASFAFMLPVATPPNAIVFTYGHLKV 
ADMVKTGVIMNI IGVFCVFLAVNTWGRAI FDLDHFPDWANVTHI ET 




SEQ ID NO: 45 1 1899 bp 


NOV14b, 

CG57758-02 DNA Sequence 


CGTCTCGCCCGCCAGTCTCCCTCCCGCGCGATGGCCTCGGCGCTGAGCTATGTCTCCA 


TCTGATGCCCGCCAAGGTCAGTTGCTGTGCCTACGTCATCATCCTCATGGCCATTTAC 
TGGTGCACAGAAGTCATCCCTCTGGCTGTCACCTCTCTCATGCCTGTCTTGCTTTTCC 
CACTCTTCCAGATTCTGGACTCCAGGCAGGTGTGTGTCCAGTACATGAAGGACACCAA 
CATGCTGTTCCTGGGCGGCCTCATCGTGGCCGTGGCTGTGGAGCGCTGGAACCTGCAC 
AAGAGGATCGCCCTGCGCACGCTCCTCTGGGTGGGGGCCAAGCCTGCACGGCTGATGC 
TGGGCTTCATGGGCGTCACAGCCCTCCTGTCCATGTGGATCAGTAACACGGCAACCAC 
GGCCATGATGGTGCCCATCGTGGAGGCCATATTGCAGCAGATGGAAGCCACAAGCGCA 
GCCACCGAGGCCGGCCTGGAGGGACAAGGTACCACAATAAACAACCTGAATGCACTGG 
AGGATGATACAGTGAAAGCAGTACTAGGAGGAAAGTGTGTAGCTATAATAAGCACTTA 
CGTCAAAAAAGTAGAAAAACTTCAAATAAACAATCTAATGACACCTCTTAAAAAACTA 
GAAAAGCAAGAGCAACAGGACCTAGGGCCTGGCATCAGGCCTCAGGACTCTGCCCAGT 
GCCAGGAAGACCAAGAGCGGAAGAGGTTGTGTAAGGCCATGACCCTGTGCATCTGCrA 
CGCGGCCAGCATCGGGGGCACCGCCACCCTGACCGGGACGGGACCCAACGTGGTGCTC 
CTGGGCCAGATGAACGAGTTGTTTCCTGACAGCAAGGACCTCGTGAACTTTGCTTCCT 
GGTTTGCATTTGCCTTTCCCAACATGCTGGTGATGCTGCTGTTCGCCTGGCTGTGGCT 
CCAGTTTGTTTACATGTTCTCCAGTTTTAAAAAGTCCTGGGGCTGCGGGCTAGAGAGC 
AAGAAAAACGAGAAGGCTGCCCTCAAGGTGCTGCAGGAGGAGTACCGGAAGCTGGGGC 
CCTTGTCCTTCGCGGAGATCAACGTGCTGATCTGCTTCTTCCTGCTGGTCATCCTGTG 
GTTCTCCCGAGACCCCGGCTTCATGCCCGGCTGGCTGACTGTTGCCTGGGTGGAGGGT 
GAGACAAAGTCAGTCTCCGATGCCACTGTGGCCATCTTTGTGGCCACCCTGCTATTCA 
TTGTGCCTTCACAGAAGCCCAAGTTTAACTTCCGCAGCCAGACTGAGGAAGGTAAGTC 

i ~ GTG ;CTACTAGG 3 tCTTI T T V? 3ATCCGAC 
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CCTCGGGGCTGTCCGTGTGGATGGGGAAGCAGATGGAGCCCTTGCACGCAGTGCCCCC 
GGCAGCCATCACCTTGATCTTGTCCTTGCTCGTTGCCGTGTTCACTGAGTGCACAAGC 
AACGTGGCCACCACCACCTTGTTCCTGCCCATCTTTGCCTCCATGTCTCGCTCCATCG 
GCCTCAATCCGCTGTACATCATGCTGCCCTGTACCCTGAGTGCCTCCTTTGCCTTCAT 
GTTGCCTGTGGCCACCCCTCCAAATGCCATCGTGTTCACCTATGGGCACCTCAAGGTT 
GCTGACATGGTAAAAACAGGAGTCATAATGAACATAATTGGAGTCTTCTGTGTGTTTT 
TGGCTGTCAACACCTGGGGACGGGCCATATTTGACTTGGATCATTTCCCTGACTGGGC 
TAATGTGACACATATTGAGACTTAGGAAGAGCCACAAGACCAC 




ORF Start: ATG at 3 1 


ORF Stop: TAG at 1879 




SEQIDNO: 46 


616 aa |MW at 67816.9kD 


NOV 14b, 

CG57758-02 Protein Sequence 


MASALSYVSKFKSFVILFVTPLLLLPLVILMPAKVSCCAYVIILMAIYWCTEVIPLAV 
TSLMPVLLFPLFQILDSRQVCVQYMKDTNMLFLGGLIVAVAVERWNLHKRIALRTLLW 
VGAKPARLMLGFMGVTALLSMWISNTATTAMMVPIVEAILQQMEATSAATEAGLEGQG 
TTINNLNALEDDTVKAVLGGKCVAIISTYVKKVEKLQINNLMTPLKKLEKQEQQDLGP 
GIRPQDSAQCQEDQERKRLCKAMTLCICYAASIGGTATLTGTGPNVVLLGQMNELFPD 
SKDLVNFASWFAFAFPNNLVMLLFAWLWLQFVXMFSSFKKSWGCGLESKKNEKAALKV 
LQEEYRKLGPLSFAEINVLICFFLLVILWFSRDPGFMPGWLTVAWVEGETKSVSDATV 
AIFVATLLFIVPSQKPKFNFRSQTEEGKSPVLIAPPPLLDWKVTQEKVPWGIVLLLGG 
GFALAKGSEASGLSVWMGKQMEPLHAVPPAAITLILSLLVAVFTECTSNVATTTLFLP 
IFASMSRSIGLNPLYIMLPCTLSASFAFMLPVATPPNAIVFTYGHLKVADMVKTGVIM 
NI IGVFCVFLAVNTWGRAI FDLDHFPDWANVTHIET 




SEQIDNO: 47 1 1899 bp 


NOV 14c, 

CG57758-03 DNA Sequence 


CGTCTCGCCCGCCAGTCTCCCTCCCGCGCGATGGCCTCGGCGCTGAGCTATGTCTCCA 


TCTGATGCCCGCCAAGGTCAGTTGCTGTGCCTACGTCATCATCCTCATGGCCATTTAC 
TGGTGCACAGAAGTCATCCCTCTGGCTGTCACCTCTCTCATGCCTGTCTTGCTTTTCC 
CACTCTTCCAGATTCTGGACTCCAGGCAGGTGTGTGTCCAGTACATGAAGGACACCAA 
CATGCTGTTCCTGGGCGGCCTCATCGTGGCCGTGGCTGTGGAGCGCTGGAACCTGCAC 
AAGAGGATCGCCCTGCGCACGCTCCTCTGGGTGGGGGCCAAGCCTGCACGGCTGATGC 

GGCCATGATGGTGCCCATCGTGGAGGCCATATTGCAGCAGATGGAAGCCACAAGCGCA 

AGGATGATACAGTGAAAGCAGTACTAGGAGGAAAGTGTGTAGCTATAATAAGCACTTA 
CGTCAAAAAAGTAGAAAAACTTCAAATAAACAATCTAATGACACCTCTTAAAAAACTA 
GAAAAGCAAGAGCAACAGGACCTAGGGCCTGGCATCAGGCCTCAGGACTCTGCCCAGT 
GCCAGGAAGACCAAGAGCGGAAGAGGTTGTGTAAGGCCATGACCCTGTGCATCTGCTA 
CGCGGCCAGCATCGGGGGCACCGCCACCCTGACCGGGACGGGACCCAACGTGGTGCTC 
CTGGGCCAGATGAACGAGTTGTTTCCTGACAGCAAGGACCTCGTGAACTTTGCTTCCT 
GGTTTGCATTTGCCTTTCCCAACATGCTGGTGATGCTGCTGTTCGCCTGGCTGTGGCT 
CCAGTTTGTTTACATGTTCTCCAGTTTTAAAAAGTCCTGGGGCTGCGGGCTAGAGAGC 
AAGAAAAACGAGAAGGCTGCCCTCAAGGTGCTGCAGGAGGAGTACCGGAAGCTGGGGC 
CCTTGTCCTTCGCGGAGATCAACGTGCTGATCTGCTTCTTCCTGCTGGTCATCCTGTG 
GTTCTCCCGAGACCCCGGCTTCATGCCCGGCTGGCTGACTGTTGCCTGGGTGGAGGGT 
GAGACAAAGTCAGTCTCCGATGCCACTGTGGCCATCTTTGTGGCCACCCTGCTATTCA 
TTGTGCCTTCACAGAAGCCCAAGTTTAACTTCCGCAGCCAGACTGAGGAAGGTAAGTC 
TCCTGTTCTGATCGCCCCCCCTCCCCTGCTGGATTGGAAGGTAACCCAGGAGAAAGTG 
CCCTGGGGCATCGTGCTGCTACTAGGGGGCGGATTTGCTCTGGCTAAAGGATCCGAGG 
CCTCGGGGCTGTCCGTGTGGATGGGGAAGCAGATGGAGCCCTTGCACGCAGTGCCCCC 
GGCAGCCATCACCTTGATCTTGTCCTTGCTCGTTGCCGTGTTCACTGAGTGCACAAGC 
AACGTGGCCACCACCACCTTGTTCCTGCCCATCTTTGCCTCCATGTCTCGCTCCATCG 
GCCTCAATCCGCTGTACATCATGCTGCCCTGTACCCTGAGTGCCTCCTTTGCCTTCAT 
GTTGCCTGTGGCCACCCCTCCAAATGCCATCGTGTTCACCTATGGGCACCTCAAGGTT 
GCTGACATGGTAAAAACAGGAGTCATAATGAACATAATTGGAGTCTTCTGTGTGTTTT 
TGGCTGTCAACACCTGGGGACGGGCCATATTTGACTTGGATCATTTCCCTGACTGGGC 
"i 3TGACACATATTGAG.Z T TP^ pj>. ; - AC AGACCAC 




ORF Start: ATG at 31 joRF Stop: TAG at 1 879 




SEQIDNO: 48 |616aa |MW at 67816.9kD 


NOVl4c, 

CG57758-03 Protein Sequence 


MASALSYVSKFKSFVILFVTPLLLLPLVILMPAKVSCCAYVIILMAIYWCTEVIPLAV 
TSLMPVLLFPLFQILDSRQVCVQYMKDTNMLFLGGLIVAVAVERWNLHKRIALRTLLW 
VGAKPARLMLGFMGVTALLSMWISNTATTAMMVPIVEAILQQMEATSAATEAGLEGQG 
TTINNLNALEDDTVKAVLGGKCVAIISTYVKKVEKLQINNLMTPLKKLEKQEQQDLGP 
GIRPQDSAQCQEDQERKRLCKAMTLCICYAASIGGTATLTGTGPNVVLLGQMNELFPD 
SKDLVNFASWFAFAFPIWLVMLLFAWLWLQFVYMFSSFKKSWGCGLESKKNEKAALKV 
LQEEYRKLGPLSFAEINVLICFFLLVILWFSRDPGFMPGWLTVAWVEGETKSVSDATV 
AI FVATLLFI VPSQKPKFNFRSQTEEGKSPVLI AP PPLLDWKVTQEKVPWG I VLLLGG 
GFALAKGSEASGLSVWMGKQMEPLHAVPPAAITLILSLLVAVFTECTSNVATTTLFLP 
IFASMSRSIGLNPLYIMLPCTLSASFAFMLPVATPPNAIVFTYGHLKVADMVKTGVIM 
NI IGVFCVFLAVNTWGRAI FDLDHFPDWANVTHIET 




SEQIDNO: 49 ] 1606 bp 
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NOV14d, 

CG57758-04 DNA Sequence 


GATGGCCTCGGCGCTGAGCTATGTCTCCAAGTTCAAGTCCTTCGTGATCTTGTTCGTC 
ACCCCGCTCCTGCTGCTGCCACTCGTCATTCTGATGCCCGCCAAGTTTGTCAGGTGTG 
CCTACGTCATCATCCTCATGGCCATTTACTGGTGCACAGAAGTCATCCCTCTGGCTGT 
CACCTCTCTCATGCCTGTCTTGCTTTTCCCACTCTTCCAGATTCTGGACTCCAGGCAG 
GTGTGTGTCCAGTACATGAAGGACACCAACATGCTGTTCCTGGGCGGCCTCATCGTGG 
CCGTGGCTGTGGAGCGCTGGAACCTGCACAAGAGGATCGCCCTGCGCACGCTCCTCTG 
GGTGGGGGCCAAGCCTGCACGGCTGATGCTGGGCTTCATGGGCGTCACAGCCCTCCTG 
TCCATGTGGATCAGTAACACGGCAACCACGGCCATGATGGTGCCCATCGTGGAGGCCA 
TATTGCAGCAGATGGAAGCCACAAGCGCAGCCACCGAGGCCGGCCTGGAGCTGGTGGA 
CAAGGGCAAGGCCAAGGAGCTGCCAGGGAGTCAAGTGATTTTTGAAGGCCCCACTCTG 
GGGCAGCAGGAAGACCAAGAGCGGAAGAGGTTGTGTAAGGCCATGACCCTGTGCATCT 
GCTACGCGGCCAGCATCGGGGGCACCGCCACCCTGACCGGGACGGGACCCAACGTGGT 
GCTCCTGGGCCAGATGAACGAGTTGTTTCCTGACAGCAAGGACCTCGTGAACTTTGCT 
TCCTGGTTTGCATTTGCCTTTCCCAACATGCTGGTGATGCTGCTGTTCGCCTGGCTGT 
GGCTCCAGTTTGTTTACATGAGATTCAATTTTAAAAAGTCCTGGGGCTGCGGGCTAGA 




TGTGGTTCTCCCGAGACCCCGGCTTCATGCCCGGCTGGCTGACTGTTGCCTGGGTGGA 
GGGTGAGACAAAGTATGTCTCCGATGCCACTGTGGCCATCTTTGTGGCCACCCTGCTA 
TTCATTGTGCCTTCACAGAAGCCCAAGTTTAACTTCCGCAGCCAGACTGAGGAAGAAA 




GCCTCGGGGCTGTCCGTGTGGATGGGGAAGCAGATGGAGCCCTTGCACGCAGTGCCCC 
CGGCAGCCATCACCTTGATCTTGTCCTTGCTCGTTGCCGTGTTCACTGAGTGCACAAG 
CAACGTGGCCACCACCACCTTGTTCCTGCCCATCTTTGCCTCCATGGTGAAAACAGGA 
GTCATAATGAACATAATTGGAGTCTTCTGTGTGTTTTTGGCTGTCAACACCTGGGGAC 
GGGCCATATTTGACTTGGATCATTTCCCTGACTGGGCTAATGTGACACATATTGAGAC 
TTAGGAAGAGCCACAAGACCACACACATAGCCCTTACCCT 




ORP Start: ATG at 2 


ORF Stop: TAG at 1568 




SEQ ID NO: 50 


522 aa llMW at 58109.6kD 


NOV14d, 

CG57758-04 Protein Sequence 


MASALSYVSKFKSFVI LFVTPLLLLPLVI LMPAKFVRCAYVI I LMAI YWCTEVI PLAV 
TSLMPVLLFPLFQILDSRQVCVQYMKDTNMLFLGGLIVAVAVERWNLHKRIALRTLLW 
VGAKPARLMLGFMGVTALLSMWISNTATTAMMVPIVEAILQQMEATSAATEAGLELVD 
KGKAKELPGSQVI FEGPTLGQQEDQERKRLCKAMTLCICYAASIGGTATLTGTGPNW 
LLGQMNELFPDSKDLVNFASWFAFAFPNMLVMLLFAWLWLQFVYMRFNFKKSWGCGLE 
SKKNEKAALKVLQEEYRKLGPLSFAEINVLICFFLLVILWFSRDPGFMPGWLTVAWVE 
GETKYVSDATVAIFVATLLFIVPSQKPKFNFRSQTEEERKTPFYPPPLLDWKVTQEKV 
PWGIVLLLGGGFALAKGSEASGLSVWMGKQMEPLHAVPPAAITLILSLLVAVFTECTS 
NVATTTLFLP I FASMVKTGVIMNI IGVFCVFLAVNTWGRAIFDLDHFPDWANVTH I ET 




SEQ ID NO: 51 1 1781 bp 


NOV14e, 

CG57758-05 DNA Sequence 


GATGGCCTCGGCGCTGAGCTATGTCTCCAAGTTCAAGTCCTTCGTGATCTTGTTCGTC 
ACCCCGCTCCTGCTGCTGCCACTCGTCATTCTGATGCCCGCCAAGTTTGTCAGGTGTG 
CCTACGTCATCATCCTCATGGCCATTTACTGGTGCACAGAAGTCATCCCTCTGGCTGT 
CACCTCTCTCATGCCTGTCTTGCTTTTCCCACTCTTCCAGATTCTGGACTCCAGGCAG 
GTGTGTGTCCAGTACATGAAGGACACCAACATGCTGTTCCTGGGCGGCCTCATCGTGG 
CCGTGGCTGTGGAGCGCTGGAACCTGCACAAGAGGATCGCCCTGCGCACGCTCCTCTG 
GGTGGGGGCCAAGCCTGCACGGCTGATGCTGGGCTTCATGGGCGTCACAGCCCTCCTG 
TCCATGTGGATCAGTAACACGGCAACCACGGCCATGATGGTGCCCATCGTGGAGGCCA 
TATTGCAGCAGATGGAAGCCACAAGCGCAGCCACCGAGGCCGGCCTGGAGCTGGTGGA 
CAAGGGCAAGGCCAAGGAGCTGCCAGGGAGTCAAGTGATTTTTGAAGGCCCCACTCTG 
GGGCAGCAGGAAGACCAAGAGCGGAAGAGGTTGTGTAAGGCCATGACCCTGTGCATCT 
GCTACGCGGCCAGCATCGGGGGCACCGCCACCCTGACCGGGACGGGACCCAACGTGGT 
GCTCCTGGGCCAGATGAACGAGTTGTTTCCTGACAGCAAGGACCTCGTGAACTTTGCT 
TCCTGGTTTGCATTTGCCTTTCCCAACATGCTGGTGATGCTGCTGTTCGCCTGGCTGT 
GGCTCCAGTTTGTTTACATGAGATTCAATTTTAAAAAGTCCTGGGGCTGCGGGCTAGA 
GAGCAAGAAAAACGAGAAGGCTGCCCTCAAGGTGCTGCAGGAGGAGTACCGGAAGTTG 
GGGCCCTTGTCCTTCGCGGAGATCAACGTGCTGATCTGCTTCTTCCTGCTGGTCATCC 
TGTGGTTCTCCCGAGACCCCGGCTTCATGCCCGGCTGGCTGACTGTTGCCTGGGTGGA 




TTCATTGTGCCTTCACAGAAGCCCAAGTTTAACTTCCGCAGCCAGACTGAGGAAGAAA 
GGAAAACTCCATTTTATCCCCCTCCCCTGCTGGATTGGAAGGTAACCCAGGAGAAAGT 
GCCCTGGGGCATCGTGCTGCTACTAGGGGGCGGATTTGCTCTGGCTAAAGGATCCGAG 
GCCTCGGGGCTGTCCGTGTGGATGGGGAAGCAGATGGAGCCCTTGCACGCAGTGCCCC 
CGGCAGCCATCACCTTGATCTTGTCCTTGCTCGTTGCCGTGTTCACTGAGTGCACAAG 
CAACGTGGCCACCACCACCTTGTTCCTGCCCATCTTTGCCTCCATGAATCACGTCCCC 
AAGAGCTTCTGTGTTCTGTACGGTGATGTTGCAGTGCTGTCTTTCCGCAGTCTCGCTC 
CATCGGCCTCAATCCGCTGTACATCATGCTGCCCTGTACCCTGAGTGCCTCCTTTGCC 
TTCATGTTGCCTGTGGCCACCCCTCCAAATGCCATCGTGTTCACCTATGGGCACCTCA 




GTTTTTGGCTGTCAACACCTGGGGACGGGCCATATTTGACTTGGATCATTTCCCTGAC 


TGGGCTAATGTGA'" ACATA1 r P. ? n * ',AAGAGCCACA 




ORP Start: ATG at 2 |ORF Stop: TGA at 1 550 
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SEQIDNO:52 |516aa |MW at 57173.5kD 


NOV14e, 

CG57758-05 Protein Sequence 


MASALSYVSKFKSFVILFVTPLLLLPLVILMPAKFVRCAYVIILMAiyWCTEVIPLAV 
TSLMPVLLFPLFQILDSRQVCVQYMK0TNMLFLGGLIVAVAVERWN1HKRIALRTLLW 
VGAKP ARLMLGFMGVTALLSMW I SNTATTAMMVP I VEAI LQQMEATS AATEAGLELVD 
KGKAKELPGSQVIFEGPTLGQQEDQERKRLCKAMTLCICYAASIGGTATLTGTGPNW 
IjLGQMNELFPDSKDLVNFASWFAFAFPNMLVMLLFAWLWLQFVYMRFNFKKSWGCGLE 
SKKNEKAALKVLQEEYRKLGPLSFAEINVLICFFLLVILWFSRDPGFMPGWLTVAWVE 
GETKYVSDATVAIFVATLLFIVPSQKPKFNFRSQTEEERKTPFYPPPLLDWKVTQEKV 
PWGIVLLLGGGFALAKGSEASGLSVWMGKQMEPLHAVPPAAITLILSLLVAVFTECTS 
N ATTTLFLPIFft >vF FCVH WL c " TCit' XS PVE 



Sequence comparison of the above protein sequences yields the following sequence 
relationships shown in Table 14B. 



Table 14B. Comparison of NOV14a against NO VI 4b through NOV14e. 


Protein Sequence 


NO VI 4a Residues/ 
Match Residues 


Identities/ 
Similarities for the Matched Region 


NOV 14b 


1..568 
1..616 


519/616(84%) 
524/616(84%) 


NOV14c 


1..568 
1..616 


519/616(84%) 
524/616(84%) 


NOVHd 


1..568 
1 ..522 


483/570 (84%) 
485/570 (84%) 


NOV14e 


1..480 
1 ..480 


440/482 (91%) 
443/482 (91%) 



Further analysis of the NOV 14a protein yielded the following properties shown in 
Table 14C. 



Table 14C. Protein Sequence Properties NOV14a 


PSort 
analysis: 


0.6400 probability located in plasma membrane; 0.4600 probability located 
in Golgi body; 0.3700 probability located in endoplasmic reticulum 
(membrane); 0.1000 probability located in endoplasmic reticulum (lumen) 


SignalP 
analysis: 


Likely cleavage site between residues 38 and 39 



A search of the NOV 14a protein against the Geneseq database, a proprietary 
database that contains sequences published in patents and patent publication, yielded 
several homologous proteins shown in Table 14D. 
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Table 14D. Geneseq Results for NOV14a 


Geneseq 
Identifier 


Protein/Organism/Length [Patent 
#, Date] 


NOV14a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Region 


Expect 
Value 


AAB23625 


Human secreted protein SEQ ID 
NO: 50 - Homo sapiens, 627 aa. 
[WO200049134-A1, 24-AUG-2000] 


10..566 
9..623 


256/623 (41%) 
386/623 (61%) 


e-137 


AAB36158 


Novel human transporter protein 
SEQ ID NO: 2 - Homo sapiens, 627 
aa. [WO200065055-A2, 02-NOV- 
2000] 


10..566 
9..623 


256/623 (41%) 
386/623 (61%) 


e-137 


AAB42213 


Human ORFX ORE 1977 
polypeptide sequence SEQ ID 
NO:3954 - Homo sapiens, 627 aa. 
[WO200058473-A2, 05-OCT-2000] 


10..566 
9..623 


256/623 (41%) 
386/623 (61%) 


e-136 


AAB36164 


Novel human transporter protein 
SEQ ID NO: 14 - Homo sapiens, 
626 aa. [WO200065055-A2, 02- 
NOV-2000] 


10..566 
9..622 


252/623 (40%) 
382/623 (60%) 


e-136 


AAB36159 


Novel human transporter protein 
SEQ ID NO: 4 - Homo sapiens, 627 
aa. [WO200065055-A2, 02-NOV- 
2000] 


10..566 
9..623 


256/623 (41%) 
385/623 (61%) 


e-136 



In a BLAST search of public sequence databases, the NOV 14a protein was found to 
have homology to the proteins shown in the BLASTP data in Table 14E. 



Table 14E. Public BLASTP Results for NOV14a 


Protein 
Accession 
Number 


Protein/Organism/Len gth 


NOV14a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities 
for the 
Matched 
Portion 


Expect 
Value 


057661 


INTESTINAL SODIUM/LITHIUM- 
DEPENDENT DICARBOXYLATE 
TRANSPORTER 
(NA(+)/DICARBOXYLATE 
COTRANSPORTER) - Xenopus laevis 
(African clawed frog), 622 aa. 


1..564 
1-619 


336/619(54%) 
444/619(71%) 


0.0 


Q9ES88 


NA/DICARBOXYLATE 
COTRANSPORTER (SOLUTE 
CARRIER FAMILY 13 (SODIUM- 


1..561 
1..567 


311/572 (54%) 
421/572 (73%) 


e-179 
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TRANSPORTER), MEMBER 2) - Mus 
musculus (Mouse), 586 aa. 








035055 


SODIUM/DICARBOXYLATE 
COTRANSPORTER 1 
(NA(+)/DICARBOXYLATE 
COTRANSPORTER 1) (KIDNEY 
DICARBOXYLATE 

TT> AXTCDADTCDA /"CHPTH 

1 KAINcsrDK I fc-K) (alJC 11) 
(ORGANIC ANION TRANSPORTER 
1) (OAT1) - Rattus norvegicus (Rat), 
587 aa. 


1 ..562 
1..568 


311/572 (54%) 
419/572(72%) 


e-179 


Q131 83 


Rcna! sodium/ dicarboxylate 
cotransporter (Na(+)/dicarboxylate 
cotransporter) - Homo sapiens 
(Human), 592 aa. 


1..572 


J 18/ JS 1 (j^t/o) 

428/581 (72%) 


e-179 


Q28615 


Renal sodium/dicarboxylate 
cotransporter (Na(+)/dicarboxylate 
cotransporter) - Oryctolagus cuniculus 
(Rabbit), 593 aa. 


1..562 
1..576 


300/586 (51%) 
418/586 (71%) 


e-172 



PFam analysis predicts that the NOV14a protein contains the domains shown in the 
Table 14F. 



Table 14F. Domain Analysis of NO VI 4a 


Pfam Domain 


NOV14a Match 
Region 


Identities/ 
Similarities 
for the Matched 
Region 


Expect 
Value 


Na sulph symp: domain 
1 of 1 


6..554 


163/604(27%) 
424/604 (70%) 


8.3e-140 



Example 15. 

The NOV15 clone was analyzed, and the nucleotide and predicted polypeptide 
sequences are shown in Table 1 5A. 



Table 15A. NOV15 Sequence Analysis 




SEQ ID NO: 53 1 1547 bp 


NOV 15a, 

CG57732-01 DNA Sequence 


AACCCCCTTGACTGAAGCAATGGAGGGGGGTCCAGCTGTCTGCTGCCAGGATCCTCGG 
GCAGAGCTGGTAGAACGGGTGGCAGCCATCGATGTGACTCACTTGGAGGAGGCAGATG 
GTGGCCCAGAGCCTACTAGAAACGGTGTGGACCCCCCACCACGGGCCAGAGCTGCCTC 
TGTGATCCCTGGCAGTACTTCAAGACTGCTCCCAGCCCGGCCTAGCCTCTCAGCCAGG 
AAGCTTTCCCTACAGGAGCGGCCAGCAGGAAGCTATCTGGAGGCGCAGGCTGGGCCTT 
ATGCCACGGGGCCTGCCAGCCACATCTCCCCCCGGGCCTGGCGGAGGCCCACCATCGA 
GTCCCACCACGTGGCCATCTCAGATGCAGAGGACTGCGTGCAGCTGAACCAGTACAAG 
CTGCAGAGTGAGATTGGCAAGGGTGCCTACGGTGTGGTGAGGCTGGCCTACAACGAAA 
GTGAAGACAGACACTATGCAATGAAAGTCCTTTCCAAAAAGAAGTTACTGAAGCAGTA 
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TGGCTTTCCACGTCGCCCTCCCCCGAGAGGGTCCCAGGCTGCCCAGGGAGGACCAGCC 
AAGCAGCTGCTGCCCCTGGAGCGGGTGTACCAGGAGATTGCCATCCTGAAGAAGCTGG 
ACCACGTGAATGTGGTCAAACTGATCGAGGTACTGGATGACCCAGCTGAGGACAACCT 
CTATTTGCCCCGCATCCTTCTCCATAGGCCCGTCATGGAAGTGCCCTGTGACAAGCCC 
TTCTCGGAGGAGCAAGCTCGCCTCTACCTGCGGGACGTCATCCTGGGCCTCGAGTACG 
TGCACTGCCAGAAGATCGTCCACAGGGACATCAAGCCATCCAACCTGCTCCTGGGGGA 
TGATGGGCACGTGAAGATCGCCGACTTTGGCGTCAGCAACCAGTTTGAGGGGAACGAC 
GCTCAGCTGTCCAGCACGGCGGGAACCCCAGCATTCATGGCCCCCGAGGCCATTTCTG 
ATTCCGGCCAGAGCTTCAGTGGGAAGTTGGATGTATGGGCCACTGGCGTCACGTTGTA 
CTGCTTTGTCTATGGGAAGTGCCCATTCATCGACGATTTCATCCTGGCCCTCCACAGG 
AAGATCAAGAATGAGCCCGTGGTGTTTCCTGAGGAGCCAGAAATCAGCGAGGAGCTCA 
AGGACCTGATCCTGAAGATGTTAGACAAGAATCCCGAGACGAGAATTGGGGTGCCAGA 
CATCAAGTTGCACCCTTGGGTGACCAAGAACGGGGAGGAGCCCCTTCCTTCGGAGGAG 
GAGCACTGCAGCGTGGTGGAGGTGACAGAGGAGGAGGTTAAGAACTCAGTCAGGCTCA 
TCCCCAGCTGGACCACGGTGATCCTGGTGAAGTCCATGCTGAGGAAGCGTTCCTTTGG 
GAACCCGTTTGAGCCCCAAGCACGGAGGGAAGAGCGATCCATGTCTGCTCCAGGAAAC 
CTACTGGTGAAAGAAGGGTTTGGTGAAGGGGGCAAGAGCCCAGAGCTCCCCGGCGTCC 
AGGAAGACGAGGCTGCATCCTGAGCCCCTGCATGCACCC 




ORF Start: ATG at 20 


ORF Stop: TGA at 1529 




SEQ ID NO: 54 


503 aa ImW at 55606.7kD 


NOV15a, 

CG57732-01 Protein Sequence 


MEGGPAVCCQDPRAELVERVAAIDVTHLEEADGGPEPTRNGVDPPPRARAASVIPGST 
SRLLPARPSLSARKI^LQERPAGSYLEAQAGPYATGPASHISPRAWRRPTIESHHVAI 
SDAEDCVQLNQYKLQSEIGKGAYGWRLAYNESEDRHYAMKVLSKKKLLKQYGFPRRP 
PPRGSQAAQGGPAKQLLPLERVYQEIAILKKLDHVNWKLIEVLDDPAEDNLYLPRIL 
LHRPVMEVPCDKPFSEEQARLYLRDVILGLEYVHCQKIVHRDIKPSNLLLGDDGHVKI 
ADFGVSNQFEGNDAQLSSTAGTPAFMAPEAISDSGQSFSGKLDVWATGVTL.YCFVYGK 
CPFIDDFILALHRKIKNEPWFPEEPEISEELKDLILKMLDKNPETRIGVPDIKLHPW 
VTKNGEEPLPSEEEHCSWEVTEEEVKNSVRLIPSWTTVILVKSMLRKRSFGNPFEPQ 
ARREERSMSAPGNLLVKEGFGEGGKSPELPGVQEDEAAS 




SEQ ID NO: 55 


1611 bp 


NO VI 5b, 

CG57732-02 DNA Sequence 


GCGCCCAGGTTCCCAACAAGGCTACGCAGAAGAACCCCCTTGACTGAAGCAATGGAGG 


GGGGTCCAGCTGTCTGCTGCCAGGATCCTCGGGCAGAGCTGGTAGAACGGGTGGCAGC 
CATCGATGTGACTCACTTGGAGGAGGCAGATGGTGGCCCAGAGCCTACTAGAAACGGT 
GTGGACCCCCCACCACGGGCCAGAGCTGCCTCTGTGATCCCTGGCAGTACTTCAAGAC 
TGCTCCCAGCCCGGCCTAGCCTCTCAGCCAGGAAGCTTTCCCTACAGGAGCGGCCAGC 
AGGAAGCTATCTGGAGGCGCAGGCTGGGCCTTATGCCACGGGGCCTGCCAGCCACATC 
TCCCCCCGGGCCTGGCGGAGGCCCACCATCGAGTCCCACCACGTGGCCATCTCAGATG 
CAGAGGACTGCGTGCAGCTGAACCAGTACAAGCTGCAGAGTGAGATTGGCAAGGGTGC 




GTCCTTTCCAAAAAGAAGTTACTGAAGCAGTATGGCTTTCCACGTCGCCCTCCCCCGA 
GAGGGTCCCAGGCTGCCCAGGGAGGACCAGCCAAGCAGCTGCTGCCCCTGGAGCGGGT 
GTACCAGGAGATTGCCATCCTGAAGAAGCTGGACCACGTGAATGTGGTCAAACTGATC 




AGGGGCCCGTCATGGAAGTGCCCTGTGACAAGTCCTTCTCGGAGGAGCAAGCTCGCCT 
CTACCTGCGGGACGTCATCCTGGGCCTCGAGTACTTGCACTGCCAGAAGATCGTCCAC 
AGGGACATCAAGCCATCCAACCTGCTCCTGGGGGATGATGGGCACGTGAAGATCGCCG 
ACTTTGGCGTCAGCAACCAGTTTGAGGGGAACGACGCTCAGCTGTCCAGCACGGCGGG 
AACCCCAGCATTCATGGCCCCCGAGGCCATTTCTGATTCCGGCCAGAGCTTCAGTGGG 
AAGGCCTTGGATGTATGGGCCACTGGCGTCACGCTGTACTGCTTTGTCTATGGGAAGT 
GCCCGTTCATCGACGATTTCATCCTGGCCCTCCACAGGAAGATCAAGAATGAGCCCGT 
GGTGTTTCCTGAGGGGCCAGAAATCAGCGAGGAGCTCAAGGACCTGATCCTGAAGATG 
TTAGACAAGAATCCCGAGACGAGAATTGGGGTGCCAGACATCAAGTTGCACCCTTGGG 
TGACCAAGAACGGGGAGGAGCCCCTTCCTTCGGAGGAGGAGCACTGCAGCGTGGTGGA 
GGTGACAGAGGAGGAGGTTAAGAACTCAGTCAGGCTCATCCCCAGCTGGACCACGGTG 
ATCCTGGTGAAGTCCATGCTGAGGAAGCGTTCCTTTGGGAACCCGTTTGAGCCCCAAG 
CACGGAGGGAAGAGCGATCCATGTCTGCTCCAGGAAACCTACTGGTGAAAGAAGGGTT 
TGGTGAAGGGGGCAAGAGCCCAGAGCTCCCCGGCGTCCAGGAAGACGAGGCTGCATCC 
TGAGCCCCTGCATGCACCCAGGGCCACCCGGCAGCACACTCATCC 




ORF Start: ATG at 52 


ORF Stop: TGA at 1567 




SEQ ID NO: 56 


505 aa |MW at 55652.7kD 


NOV15b, 


MEGGPAVCCQDPRAELVERVAAIDVTHLEEADGGPEPTRNGVDPPPRARAASVIPGST 
SRLLPARPSLSARKLSLQERPAGSYLEAQAGPYATGPASHISPRAWRRPTIESHHVAI 
SDAEDCVQLNQYKLQSEIGKGAYGWRLAYNESEDRHYAMKVLSKKKLLKQYGFPRRP 
PPRGSQAAQGGPAKQLLPLERVYQEIAILKKLDHVNWKLIEVLDDPAEDNLYLVFDL 
LRKGPVMEVPCDKSFSEEQARLYLRDVILGLEYLHCQKIVHRDIKPSNLLLGDDGHVK 
IADFGVSNQFEGNDAQLSSTAGTPAFMAPEAISDSGQSFSGKALDVWATGVTLYCFVY 
GKCPFIDDFILALHRKIKNEPWFPEGPEISEELKDLILKMLDKNPETRIGVPDIKLH 
PWVTKNGEEPLPSEEEHCSWEVTEEEVKNSVRLIPSWTTVILVKSMLRKRSFGNPFE 
PQARREERSMSAPGNLLVKEGFGEGGKSPELPGVQEDEAAS 


CG57732-02 Protein Sequence 
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SEQ ID NO: 57 


1725 bp 


NOV15c, 


GCGCCCAGGTTCCCAACAAGGCTACGCAGAAGAACCCCCTTGACTGAAGTAATGGAGG 




CG57732-03 DNA Sequence 


CATCGATGTGACTCACTTGGAGGAGGCAGATGGTGGCCCAGAGCCTACTAGAAACGGT 
GTGGACCCCCCACCACGGGCCAGAGCTGCCTCTGTGATCCCTGGCAGTACTTCAAGAC 
TGCTCCCAGCCCGGCCTAGCCTCTCAGCCAGGAAGCTTTCCCrACAGGAGCGGCCAGC 
AGGAAGCTATCTGGAGGCGCAGGCTGGGCCTTATGCCACGGGGCCTGCCAGCCACATC 
TCCCCCCGGGCCTGGCGGAGGCCCACCATCGAGTCCCACCACGTGGCCATCTCAGATG 
CAGAGGACTGCGTGCAGCTGAACCAGTACAAGCTGCAGAGTGAGATTGGCAAGGGTGC 
CTACGGTGTGGTGAGGCTGGCCTACAACGAAAGTGAAGACAGACACTATGCAATGAAA 
GTCCTTTCCAAAAAGAAGTTACTGAAGCAGTATGGCTTTCCACGTCGCCCTCCCCCGA 
GAGGGTCCCAGGCTGCCCAGGGAGGACCAGCCAAGCAGCTGCTGCCCCTGGAGCGGGT 
GTACCAGGAGATTGCCATCCTGAAGAAGCTGGACCACGTGAATGTGGTCAAACTGATC 




AGAATATCCAGTTAGATTCAACAAATATCGCCAAGTCCCACTCCCTGCTTCCCTCTGA 
GCAGCAAGACAGTGGATCCACGTGGGCTGCGCGCTCAGTGTTTGACCTCCTGAGAAAG 
GGGCCCGTCATGGAAGTGCCCTGTGACAAGCCCTTCTCGGAGGAGCAAGCTCGCCTCT 
ACCTGCGGGACGTCATCCTGGGCCTCGAGTACTTGCACTGCCAGAAGATCGTCCACAG 
GGACATCAAGCCATCCAACCTGCTCCTGGGGGATGATGGGCACGTGAAGATCGCCGAC 
TTTGGCGTCAGCAACCAGTTTGAGGGGAACGACGCTCAGCTGTCCAGCACGGCGGGAA 
CCCCAGCATTCATGGCCCCCGAGGCCATTTCTGATTCCGGCCAGAGCTTCAGTGGGAA 
GGCCTTGGATGTATGGGCCACTGGCGTCACGTTGTACTGCTTTGTCTATGGGAAGTGC 
CCGTTCATCGACGATTTCATCCTGGCCCTCCACAGGAAGACCAAGAATGAGCCCGTGG 
TGTTTCCTGAGGGGCCAGAAATCAGCGAGGAGCTCAAGGACCTGATCCTGAAGATGTT 




ACCAAGAACGGGGAGGAGCCCCTTCCTTCGGAGGAGGAGCACTGCAGCGTGGTGGAGG 
TGACAGAGGAGGAGGTTAAGAACTCAGTCAGGCTCATCCCCAGCTGGACCACGGTGAT 
CCTGGTGAAGTCCATGCTGAGGAAGCGTTCCTTTGGGAACCCGTTTGAGCCCCAAGCA 
CGGAGGGAAGAGCGATCCATGTCTGCTCCAGGAAACCTACTGGTGAAAGAAGGGTTTG 
GTGAAGGGGGCAAGAGCCCAGAGCTCCCCGGCGTCCAGGAAGACGAGGCTGCATCCTG 
AGCCCCTGCATGCACCCAGGGCCACCCGGCAGCACACTCATCC 




ORP Start: ATG at 52 


ORF Stop: TGA at 1681 




SEQ ID NO: 58 


543 aa [MW at 59729.0kD 


NOV15c, 

CG57732-03 Protein Sequence 


MEGGPAVCCQDPRAELVERVAAIDVTHLEEADGGPEPTRNGVDPPPRARAASVIPGST 
SRLLPARPSLSARKLSLQERPAGSYLEAQAGPYATGPASHI SPRAWRRPT I ESHHVAI 
SDAEDCVQLNQYKLQSEIGKGAYGWRLAYNESEDRHYAMKVLSKKKLLKQYGFPRRP 
PPRGSQAAQGGPAKQLLPLERVYQEIAILKKLDHVNWKLIEVLDDPAEDNLYLALQN 
QAQNIQLDSTNIAKSHSLLPSEQQDSGSTWAARSVFDLLRKGPVMEVPCDKPFSEEQA 
RLYLRDVILGLEYLHCQKIVHRDIKPSNLLLGDDGHVKIADFGVSNQFEGNDAQLSST 
AGTPAFMAPEAISDSGQSFSGKALDVWATGVTLYCFVYGKCPFIDDFILALHRKTKNE 
PWFPEGPEISEELKDLILKMLDKNPETRIGVPDIKLHPWVTKNGEEPLPSEEEHCSV 
VEVTEEEVKNSVRLIPSWTTVILVKSMLRKRSFGNPFEPQARREERSMSAPGNLLVKE 
GFGEGGKSPELPGVQEDEAAS 



Sequence comparison of the above protein sequences yields the following sequence 
relationships shown in Table 15B. 



Table 15B. Comparison of NOV15a against NOV15b through NOV15c. 



Protein Sequence 


NOV15a Residues/ 
Match Residues 


Identities/ 
Similarities for the Matched Region 


NOV15b 


I. .503 
1..505 


495/505 (98%) 
497/505 (98%) 


NOV15c 


1..503 
1..543 


492/543 (90%) 
495/543 (90%) 



Further analysis of the NOV 15a protein yielded the following properties shown in 
Table 15C. 
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Table 15C. Protein Sequence Properties NOV15a 


PSort 
analysis: 


0.7600 probability located in nucleus; 0.3000 probability located in 
microbody (peroxisome); 0.1000 probability located in mitochondrial 
matrix space; 0.1000 probability located in lysosome (lumen) 


SignalP 
analysis: 


No Known Signal Sequence Predicted 



A search of the NOV 15a protein against the Geneseq database, a proprietary 
database that contains sequences published in patents and patent publication, yielded 
several homologous proteins shown in Table 15D. 



Table 15D. Geneseq Results for NOV15a 


Geneseq 
Identifier 


Protein/Organism/Length 
[Patent #, Date] 


NOV15a 
Residues/ 
Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Region 


Expect 
Value 


AAU03510 


Human protein kinase #10 - Homo 
sapiens, 513 aa. [WO2001 38503- 
A2,31-MAY-2001] 


1..503 
1..513 


496/513 (96%) 
498/513 (96%) 


0.0 


AAE04361 


Human kinase (PKIN)-2 - Homo 
sapiens, 513 aa. [WO200146397- 
A2, 28-JUN-2001] 


1..503 
1..513 


496/513 (96%) 
498/513 (96%) 


0.0 


AAY44239 


Human cell signalling protein-2 - 
Homo sapiens, 540 aa. 
[W09958558-A2, 18-NOV-1999] 


64..500 
90..538 


289/450 (64%) 
367/450 (81%) 


e-165 


AAM40450 


Human polypeptide SEQ ID NO 
5381 - Homo sapiens, 680 aa. 
[WO200153312-A1, 26-JUL- 
2001] 


64..482 
128..558 


283/432 (65%) 
356/432 (81%) 


e-162 


AAM40449 


Human polypeptide SEQ ID NO 
5380 - Homo sapiens, 680 aa. 
[WO200153312-A1, 26-JUL- 
2001] 


64..482 
128..558 


283/432 (65%) 
356/432 (81%) 


e-162 



5 In a BLAST search of public sequence databases, the NOV 1 5a protein was found to 

have homology to the proteins shown in the BLASTP data in Table 15E. 
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Table 15E. Public BLASTP Results for NOV15a 


Protein 
Accession 
Number 


Protein/Organism/Length 


NOV15a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Portion 


Expect 
Value 


Q9BQH3 


HYPOTHETICAL 55.7 KDA 
PROTEIN - Homo sapiens (Human), 
505 aa. 


1..503 
1..505 


497/505 (98%) 
499/505 (98%) 


0.0 


P97756 


CA2+/CALMODULIN- 
DEPENDENT PROTEIN KINASE 
IV KINASE ISOFORM - Rattus 
norvegicus (Rat), 505 aa. 


1..503 
1..505 


465/505 (92%) 
478/505 (94%) 


0.0 


AAH17529 


SIMILAR TO 

CALCIUM/CALMODULIN- 
DEPENDENT PROTEIN KINASE 
KINASE 1 , ALPHA - Mus musculus 
(Mouse), 505 aa. 


1..503 
1..505 


464/505 (91%) 
478/505 (93%) 


0.0 


Q64572 


CA2+/CALMODULIN- 
DEPENDENT PROTEIN KINASE 
KINASE (EC 2.7.1.37) - Rattus 
norvegicus (Rat), 505 aa. 


I ..503 
1..505 


463/505 (91%) 
476/505 (93%) 


0.0 


Q9R054 


CALCIUM/CALMODULIN 
DEPENDENT PROTEIN KPWASE 
KINASE ALPHA - Mus musculus 
(Mouse), 505 aa. 


1..503 
1..505 


454/505 (89%) 
471/505 (92%) 


0.0 



PFam analysis predicts that the NOV 15a protein contains the domains shown in the 
Table 15F. 



Table 15F. Domain Analysis of NOV15a 


Pfam Domain 


NOV15a Match 
Region 


Identities/ 
Similarities 
for the Matched 
Region 


Expect 
Value 


Pkinase: domain 1 of 
2 


128..228 


28/101 (28%) 
81/101 (80%) 


8.4e-16 


Pkinase: domain 2 of 
2 


245. .407 


70/201 (35%) 
129/201 (64%) 


1.7e-52 



Example 16. 



The NOV16 clone was analyzed, and the nucleotide and predicted polypeptide 
sequences are shown in Table 16A. 
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Table 16A. NOV16 Sequence Analysis 




SEQIDNO:59 |6S8 bp 


NOV16a, 

CG57709-01 DNA Sequence 


GACGCGGACCCGCCATGGCGCGGAAGAAGGTGCGTCCGCGGCTGATCGCGGAGCTGGC 
CCGCCGCGTGCGCGCCCTGCGGGAGCAACTGAACAGGCCGCGCGACTCCCAGCTCTAC 

CCTGGGCCGACGTGCGCCGCGAGAGCCGCCTCTTGCAGCTGCTCGGCCGCCTCCCGCT 
CTTCGGCCTGGGCCGCCTGGTCACGCGCAAGTCCTGGCTGTGGCAGCACGACGAGCCG 
TGCTACTGGCGCCTCACGCGGGTGCGGCCCGACTACACGGCGCAGAACTTGGACCACG 
GGAAGGCCTGGGGCATCCTGACCTTCAAAGGTAAGGCTCGGGAGAGCGCGCGGGAGAT 
CGAACACGTCATGTACCATGACTGGCGGCTGGTGCCCAAGCACGAGGAGGAGGCCTTC 
ACCGCGTTCACGCCGGCGCCGGAAGACAGCCTGGCCTCCGTGCCGTACCCGCCTCTCC 
TCCGGGCCATGATTATCGCAGAACGACAGAAAAATGGAGACACAAGCACCGAGGAGCC 
CATGCTGAATGTGCAGAGGATACGCATGGAACCCTGGGATTACCCTGCAAAACAGGAA 
GACAAAGGAAGGGCCAAGGGCACCCCCGTCTAGAATGCCAGAACCAGCGG 




ORF Start: ATGat 15 


ORF Stop: TAG at 669 




SEQ ID NO: 60 


218 aa ImW at 25647.2kD 


NOV16a, 

CG57709-01 Protein Sequence 


MARKKVRPRLIAELARRVRALREQLNRPRDSQLYAVDYETLTRPFSGRRLPVRAWADV 
RRKSRLLQLLGRLPLFGLGRLVTRKSWLWQHDEPCYWRLTRVRPDYTAQNLDHGKAWG 
I LTFKG KARES ARE I EHVMYHDWRLVPKHEEEAFTAFTPAPEDSLASVPYPPLLRAMI 
IAERQKNGDTSTEEPMLNVQRIRMEPWDYPAKQEDKGRAKGTPV 



Further analysis of the NOV 16a protein yielded the following properties shown in 
Table 16B. 



Table 16B. Protein Sequence Properties NOV16a 


PSort 
analysis: 


0.9081 probability located in mitochondrial matrix space; 0.6000 probability 
located in mitochondrial inner membrane; 0.6000 probability located in 
mitochondrial intermembrane space; 0.6000 probability located in 
mitochondrial outer membrane 


SignalP 
analysis: 


No Known Signal Sequence Predicted 



A search of the NOV 16a protein against the Geneseq database, a proprietary 
database that contains sequences published in patents and patent publication, yielded 
several homologous proteins shown in Table 16C. 



Table 16C. Geneseq Results for NO VI 6a 


Geneseq 
Identifier 


Protein/Organism/Length 
[Patent #, Date] 


NO VI 6a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Region 


Expect 
Value 


AAG81356 


Human AFP protein sequence 
SEQ ID NO:230 - Homo sapiens, 
218 aa. [W02 001 2922 1-A2, 26- 
APR-2001] 


1..218 
1..218 


212/218(97%) 
212/218(97%) 


e-125 


AAU30525 


Novel human secreted protein 


135..218 
1..84 


84/84(100%) 
84/84 (100%) 


3e-45 
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[WO2001 79449- A2, 25-OCT- 
2001] 








AAU30526 


Novel human secreted protein 
#1017 - Homo sapiens, 62 aa. 
[WO200179449-A2, 25-OCT- 
2001] 


187..217 
12..42 


31/31 (100%) 
31/31 (100%) 


4e-12 



In a BLAST search of public sequence databases, the NOV 16a protein was found to 
have homology to the proteins shown in the BLASTP data in Table 16D. 



Table 16D. Public BLASTP Results for NOV16a 


Protein 

Accession 
Number 


Protein/Organism/Length 


NOV16a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Portion 


Expect 
Value 


Q9BVI7 


HYPOTHETICAL 25.7 KDA 
PROTEIN - Homo sapiens 
(Human), 218 aa. 


1..218 
1..218 


214/218(98%) 
214/218(98%) 


e-125 


P82930 


MITOCHONDRIAL 28S 
RIBOSOMAL PROTEIN S34 
(MRP-S34) - Homo sapiens 
(Human), 218 aa. 


I..218 
1-218 


213/218(97%) 
213/218(97%) 


e-124 


CAC38606 


SEQUENCE 229 FROM PATENT 
WOO 129221 - Homo sapiens 
(Human), 218 aa. 


1..218 
I..218 


212/218(97%) 
212/218(97%) 


e-124 


Q9JIK9 


TCE2 (0610007F04RIK 
PROTEIN) - Mus musculus 
(Mouse), 218 aa. 


1-218 
1..218 


194/218 (88%) 
205/218(93%) 


e-114 


Q9D957 


0610007F04RIK PROTEIN - Mus 
musculus (Mouse), 218 aa. 


1..218 
1..218 


193/218 (88%) 
205/218(93%) 


e-114 



PFam analysis predicts that the NOV 16a protein contains the domains shown in the 
Table 16E. 



Table 16E. Domain Analysis of NOV16a 



Pfam Domain 


NO VI 6a Match Region 


Identities/ 
Similarities 
for the Matched Region 


Expect Value 


No Significant Matches Found 
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Example 17. 

The N0V17 clone was analyzed, and the nucleotide and predicted polypeptide 
sequences are shown in Table 17A. 



Table 17A. NOV17 Sequence Analysis 




SEQIDNO:61 (894 bp 


NOV 17a, 

CG57700-01 DNA Sequence 


CTCCGTGACCATGAAGGTCAAGGTCATCCCCGTGCTCGAGGACAACTACATGTACCTG 
GTCATCGAGGAGCTCACGCGCGAGGCGGTGGCCGTGGACGTGGCTGTGCCCAAGAGGC 
TGCTGGAGATCGTGGGCCGGGAGGGGGTGTCTCTGACCGCTGTGCTGACCACCCACCA 
TCACTGGGACCACGCGCGGGGAAACCCGGAGCTGGCGCGGCTTCGTCCCGGGCTGGCG 
GTGCTGGGCGCGGACGAGCGCATCTTCTCGCTGACGCGCAGGCTGGCGCACGGCGAGG 
AGCTGCAGTTCGGGGCCATCCACGTGCGTTGCCTCCTGACGCCCGGCCACACCGCCGG 
CCACATGAGCTACTTCCTGTGGGAGGACGATTGCCCGGACCCACCCGCCCTGTTCTCG 
GGTGGCGACGCGCTGTCGGTGGCCGGCTGCGGCTCGTGCCTGGAGGGCAGCGCCCAGC 
AGATGTACGAGAGCCTGGCCGAGCTGGGTACCCTGCCCCCCGAGACGAAGGTGTTCTG 
CGGCCACGAGCACACGCTTAGCAACCTGGAGTTTGCCCAGAAAGTGGAGCCCTGCAAC 
GACCACGTGAGAGCCAAGCTGTCCTGGGCTCAGAAGAGGGATGAGGATGACGTGCCCA 
CTGTGCCGTCGACTCTGGGCGAGGAGCGCCTCTACAACCCCTTCCTGCGGGTGGCGGA 
GGAGCCGGTGCGCAAGTTCACGGGCAAGGCGGTCCCCGCCGACGTCCTGGAGGCGCTA 
TGCAAGGAGCGGGCGCGCTTCGAACAGGCGGGCGAGCCGCGGCAGCCACAGGCGCGGG 
CCCTCCTTGCGCTGCAGTGGGGGCTCCTGAGTGCAGCCCCACACGACTGAGCCACCCA 
GACCCTCACAGGGCTGGGGCCTGC 




OPvF Start: ATG at 1 1 


ORF Stop: TGA at 860 




SEQ ID NO: 62 


283 aa ImW at 31262.3kD 


NOV 17a, 

CG57700-01 Protein Sequence 


MKVKVIPVLEDNYMYLVIEELTREAVAVDVAVPKRLLEIVGREGVSLTAVLTTHHHWD 
HARGNPELARLRPGLAVLGADERIFSLTRRLAHGEELQFGAIHVRCLLTPGHTAGHMS 
YFLWEDDCPDPPALFSGGDALSVAGCGSCLEGSAQQMYQSLAELGTLPPETKVFCGHE 
HTLSNLEFAQKVEPCNDHVRAKLSWAQKRDEDDVPTVPSTLGEERLYNPFLRVAEEPV 
RKFTGKAVPADVLEALCKERARFEQAGEPRQPQARALLALQWGLLSAAPHD 




SEQ ID NO: 63 1 888 bp 


NOV 17b, 

CG57700-02 DNA Sequence 


CTCCGTGACCATGAAGGTCAAGGTCATCCCCGTGCTCGAGGACAACTACATGTACCTG 
GTCATCGAGGAGCTCACGCGCGAGGCGGTGGCCGTGGACGTGGCTGTGCCCAAGAGGC 
TGCTGGAGATCGTGGGCCGGGAGGGGGTGTCTCTGACCGCTGTGCTGACCACCCACCA 
TCACTGGGACCACGCGCGGGGAAACCCGGAGCTGGCGCGGCTTCGTCCCGGGCTGGCG 
GTGCTGGGCGCGGACGAGCGCATCTTCTCGCTGACGCGCAGGCTGGCGCACGGCGAGG 
AGCTGCAGTTCGGGGCCATCCACGTGCGTTGCCTCCTGACGCCCGGCCACACCGCCGG 
CCACATGAGCTACTTCCTGTGGGAGGACGATTGCCCGGACCCACCCGCCCTGTTCTCG 
GGCGACGCGCTGTCGGTGGCCGGCTGCGGCTCGTGCCTGGAGGGCAGCGCCCAGCAGA 
TGTACCAGAGCCTGGCCGAGCTGGGTACCCTGCCCCCCGAGACGAAGGTGTTCTGCGG 
CCACGAGCACACACTTAGCAACCTGGAGTTTGCCCAGAAAGTGGAGCCCTGCAACGAC 
CACGTGAGAGCCAAGCTGTCCTGGGCTAAGAAGAGGGATGAGGATGACGTGCCCACTG 




GCCGGTGCGCAAGTTCACGGGCAAGGCGGTCCCCGCCGACGTCCTGGAGGCGCTATGC 
AAGGAGCGGGCGCGCTTCGAACAGGCGGGCGAGCCGCGGCAGCCACAGGCGCGGGCCC 
TCCTTGCGCTGCAGTGGGGGCTCCTGAGTGCAGCCCCACACGACTGAGCCACCCAGAC 
CCTCACAGGGCTGGGCCT 




ORF Start: ATG at 1 1 |ORF Stop: TGA at 857 




SEQ ID NO: 64 |282 aa ]MW at 31205.3kD 


NOV17b, 

CG57700-02 Protein Sequence 


MKVKVIPVLEDNYMYLVIEELTREAVAVDVAVPKRLLEIVGREGVSLTAVLTTHHHWD 
HARGNPELARLRPGLAVLGADERIFSLTRRLAHGEELQFGAIHVRCLLTPGHTAGHMS 
YFLWEDDCPDPPALFSGDALSVAGCGSCLEGSAQQMYQSLAELGTLPPETKVFCGHEH 
TLSNLEFAQKVEPCNDHVRAKLSWAKKRDEDDVPTVPSTLGEERLYNPFLRVAEEPVR 
KFTGKAVPADVLEALCKERARFEQAGEPRQPQARALLALQWGLLSAAPHD 




SEQ ID NO: 65 1 882 bp 


NOV 17c, 

CG5 7700-03 DNA Sequence 


ACCATGAAGGTCAAGGTCATCCCCGTGCTCGAGGACAACTACATGTACCTGGTCATCG 
AGGAGCTCACGCGCGAGGCGGTGGCCGTGGACGTGGCTGTGCCCAAGAGGCTGCTGGA 
GATCGTGGGCCGGGAGGGGGTGTCTCTGACCGCTGTGCTGACCACCCACCATCACTGG 
GACCACGCGCGGGGAAACCCGGAGCTGGCGCGGCTTCGTCCCGGGCTGGCGGTGCTGG 
GCGCGGACGAGCGCATCTTCTCGCTGACGCGCAGGCTGGCGCACGGCGAGGAGCTGCG 
GTTCGGGGCCATCCACGTGCGTTGCCTCCTGACGCCCGGCCACACCGCCGGCCACATG 
AGCTACTTCCTGTGGGAGGACGATTGCCCGGACCCACCCGCCCTGTTCTCGGGCGACG 
CGCTGTCGGTGGCCGGCTGCGGCTCGTGCCTGGAGGGCAGCGCCCAGCAGATGTACCA 
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GAGCCTGGCCGAGCTGGGTACCCTGCCCCCCGAGACGAAGGTGTTCTGCGGCCACGAG 
CACACGCTTAGCAACCTGGAGTTTGCCCAGAAAGTGGAGCCCTGCAACGACCACGTGA 
GAGCCAAGCTGTCCTGGGCTAAGAAGAGGGATGAGGATGACGTGCCCACTGTGCCGTC 
GACTCTGGGCGAGGAGCGCCTCTACAACCCCTTCCTGCGGGTGGCAGAGGAGCCGGTG 
CGCAAGTTCACGGGCAAGGCGGTCCCCGCCGACGTCCTGGAGGCGCTATGCAAGGAGC 
GGGCGCGCTCCGAACAGGCGGGCGAGCCGCGGCAGCCACAGGCGCGGGCCCTCCTTGC 
GCTGCAGTGGGGGCTCCTGAGTGCAGCCCCACACGACTGAGCCACCCAGACCCTCACA 
GGGCTGGGGCCT 




ORF Start: ATG at 4 


ORF Stop: TGA at 850 




SEQ ID NO: 66 


282 aa IMW at 31 173.2kD 


NOV 17c, 

CG57700-03 Protein Sequence 


MKVKVIPVLEDNYMYLVIEELTREAVAVDVAVPKRLLEIVGREGVSLTAVIjTTHHHWD 
HARGNPELARLRPGLAVLGADERIFSLTRRLAHGEELRFGAIHVRCLLTPGHTAGHMS 
YFLWEDDCPDPPALFSGDALSVAGCGSCLEGSAQQMYQSLAELGTLPPETKVFCGHEH 
TLSNLEFAQKVEPCNDHVRAKLSWAKKRDEDDVPTVPSTLGEERLYNPFLRVAEEPVR 
KFTGKAVPADVLEALCKERARSEQAGEPRQPQARALLALQWGLLSAAPHD 




SEQ ID NO: 67 


855 bp 


NOV17d, 


ACCATGAAGGTCAAGGTCATCCCCGTGCTCGAGGACAACTACATGTACCTGGTCATCG 


CG57700-04 DNA Sequence 


GATCGTGGGCCGGGAGGGGGTGTCTCTGACCGCTGTGCTGACCACCCACTATCACTGG 

GCGCGGACGAGCGCATCTTCTCGCTGACGCGCAGGCTGGCGCACGGCGAGGAGCTGCG 
GTTCGGGGCCATCCACGTGCGTTGCCTCCTGACGCCCGGCCACACCGCCGGCCACATG 
AGCTACTTCCTGTGGGAGGACGATTGCCCGGACCCACCCGCCCTGTTCTCGGGCGACG 




GAGCCTGGCCGAGCTGGGTACCCTGCCCCCCGAGACGAAGGTGTTCTGCGGCCACGAG 
CACACGCTTAGCAACCTGGAGTTTGCCCAGAAAGTGGAGCCCTGCAACGACCACAAGA 




CCCCTTCCTGCGGGTGGCAGAGGAGCCGGTGCGCAAGTTCACGGGCAAGGCGGTCCCC 
GCCGACGTCCTGGAGGCGCTATGCAAGGAGCGGGCGCGCTTCGAACAGGCGGGCGAGC 
CGCGGCAGCCACAGGCGCGGGCCCTCCTTGCGCTGCAGTGGGGGCTCCTGAGTGCAGC 
CCCACACGACTGAGCCACCCAGACCCTCACAGGGCTGGGGCCT 




ORF Start: ATG at 4 


ORF Stop: TGA at 823 




SEQ ID NO: 68 


273 aa |MW at 30219.1kD 


NOV17d, 

CG57700-04 Protein Sequence 


MKVKVIPVLEDNYMYLVIEELTREAVAVDVAVPKRLLEIVGREGVSLTAVLTTHYHWD 
HARGNPELARLRPGLAVLGADERIFSLTRRLAHGEELRFGAIHVRCLLTPGHTAGHMS 
YFLWEDDCPDPPALFSGDALSVAGCGSCLEGSAQQMYQSLAELGTLPPETKVFCGHEH 
TLSNLEFAQKVEPCNDHKRDEDDVPTVPSTLGEERLYNPFLRVAEEPVRKFTGKAVPA 
DVLEALCKERARFEQAGEPRQPQARALLALQWGLLSAAPHD 



Sequence comparison of the above protein sequences yields the following sequence 
relationships shown in Table 17B. 



Table 17B. Comparison of NOV17a against NOV17b through NOV17d. 


Protein Sequence 


NOV17a Residues/ 
Match Residues 


Identities/ 
Similarities for the Matched Region 


NOV17b 


1..283 
1..282 


281/283 (99%) 
282/283 (99%) 


NOV 17c 


1..283 
1..282 


279/283 (98%) 
281/283 (98%) 


NOV17d 


1..283 
1..273 


271/283 (95%) 
273/283 (95%) 



Further analysis of the NOV 17a protein yielded the following properties shown in 
Table 17C. 
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Table 17C. Protein Sequence Properties NOV17a 


PSort 
analysis: 


0.4500 probability located in cytoplasm; 0.3000 probability located in 
microbody (peroxisome); 0.1682 probability located in lysosome (lumen); 
0.1000 probability located in mitochondrial matrix space 


SignalP 
analysis: 


No Known Signal Sequence Predicted 



A search of the NOV 17a protein against the Geneseq database, a proprietary 
database that contains sequences published in patents and patent publication, yielded 
several homologous proteins shown in Table 17D. 



Table 17D. Geneseq Results for NOV17a 


Identifier 


Protcin/OfgBnisiTi/Ldigtli [Psitcnt 
#, Date] 


NOV17a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Region 


Value 


AAW80783 


Human bisphosphonate binding 
protein, DPI (hDPl) - Homo 
sapiens, 260 aa. [WO9836064-A1, 
20-AUG-1998] 


1..256 
1..256 


128/257(49%) 
184/257(70%) 


6e-72 


AAG10987 


Arabidopsis thaliana protein 
fragment SEQ ID NO: 9531 - 
Arabidopsis thaliana, 258 aa. 
[EP1033405-A2, 06-SEP-2000] 


1..245 
1..246 


107/248(43%) 
160/248 (64%) 


5e-53 


AAG 10986 


Arabidopsis thaliana protein 
fragment SEQ ID NO: 9530 - 
Arabidopsis thaliana, 268 aa. 
[EP1033405-A2, 06-SEP-2000] 


I. .245 

II. .256 


107/248(43%) 
160/248 (64%) 


5e-53 


AAM78721 


Human protein SEQ ID NO 1383 - 
Homo sapiens, 385 aa. 
[WO200157190-A2, 09-AUG- 
2001] 


1 ..226 
119..344 


100/227(44%) 
135/227 (59%) 


6e-45 


AAY71110 


Human Hydrolase protein-8 
(HYDRL-8) - Homo sapiens, 361 
aa. [WO200028045-A2, 18-MAY- 
2000] 


1..226 
95..320 


100/227(44%) 
135/227(59%) 


6e-45 



In a BLAST search of public sequence databases, the NOV 1 7a protein was found to 
have homology to the proteins shown in the BLASTP data in Table 17E. 
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Table 17E. Public BLASTP Results for NOV17a 


Protein 
Accession 
Number 


Protein/Organism/Length 


NO VI 7a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities 
for the 
Matched 
Portion 


Expect 
Value 


Q9BT45 


SIMILAR TO RIKEN CDNA 
1500017E18 GENE - Homo sapiens 
(Human), 282 aa. 


1..283 
1..282 


280/283 (98%) 
282/283 (98%) 


e-163 


Q9DB32 


1500017E18RIK PROTEIN - Mus 
musculus (Mouse), 283 aa. 


1..278 
1..278 


231/279 (82%) 
251/279 (89%) 


e-133 


Q96S11 


SIMILAR TO HAGH - Homo sapiens 
(Human), 218 aa. 


1..228 
1..218 


217/228 (95%) 
218/228 (95%) 


e-123 


Q96NR5 


CDNA FLJ30279 FIS, CLONE 
BRACE2002772, MODERATELY 
SIMILAR TO 

HYDROXYACYLGLUTATHIONE 
HYDROLASE (EC 3.1 .2.6) - Homo 
sapiens (Human), 202 aa. 


L.133 
1-133 


132/133 (99%) 
133/133 (99%) 


3e-73 


035952 


Hydroxyacylglutathione hydrolase (EC 
3.1.2.6) (Glyoxalase II) (Glx II) (Round 
spermatid protein RSP29) - Rattus 
norvegicus (Rat), 260 aa. 


1..256 
1..256 


128/257 (49%) 
184/257(70%) 


le-71 



PFam analysis predicts that the NOV 17a protein contains the domains shown in the 



Table 17F. Domain Analysis of NOV17a 


Pfam Domain 


NOV17a Match 
Region 


Identities/ 
Similarities 
for the Matched 
Region 


Expect 
Value 


lactamase B: domain 1 of 
1 


7.. 173 


55/221 (25%) 
129/221 (58%) 


5.8e-32 



Example 18. 



The NOV1 8 clone was analyzed, and the nucleotide and predicted polypeptide 
sequences are shown in Table 18A. 



152 



Table 18A. NOV18 Sequence Analysis 




SEQ ID NO: 69 


2 100 bp 


NOV 18a, 

CG58553-01 DNA Sequence 


GGGrCCGGCGGGCATCGGCAAGACCATGGCGGCCAAAAATATCCTGTACGACrGGGCG 


GCGGGCAAGCTGTACCAGGGCCAGGTGGACTTCGCCTTCTTCATGCCCTGCGGCGAGC 
TGCTGGAGAGGCCGGGCACGCGCAGCCTGGCTGACCTGATCCTGGACCAGTGCCCCGA 
CCGCGGCGCGCCGGTGCCGCAGATGCTGGCCCAGCCGCAGCGGCTGCTCTTCATCCTG 
GACGGCGCGGACGAGCTGCCGGCGCTGGGGGGCCCCGAGGCCGCGCCCTGCACAGACC 
CCTTCGAGGCGGCGAGCGGCGCGCGGGTGCTAGGCGGGCTGCTGAGTAAGGCGCTGCT 




CGCCTGTGTTCCCCGCAGTGCGCCGAGGTGCGCGGCTTCTCCGACAAGGACAAGAAGA 
AGTATTTCTACAAGTTCTTCCGGGATGAGAGGAGGGCCGAGCGCGCCTACCGCTTCGT 
GAAGGAGAACGAGACGCTGTTCGCGCTGTGCTTCGTGCCCTTCGTGTGCTGGATCGTG 
TGCACCGTGCTGCGCCAGCAGCTGGAGCTCGGTCGGGACCTGTCGCGCACGTCCAAGA 
CCACCACGTCAGTGTACCTGCTTTTCATCACCAGCGTTCTGAGCTCGGCTCCGGTAGC 
CGACGGGCCCCGGTTGCAGGGCGACCTGCGCAATCTGTGCCGCCTGGCCCGCGAGGGC 
GTCCTCGGACGCAGGGCGCAGTTTGCCGAGAAGGAACTGGAGCAACTGGAGCTTCGTG 
GCTCCAAAGTGCAGACGCTGTTTCTCAGCAAAAAGGAGCTGCCGGGCGTGCTGGAGAC 
AGAGGTCACCTACCAGTTCATCGACCAGAGCTTCCAGGAGTCCTTCGCGGCACTGTCC 
TACCTGCTGGAGGACGGCGGGGTGCCCAGGACCGCGGCTGGCGGCGTTGGGACACTCC 
TGCGTGGGGACGCCCAGCCGCACAGCCACTTGGTGCTCACCACGCGCTTCCTCTTCGG 




GAGTGGCACCAGAGGTGACCGAGGGGGCCAAAGGGCTCGAGGACACCGAAGAGCCAGA 
GGAGGAGGAGGAGGGAGAGGAGCCCAACTACCCACTGGAGTTGCTGTACTGCCTGTAC 
GAGACGCAGGAGGACGCGTTTGTGCGCCAAGCCCTGGGCCGGTTCCCGGAGCTGGCGC 
TGCAGCGAGTGCGCTTCTGCCGCATGGACGTGGCTGTTCTGAGCTACTGCGTGAGGTG 
CTGCCCTGCTGCACAGGCACTGCGGCTGATCAGCTGCAGATTGGTTGCTGCGCAGGAG 
AAGAAGAAGAAGAGCCTGGGGAAGCGGCTCCAGGCCAGCCTGGGCACCACAAAACAAC 
TGCCAGCCTCCCTTCTTCATCCACTCTTTCAGGCAATGACTGACCCACTGTGCCATCT 
GAGCAGCCTCACGCTGTCCCACTGCAAACTCCCTGACGCGGTCTGCCGAGACCTTTCT 
GAGGCCCTGAGGGCAGCCCCCGCACTGACGGAGCTGGGCCTCCTCCACAACAGGCTCA 
GTGAGGCAGGACTGCGTATGCTGAGTGAGGGCCTAGCCTGGCCGCAGTGCAGGGTGCA 
GACGGTCAGGGTACAGCTGCCTGACCCCCAGCGAGGGCTCCAGTACCTGGTGGGTATG 
CTTCGGCAGAGCCCTGCCCTGACCACCCTGGATCTCAGCGGCTGCCAACTGCCCGCCC 
CCATGGTGACCTACCTGTGTGCAGTCCTGCAGCACCAGGGATGCGGCCTGCAGACCCT 
CAGTCTGGCCTCTGTGGAGCTGAGCGAGCAGTCACTACAGGAGCTTCAGGCTGTGAAG 
AGAGCAAAGCCGGATCTGGTCATCACACACCCAGCGCTGGACGGCCACCCACAACCTC 
CCAAGGAACTCATCTCGACCTTCTGAGGCTCTGGTGGCCAGAGCAGGGTGGAAGACCC 




ORF Start: ATG at 26 


ORE Stop: TGA at 2054 




SEQ ID NO: 70 


676 aa ImW at 74650.3kD 


NOV18a, 

CG58553-01 Protein Sequence 


MAAKNILYDWAAGKLYQGQVDFAFFMPCGELLERPGTRSLADLILDQCPDRGAPVPQM 
LAQPQRLLFILDGADELPALGGPEAAPCTDPFEAASGARVLGGLLSKALLPTALLLVT 
TRAAAPGRLQGRLCSPQCAEVRGFSDKDKKKYFYKFFRDERRAERAYRFVKENETLFA 
LCFVPFVCWIVCTVLRQQLELGRDLSRTSKTTTSVYLLFITSVLSSAPVADGPRLQGD 
LRNLCRLAREGVLGRRAQFAEKELEQLELRGSKVQTLFLSKKELPGVLETEVTYQFID 
QSFQESFAALSYLLEDGGVPRTAAGGVGTLLRGDAQPHSHLVLTTRFLFGLLSAERMR 
DIERHFGCMVSERVKQEALRWVQGQGQGCPGVAPEVTEGAKGLEDTEEPEEEEEGEEP 
NYPLELLYCLYETQEDAFVRQALGRFPELALQRVRFCRMDVAVLSYCVRCCPAAQALR 
LISCRLVAAQEKKKKSLGKRLQASLGTTKQLPASLLHPLFQAMTDPLCHLSSLTLSHC 
KLPDAVCRDLSEALRAAPALTELGLLHNRLSEAGLRMLSEGLAWPQCRVQTVRVQLPD 
PQRGLQYLVGMLRQSPALTTLDLSGCQLPAPMVTYLCAVLQHQGCGLQTLSLASVELS 
EQSLQELQAVKRAKPDLVITHPALDGHPQPPKELISTF 



Further analysis of the NOV 18a protein yielded the following properties shown in 
Table 18B. 
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Table 18B. Protein Sequence Properties NOV18a 


Psort 
analysis: 


0.7400 probability located in nucleus; 0.6000 probability located in 
endoplasmic reticulum (membrane); 0.3000 probability located in microbody 
(peroxisome); 0.1000 probability located in mitochondrial inner membrane 


SignalP 
analysis: 


No Known Signal Sequence Predicted 



A search of the NOV 18a protein against the Geneseq database, a proprietary 
database that contains sequences published in patents and patent publication, yielded 
several homologous proteins shown in Table 18C. 



Table 18C. Geneseq Results for NOV18a 


Geneseq 
Identifier 


Protein/Organism/Length [Patent 
#, Date] 


NOV18a 
Residues/ 

]VIatch 
Residues 


Identities/ 
Similarities for 
the Matched 
Region 


Expect 


AAE04546 


Human G-protein coupled receptor- 
2 (GCREC-2) protein - Homo 
sapiens, 891 aa. [WO200142288- 
A2, 14-JUN-2001] 


1..676 
210..891 


671/682 (98%) 
671/682 (98%) 


0.0 


AAU00023 


Human activated T-lymphocyte 
associated sequence 2, ATLAS-2 - 
Homo sapiens, 1851 aa. 
[WO2001 14564-A2, 01-MAR- 
2001] 


1..633 
210..904 


605/695 (87%) 
610/695 (87%) 


0.0 


ABB 11735 


Human vasopressin receptor 
homologue, SEQ ID NO:2105 - 
Homo sapiens, 597 aa. 
[WO200157188-A2, 09-AUG- 
2001] 


1..490 
106..595 


485/490 (98%) 
485/490 (98%) 


0.0 


AAR33389 


AII/AVPv2 receptor - Synthetic, 
481 aa. [WO9305073-A, 18-MAR- 
1993] 


193. .670 
1..480 


322/481 (66%) 
371/481 (76%) 


e-174 


AAM89960 


Human immune/haematopoietic 
antigen SEQ ID NO: 17553 - Homo 
sapiens, 329 aa. [WO200157182- 
A2, 09-AUG-2001] 


1..274 
9..282 


265/274 (96%) 
266/274 (96%) 


e-151 



In a BLAST search of public sequence databases, the NOV 18a protein was found to 
have homology to the proteins shown in the BLASTP data in Table 1 8D. 
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Table 18D. Public BLASTP Results for NOV18a 


Protein 
Accession 
Number 


Protein/Organism/Length 


NOV18a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Portion 


Expect 
Value 


CAC34689 


SEQUENCE 3 FROM PATENT 
WO01 14564 - Homo sapiens 
(Human), 1851 aa. 


1..633 
210..904 


605/695 (87%) 
610/695 (87%) 


0.0 


Q91WS2 


HYPOTHETICAL 62.5 KDA 
PROTEIN - Mus musculus 
(Mouse), 556 aa (fragment). 


107..659 
1..554 


390/557 (70%) 
450/557 (80%) 


0.0 


Q63035 


VASOPRESSIN RECEPTOR - 
Rattus norvegicus (Rat), 483 aa. 


193..670 
1..482 


324/483 (67%) 
372/483 (76%) 


e-173 


AAL 12498 


CRYOPYRIN - Homo sapiens 
(Human), 920 aa. 


3..657 
234..914 


232/709 (32%) 
355/709 (49%) 


5e-94 


AAL 12497 


CRYOPYRIN - Homo sapiens 
(Human), 1034 aa. 


3..648 
234..848 


223/658 (33%) 
344/658(51%) 


6e-93 



PFam analysis predicts that the NOV 18a protein contains the domains shown in the 
Table 18E. 



Table 18E. Domain Analysis of NO VI 8a 



Pfam Domain 



NOV18a Match Region 



Identities/ 
Similarities 
for the Matched Region 



Expect Value 



No Significant Matches Found 



Example 19. 

The NOV 19 clone was analyzed, and the nucleotide and predicted polypeptide 
sequences are shown in Table 19A. 



Table 19A. NOV19 Sequence Analysis 




SEQIDNO:71 |2686 bp 


NOV 19a, 

CG58626-01 DNA Sequence 


CCGGCGGCGTCTCCACAGCATGAATTACCCGGGCCGCGGGTCCCCACGGAGCCCCGAG 
CATAACGGCCGAGGCGGCGGCGGCGGCGCCTGGGAGCTGGGCTCAGACGCGAGGCCAG 
CGTTCGGCGGCGGCGTCTGCTGCTTCGAGCACCTGCCCGGCGGGGACCCGGACGACGG 
CGACGTGCCCCTGGCCCTGCTGCGCGGGGAACCCGGGCTGCATTTGGCGCCGGGCACC 
GACGACCACAACCACCACCTCGCGCTGGACCCCTGCCTCAGTGACGAGAACTATGACT 
TCAGCTCCGCCGAGTCGGGCTCCTCGCTGCGCTACTACAGCGAGGGTGAGAGCGGCGG 
CGGCGGCAGCTCCTTGTCGCTGCACCCGCCGCAGCAGCCTCCGCTGGTCCCGACGAAC 
TCGGGGGGCGGCGGCGCGACAGGAGGGTCCCCCGGGGAAAGGAAACGTACCCGGCTTG 




CTCCGCATCGAGCTCGCCTTCCGGACCCTGCTGCAGACCACGGGTGCCCGGCCCCAGG 
GCGGGGACCGGGACGGCGACCATGTGTGCTCCCCCACGGGCCCAGCCTCCAGTTCCGG 
AGAAGATGACGATGAGGACCGCGCCTGCGGCTTCTGCCAGAGTACGACGGGGCACGAG 
CCGGAGATGGTGGAGCTTGTGAACATCGAGCCTGTGTGCGTGCGGGGCGGCCTCTACG 





AGGTGGATGTGACCCAAGGAGAGTGCTACCCGGTGTACTGGAACCGTGCTGATAAAAT 
ACCAGTAATGCGTGGACAGTGGTTTATTGACGGCACTTGGCAGCCTCTAGAAGAGGAA 
GAAAGTAATTTAATTGAGCAAGAACATCTCAATTGTTTTAGGGGCCAGCAGATGCAGG 
AAAATTTCGATATTGAAGTGTCAAAATCCATAGATGGAAAAGATGCTGTTCATAGTTT 
CAAGTTGAGTCGAAACCATGTGGACTGGCACAGTGTGGATGAAGTATATCTTTATAGT 
GATGCAACAACATCTAAAATTGCAAGAACAGTTACCCAAAAACTGGGATTTTCTAAAG 
CATCAAGTAGTGGTACCAGACTTCATAGAGGTTATGTAGAAGAAGCCACATTAGAAGA 
CAAGCCATCACAGACTACCCATATTGTATTTGTTGTGCATGGCATTGGGCAGAAAATG 
GACCAAGGAAGAATTATCAAAAATACAGCTATGATGAGAGAAGCTGCAAGAAAAATAG 
AAGAAAGGCATTTTTCCAACCATGCAACACATGTTGAATTTCTGCCTGTTGAGTGGCG 
GTCAAAACTTACTCTTGATGGAGACACTGTTGATTCCATTACTCCTGACAAAGTACGA 
GGTTTAAGGGATATGCTGAACAGCAGTGCAATGGACATAATGTATTATACTAGTCCAC 
TTTATAGAGATGAACTAGTTAAAGGCCTTCAGCAAGAGCTGAATCGATTGTATTCCCT 
TTTCTGTTCTCGGAATCCAGACTTTGAAGAAAAAGGGGGTAAAGTCTCAATAGTATCA 
CATTCCTTGGGATGTGTAATTACTTATGACATAATGACTGGCTGGAATCCAGTTCGGC 
TGTATGAACAGTTGCTGCAAAAGGAAGAAGAGTTGCCTGATGAACGATGGATGAGCTA 
TGAAGAACGACATCTTCTTGATGAACTCTATATAACTAAACGACGGCTGAAGGAAATA 
GAAGAACGGCTTCACGGATTGAAAGCATCATCTATGACACAAACACCTGCCTTAAAAT 
TTAAGGTAGAGAATTTCTTCTGTATGGGATCCCCATTAGCAGTTTTCTTGGCGTTGCG 
TGGCATCCGCCCAGGAAATACTGGAAGTCAAGACCATATTTTGCCTAGAGAGATTTGT 
AACCGGTTACTAAATATTTTTCATCCTACAGATCCAGTGGCTTATAGATTAGAACCAT 
TAATACTGAAACACTACAGCAACATTTCACCTGTCCAGATCCACTGGTACAATACTTC 
AAATCCTTTACCTTATGAACATATGAAGCCAAGCTTTCTCAACCCAGCTAAAGAACCT 
ACCTCAGTTTCAGAGAATGAAGGCATTTCAACCATACCAAGCCCTGTGACCTCACCAG 
TTTTGTCCCGCCGACACTATGGAGAATCTATAACAAATATAGGCAAAGCAAGCATATT 
AGGTGCTGCTAGCATTGGAAAGGGACTTGGAGGAATGTTGTTCTCAAGATTTGGACGT 
TCATCTACAACACAGTCATCTGAAACATCAAAAGACTCAATGGAAGATGAGAAGAAGC 
CAGTTGCCTCACCTTCTGCTACCACCGTAGGGACACAGACCCTTCCACATAGCAGTTC 
TGGCTTCCTCGATTCTGCAGTGGAGTTGGATCACAGGATTGATTTTGAACTCAGAGAA 
GGCCTTGTGGAGAGCCGCTATTGGTCAGCTGTCACGTCGCATACTGCCTATTGGTCAT 
CCTTGGATGTTGCCCTTTTTCTTTTAACCTTCATGTATAAACATGAGCACGATGATGA 
TGCAAAACCCAATTTAGATCCAATCTGAACTCTCTTGAAGGACATGAATGGCCTAAAA 




ORF Start: ATG at 20 


ORF Stop: TGA at 2636 




SEQ ID NO: 72 


872 aa IMW at 97063.4kD 


NOV19a, 

CG58626-01 Protein Sequence 


MNYPGRGSPRSPEHNGRGGGGGAWELGSDARPAFGGGVCCFEHLPGGDPDDGDVPLAL 
LRGEPGLHLAPGTDDHNHHLALDPCLSDEKYDFSSAESGSSLRYYSEGESGGGGSSLS 
LHPPQQPPLVPTNSGGGGATGGSPGERKRTRLGGPAARHRYEWTELGPEEVRWFYKE 
DKKTWKPFIGYDSLRIEIAFRTLLQTTGARPQGGDRDGDHVCSPTGPASSSGEDDDED 
RACGFCQSTTGHEPEMVELVNIEPVCVRGGLYEVDVTQGECYPVYWNRADKIPVMRGQ 
WFIDGTWQPLEEEESNLI EQEHLNCFRGQQMQEKFDI EVSKS I DGKDAVHSFKLSRNH 
VDWHSVDEVYLYSDATTSKIARTVTQKLGFSKASSSGTRLHRGYVEEATLEDKPSQTT 
HIVFVVHGIGQKMDQGRIIKNTAMMREAARKIEERHFSNHATHVEFLPVEWRSKLTLD 
GDTVDSITPDKVRGLRDMLNSSAMDIMYYTSPLYRDELVKGLQQELNRLYSLFCSRNP 
DFEEKGGKVSIVSHSLGCVITYDIMTGWNPVRLYEQLLQKEEELPDERWMSYEERHLL 
DELYITKRRLKEIEERLHGLKASSMTQTPALKFKVENFFCMGSPLAVFLALRGIRPGN 
TGSQDHILPREICNRLLNIFHPTDPVAYRLEPLILKHYSNISPVQIHWYNTSNPLPYE 
HMKPSFLNPAKEPTSVSENEGISTIPSPVTSPVLSRRHYGESITNIGKASILGAASIG 
KGLGGMLFSRFGRSSTTQSSETSKDSMEDEKKPVASPSATTVGTQTLPHSSSGFLDSA 
VELDHRIDFELREGLVESRYWSAVTSHTAYWSSLDVALFLLTFMYKHEHDDDAKPNLD 
PI 



Further analysis of the NOV1 9a protein yielded the following properties shown in 
Table 19B. 
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Table 19B. Protein Sequence Properties NOV 19a 


PSort 
analysis: 


0.4555 probability located in microbody (peroxisome); 0.4500 probability 
located in cytoplasm; 0.1000 probability located in mitochondrial matrix 
space; 0.1000 probability located in lysosome (lumen) 


SignalP 
analysis: 


No Known Signal Sequence Predicted 



A search of the NOV 19a protein against the Geneseq database, a proprietary 
database that contains sequences published in patents and patent publication, yielded 
several homologous proteins shown in Table 19C. 



Table 19C. Geneseq Results for NO VI 9a 


Geneseq 
Identifier 


Protein/Organism/Length 
[Patent #, Date] 


NO VI 9a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Region 


Expect 
Value 


AAG64151 


Arabidopsis thaliana gravitropism 
protein - Arabidopsis thaliana, 933 
aa. [JP2001 120279-A, 08-MAY- 
2001] 


257..547 
1 56..454 


104/316(32%) 
156/316(48%) 


le-38 


AAM41595 


Human polypeptide SEQ ID NO 
6526 - Homo sapiens, 677 aa. 
[WO200153312-A1, 26-JUL-2001] 


261..548 
52..328 


94/301 (31%) 
138/301 (45%) 


6e-25 


AAB92643 


Human protein sequence SEQ ID 
NO:10972 - Homo sapiens, 1000 
aa. [EP1074617-A2, 07-FEB-2001] 


119..608 
226..664 


132/524 (25%) 
204/524 (38%) 


2e-24 


AAM39809 


Human polypeptide SEQ ID NO 
2954 - Homo sapiens, 615 aa. 
[WO200153312-A1, 26-JUL-2001] 


274..548 
3. .266 


90/288 (31%) 
131/288(45%) 


4e-23 


AAB93825 


Human protein sequence SEQ ID 
NO: 13636 - Homo sapiens, 694 aa. 
[EP1074617-A2, 07-FEB-2001] 


404..608 
227..449 


76/229 (33%) 
113/229(49%) 


6e-23 



In a BLAST search of public sequence databases, the NOV1 9a protein was found to 
have homology to the proteins shown in the BLASTP data in Table 19D. 
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Table 19D. Public BLASTP Results for NOV19a 


Protein 
Accession 
Number 


Protein/Organism/Length 


NOV19a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Portion 


Expect 
Value 


046606 


PHOSPHATIDIC ACID- 
PREFERRING PHOSPHOLIPASE 
Al - Bos taurus (Bovine), 875 aa. 


1..872 
1..875 


802/876 (91%) 
829/876 (94%) 


0.0 


Q9C0F8 


KIAA1705 PROTEIN - Homo 
sapiens (Human), 498 aa (fragment). 


378..872 
4„498 


493/495 (99%) 
494/495 (99%) 


0.0 


Q96LL2 


CDNA FLJ25408 FIS, CLONE 
TST02965, HIGHLY SIMILAR TO 
BOS TAURUS PHOSPHATIDIC 
ACID-PREFERRING 
PHOSPHOLIPASE Al MRNA - 
Homo sapiens (Human), 454 aa. 


419..872 
1..454 


453/454 (99%) 
454/454 (99%) 


0.0 


AAH18552 


HYPOTHETICAL 27.3 KDA 
PROTEIN - Mus musculus (Mouse), 
249 aa (fragment). 


624..869 
1..246 


224/246 (91%) 
236/246 (95%) 


e-130 


AAL32232 


HYPOTHETICAL 85.1 KDA 
PROTEIN - Caenorhabditis elegans, 
753 aa. 


122..867 
11. .750 


255/794 (32%) 
374/794 (46%) 


6e-91 



PFam analysis predicts that the NOV 19a protein contains the domains shown in the 
Table 19E. 



Table 19E. Domain Analysis of NOV19a 


Pfam Domain 


NOV19a Match 
Region 


Identities/ 
Similarities 
for the Matched 
Region 


Expect 
Value 


DUF203: domain 1 of 
1 


252..458 


42/219 (19%) 
105/219 (48%) 


7.5 


DDHD: domain 1 of 1 


611. .858 


96/266 (36%) 
236/266 (89%) 


3.3e-116 



Example 20. 



The NOV20 clone was analyzed, and the nucleotide and predicted polypeptide 
sequences are shown in Table 20A. 
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Table 20A. NOV20 Sequence Analysis 




SEQIDNO:73 1 773 bp 


NOV20a, 

CG57597-01 DNA Sequence 


GGTAAGGACACAAGATGCCAAATAGGGTAAGGAATGGTCCAGAAACCTGTGAAnTrTG 
CATTGCAGGCATGCACCACCACTCCTGGCTAATTTTTTGTATTTTTAGTGCCATCGAA 
TCCGGCTCAAACCTTTTATTTCTCTTATGTAAAAGCTGTGTACTTCAGAAAAACATGT 
ACAGTTATCCCTGGCAGTGCCGGGGTGGGGTCTGCGCGGCCCTGGAGGCCTGGCCGGC 
CTTGCAGATCGCTGTGGAGAATGGCTTCGGGGGTGTGCACAGCCAGGAGAAGGCCAAG 
TGGCTGGGGGGTGCAGTGGAGGATTACTTCATGCGCAATGCTGACTTGGAGCTAGATG 
AGGTGGAAGACTTCCTTGGAGAGCTGTTGACCAACGAGTTTGATACAGTTGTGGAAGA 
CGGGAGTCTGCCCCAGGTGAGCCAGCAACTGCAGACCATGTTCCACCACTTCCAGAGG 
GGTGATGGGGCTGCTCTGAGGGAGATGGCCTCCTGCATCACTCAGAGAAAATGCAAGG 
TCACAGCCACTGCACTTAAGACAGCTAGAGAGACTGATGAGGATGAAGATGATGTGGA 
CAGTGTGGAAGAGATGGAGGTCACAGCTACGAATGATGGGGCrGCTACAGATGGGGTC 
TGCCCCCAGCCTGAACCCTCTGATCCAGACGCTCAGACTATTAAGGAAGAGGATATAG 
TGGAAGATGGCTGGACCATTGTCCGGAGAAAAAAATGAGTGGGGATGATTGGAAATGG 
CTTT C TTTGC1 




ORF Start: ATG at 15 


ORF Stop: TGA at 732 




SEQ ID NO: 74 


239 aa ImW at 26579.5kD 


NOV20a, 

CG57597-01 Protein Sequence 


MPNRVENGPETCELCIAGMHHHSWLIFCIFSAIESGSKLLFLLCKSCVLQKNMYSYPW 
QCRGGVCAALEAWPALQIAVENGFGGVHSQEKAKWLGGAVEDYFMRNADLELDEVEDF 
LGELLTNEFDTWEDGSLPQVSQQLQTMFHHFQRGDGAALREMASC I TQRKCKVTATA 
LKTARETDEDEDDVDSVEEMEVTATNDGAATDGVCPQPEPSDPDAQTIKEEDIVEDGW 
TIVRRKK 



Further analysis of the NOV20a protein yielded the following properties shown in 
Table 20B. 



Table 20B. Protein Sequence Properties NOV20a 


PSort 
analysis: 


0.3000 probability located in nucleus; 0.1000 probability located in 
mitochondrial matrix space; 0.1000 probability located in lysosome (lumen); 
0.0000 probability located in endoplasmic reticulum (membrane) 


SignalP 
analysis: 


No Known Signal Sequence Predicted 



A search of the NOV20a protein against the Geneseq database, a proprietary 
database that contains sequences published in patents and patent publication, yielded 
several homologous proteins shown in Table 20C. 



Table 20C. Geneseq Results for NOV20a 


Geneseq 
Identifier 


Protein/Organism/Length [Patent 
#, Date] 


NOV20a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Region 


Expect 
Value 


AAG81374 


Human AFP protein sequence SEQ 
ID NO:266 - Homo sapiens, 191 aa. 
[WO200129221-A2, 26-APR-2001] 


61. .239 
13..191 


178/179 (99%) 
178/179 (99%) 


e-101 


AAG57770 


Arabidopsis thaliana protein 


63..239 
18..178 


56/182(30%) 
94/182 (50%) 


le-13 



159 





Arabidopsis thaliana, 1 84 aa. 
[EP1033405-A2, 06-SEP-2000] 








AAG57771 


Arabidopsis thaliana protein 
fragment SEQ ID NO: 74487 - 
Arabidopsis thaliana, 156 aa. 
[EP1033405-A2, 06-SEP-2000] 


74..239 
1-150 


52/171 (30%) 
89/171 (51%) 


2e-ll 



In a BLAST search of public sequence databases, the NOV20a protein was found to 
have homology to the proteins shown in the BLASTP data in Table 20D. 



Table 20D. Public BLASTP Results for NOV20a 


Protein 
Accession 
Number 


Protein/Organism/Length 


NOV20a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Portion 


Expect 
Value 


Q969E8 


UNKNOWN (PROTEIN FOR 
MGC:20451) (PROTEIN FOR 
IMAGE:3953868) - Homo sapiens 
(Human), 191 aa. 


61. .239 
13..191 


178/179 (99%) 
178/179 (99%) 


e-101 


Q9NAD8 


Y5 1H4A. 15 PROTEIN - 
Caenorhabditis elegans, 225 aa. 


1..239 
1..225 


66/239 (27%) 
122/239 (50%) 


5e-23 


Q06672 


HIGHLY ACIDIC C-TERMINUS - 
Saccharomyces cerevisiae (Baker's 
yeast), 249 aa. 


63..238 
79..244 


46/177 (25%) 
82/177(45%) 


5e-ll 


Q9VBI0 


CG14543 PROTEIN - Drosophila 
melanogaster (Fruit fly), 195 aa. 


71. .238 
24..195 


49/174(28%) 
81/174 (46%) 


2e-10 


Q9UUA9 


HYPOTHETICAL HIGHLY 
ACIDIC C-TERMINUS PROTEIN 
- Schizosaccharomyces pombe 
(Fission yeast), 1 79 aa. 


70..239 
22.. 178 


42/172(24%) 
83/172(47%) 


2e-06 



PFam analysis predicts that the NOV20a protein contains the domains shown in the 
Table 20E. 



Table 20E. Domain Analysis of NOV20a 



Pfam Domain 



NOV20a Match Region 



Identities/ 
Similarities 
for the Matched Region 



Expect Value 



No Significant Matches Found 
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Example 2 1 . 

The NOV21 clone was analyzed, and the nucleotide and predicted polypeptide 
sequences are shown in Table 21 A. 



Table 21A. NOV21 Sequence Analysis 




SEQ ID NO: 75 [7741 bp 


NOV21a, 

CG57804-01 DNA Sequence 


TTGTCTCTTTGTGTTTrCCAGACATTCTAAGTGAGACTGTCCACATCATCTAGGAAAA 


TGGTGGCCCTGTCCTTAAAGATTTGTGTGCGCCACTGCAACGTGGTGAAGACCATGCA 
GTTTGAACCATCTACAGCTGTGTACGATGCGTGTCGAGTCATTCGGGAACGGGTGCCT 
GAGGCACAAACTGGGCAAGCTTCTGACTATGGACTCTTTCTTTCGGATGAAGACCCGA 
GGAAAGGGATTTGGCTGGAAGCGGGCAGAACACTGGATTACTACATGTTGCGGAATGG 
GGATATTTTGGAATATAAAAAGAAACAGAGACCTCAGAAAATCCGGATGCTGGATGGA 
TCTGTGAAGACAGTGATGGTGGATGATTCCAAGACTGTGGGGGAGCTCCTGGTCACTA 
TTTGTAGCAGAATAGGAATAACAAATTATGAAGAATACTCCTTAATCCAAGAAACTAT 
TGAAGAAAAGAAAGAGGAAGGAACGGGCACACTCAAAAAAGACAGGACACTGTTACGA 
GATGAGAGGAAAATGGAGAAGTTGAAGGCCAAGCTGCACACAGATGATGACCTAAATT 
GGCTGGATCACAGCCGAACATTCAGAGAACAAGGAGTAGATGAAAACGAAACGTTGCT 
GCTTAGACGGAAGTTCTTTTACTCTGATCAGAATGTAGATTCGAGAGACCCCGTGCAG 
CTGAACTTGCTTTATGTTCAGGCACGGGATGACATCCTGAATGGCTCTCACCCTGTCT 
CCTTCGAGAAAGCTTGTGAGTTTGGTGGATTTCAAGCCCAGATACAATTTGGACCTCA 
TGTGGAACATAAACACAAACCTGGATTTrTAGATCTGAAGGAATTCCTGCCCAAAGAA 
TATATCAAGCAGAGAGGAGCTGAAAAGAGGATCTTTCAGGAGCATAAGAACTGCGGAG 
AGATGAGrGAGATAGAAGCCAAGGTCAAGTACGTCAAACTCGCACGGTCCCTCCGCAC 
ATATGGCGTGTCCTTCTTCCTGGTGAAGGAGAAGATGAAAGGCAAGAACAAGCTGGTG 
CCTCGCCTGCTGGGGATCACCAAAGACTCGGTGATGCGCGTGGATGAGAAGACCAAGG 
AAGrGCTGCAGGAGTGGCCCCTCACCACCGTCAAGCGCTGGGCAGCCTCACCCAAGAG 
CTTCACACTGGATTTTGGGGAGTATCAGGAAAGCTACTATTCAGTACAAACCACCGAG 
GGAGAGCAGATATCCCAGCTGATTGCAGGCTACATTGACATCATCCTGAAAAAGGGAA 
CATACGTGACATCTGTGGGGTCTCCTCATTGCACTCCACATGGCTGGTGTTCTCTCAG 
TGACCAAACCACTTTTCCCGGCAGGTCCACCATCTTGCAGCAGCAGTTCAACCGGACC 
GGGAAGGCAGAGCACGGCTCAGTGGCGCTGCCGGCCGTGATGCGCTCGGGCTCCAGCG 
GGCCTGAGACCTTCAACGTTGGCAGCATGCCCTCGCCACAGCAGCAGGTCATGGTTGG 
GCAGATGCACCGAGGCCACATGCCGCCACTGACCTCAGCCCAGCAGGCCCTGATGGGG 
ACCATCAACACAAGCATGCACGCCGTCCAGCAGGCCCAGGATGATCTCAGTGAGCTCG 
ACTCGCTGCCACCTCTCGGCCAGGATATGGCATCTAGGGTATGGGTTCAGAACAAAGT 
CGACGAATCCAAACACGAAATCCATTCTCAAGTTGATGCTATCACGGCCGGAACGGCT 
TCAGTTGTTAACCTCACAGCTGGTGACCCTGCAGACACTGACTACACAGCTGTGGGAT 
GTGCGATCACCACTATTTCTTCCAACCTGACGGAGATGTCCAAGGGTGTGAAGCTATT 
GGCCGCCCTCATGGATGATGAGGTGGGCAGCGGGGAGGACTTGCTCAGAGCTGCCAGG 




CTCGACAGACAGTTTTGACTGCTGCTGGCAGCATCGGACAAGCCAGTGGGGATCTTCT 
GAGACAGATTGGAGAGAATGAGACTGATGAGCGATTCCAGGATGTTTTAATGAGTTTG 
GCCAAAGCTGTTGCCAATGCAGCTGCCATGTTGGTACTAAAGGCAAAGAATGTTGCCC 
AAGTGGCCGAAGACACTGTCCTACAGAACAGGGTAATTGCTGCTGCCACCCAGTGTGC 
CCTCTCCACCTCCCAGCTTGTGGCATGTGCCAAGGTTGTGAGCCCCACTATTAGCTCC 
CCTGTGTGCCAGGAGCAGCTGATTGAAGCAGGGAAGCTGGTGGACCGCTCGGTGGAGA 
ACTGTGTCCGTGCCTGCCAGGCGGCCACTACCGATAGTGAGCTCCTGAAGCAGGTCAG 
CGCAGCGGCCAGCGTGGTCAGCCAGGCCCTCCATGATCTCCTGCAGCATGTGCGGCAG 
TTTGCCAGCCGAGGCGAGCCCATCGGCCGCTACGACCAGGCTACTGACACCATCATGT 
GTGTCACCGAGAGCATCTTCAGCTCCATGGGTGACGCTGGTGAAATGGTGCGCCAGGC 
GCGGGTTCTGGCCCAAGCCACATCAGACCTCGTCAATGCCATGAGGTCAGATGCAGAA 
GCCGAAATCGACATGGAGAATTCAAAGAAGCTCCTGGCAGCAGCAAAACTCTTAGCTG 
ACTCCACTGCTCGCATGGTGGAAGCTGCAAAGGGGGCTGCAGCCAACCCAGAGAATGA 
GGACCAGCAGCAAAGGCTGAGAGAAGCTGCAGAAGGCCTCCGGGTAGCAACCAACGCA 
GCTGCCCAGAATGCTATTAAGAAAAAAATTGTCAACCGACTGGAGGTTGCAGCCAAGC 
AGGCCGCAGCGGCAGCCACACAGACCATCGCCGCCTCCCAGAATGCAGCTGTTTCCAA 
CAAGAACCCTGCGGCCCAGCAGCAGCTGGTCCAGAGTTGCAAGGCAGTGGCTGATCAC 
ATCCCTCAGCTGGTCCAGGGAGTGAGGGGGAGCCAAGCTCAAGCTGAAGACCTGAGTG 
CCCAGCTGGCTCTCATCATCTCCAGCCAGAACTTCCTCCAGCCTGGAAGCAAGATGGT 
GTCCTCTGCCAAAGCCGCAGTGCCCACCGTGAGTGACCAGGCCGCAGCCATGCAGCTG 
AGCCAGTGTGCCAAGAACCTGGCCACCAGCTTGGCGGAGCTGCGTACCGCCTCGCAGA 
AGGCCCATGAAGCTTGTGGTCCGATGGAAATCGATTCAGCTCTGAATACGGTGCAGAC 
GCTTAAGAATGAACTGCAGGATGCCAAGATGGCAGCCGTGGAGAGCCAGCTGAAGCCA 
CTTCCAGGGGAAACGCTGGAAAAATGTGCTCAGGACCTGGGAAGCACATCCAAGGCGG 
TGGGCTCCTCCATGGCACAGCTGCTGACCTGTGCTGCTCAAGGCAACGAACACTACAC 
AGGGGTGGCTGCTAGAGAGACGGCCCAAGCTCTGAAAACACTGGCCCAGGCCGCCCGT 
GGAGTGGCTGCATCGACAACCGACCCCGCGGCCGCCCATGCCATGTTAGATTCTGCTC 
GAGACGTGATGGAGGGCTCCGCCATGCTCATTCAAGAGGCCAAGCAGGCCCTGATTGC 
ACCTGGAGATGCAGAGCGTCAACAAAGACTGGCTCAGGTGGCTAAAGCCGTCTCACAC 
- rGAAT i l?lATTGCCTCCCTGGGCAGAAGGATGTGGACGTGGCCTTGA 
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GCCTTTCCAGGAAGCCCAGAGTGAACTGAACCAGGCAGCAGCTGATCTGAACCAGTCT 
GCTGGGGAAGTGGTCCATGCCACCCGGGGCCAGAGTGGAGAGTTGGCTGCAGCCTCTG 
GAAAGTTCAGTGATGATTTTGGTGAATTCCTCGATGCTGGCATTGAGATGGCTGGCCA 
AGCTCAGACAAAAGAAGACCAGATCCAAGTGATAGGGAACCTCAAGAATATCTCGATG 
GCATCCAGCAAGCTGCTGTTAGCTGCCAAGTCTCTCTCTGTAGATCCAGGAGCTCCCA 

CATCACTCTGTGTACCCAACAAGCTCCGGGCCAGAAAGAGTGCGATAATGCCCTGCGG 
GAGCTCGAGACTGTGAAGGGGATGTTGGACAATCCTAATGAACCTGTTAGTGACCTCT 
CTTACTTTGACTGCATTGAGAGTGTGATGGAAAACTCCAAGGTTCTGGGTGAATCGAT 
GGCAGGGATTTCACAGAATGCCAAGACCGGAGACCTCCCTGCCTTTGGGGAATGTGTG 
GGGATTGCATCCAAGGCTCTCTGTGGGCTGACAGAGGCTGCAGCCCAGGCTGCATACT 
TGGTTGGCATCTCTGATCCAAACAGCCAGGCAGGCCACCAGGGCCTGGTGGACCCCAT 
CCAGTTTGCCAGGGCTAACCAGGCCATCCAGATGGCATGCCAGAACTTGGTGGACCCT 
GGCAGCAGCCCATCACAGGTCCTGTCAGCCGCCACAATTGTTGCCAAGCACACGTCAG 
CCTTGTGCAATGCCTGCCGCATCGCCTCATCCAAGACGGCCAACCCAGTAGCCAAGAG 
G CACTTCGTCCAGTCAG CCAAGGAAGTCGCCAACAGCACTGC CAAC CTGG TGAAGACC 
ATCAAGGCCCTGGATGGGGATTTCTCTGAAGACAACCGCAATAAGTGTCGCATCGCCA 
CCGCACCCTTGATTGAAGCTGTGGAGAACCTGACAGCGTTCGCCTCAAACCCTGAGTT 
TGTCAGCATTCCTGCCCAGATCAGCTCCGAGGGTTCCCAGGCACAGGAACCAATCCTG 
GTCTCAGCCAAGACCATGCTGGAGAGTTCATCGTACCTCATTCGCACTGCACGCTCTC 
TGGCCATCAACCCCAAAGACCCACCCACCTGGTCTGTACTGGCTGGACATTCCCATAC 
AGTGTCCGACTCCATCAAGAGTCTCATCACTTCTATCAGGGACAAGGCCCCTGGACAG 
AGGGAGTGTGATTACTCCATCGATGGCATCAACCGGTGCATCCGGGACATCGAGCAGG 

CCTGCAGGAGCAGCTGACTTCGGTGGTCCAGGAAATCGGACACCTTATCGATCCCATC 
GCCACAGCGGCTCGGGGAGAAGCAGCTCAGCTGGGACATAAGGTGACACAACTGGCAA 
GCTATTTTGAGCCCTTGATCTTAGCCGCAGTTGGTGTGGCCTCCAAGATTCTTGATCA 

ATGTTGTATGCAGCCAAAGAAGGTGGCGGAAACCCCAAGGCACAACACACCCATGACG 
CCATCACAGAGGCCGCCCAGTTGATGAAGGAAGCCGTGGATGACATCATGGTGACGCT 

ATCAGACGACTGTGGTTAAATACTCCAAAGCCATTGCGGTGACAGCTCAGGAAATGAT 

GACTATGGGCACCTGGCTTTCCAGGGCCAGATGGCAGCAGCCACGGCGGAACCAGAGG 
AGATCGGATTCCAGATTCGCACTCGTGTGCAGGACCTGGGCCACGGCTGTATCTTCCT 
GGTGCAGAAGGCAGGGGCCCTCCAGGTCTGCCCCACAGACAGCTACACCAAGAGGGAG 
CTGATCGAATGCGCCCGTGCCGTCACGGAAAAGGTCTCCTTGGTGCTCTCGGCTCTCC 
AGGCCGGGAACAAAGGAACCCAGGCATGCATTACAGCCGCCACCGCTGTGTCTGGGAT 
CATTGCCGACCTGGACACCACCATTATGTTTGCAACAGCGGGGACGCTGAATGCAGAG 
AACAGTGAGACCTrCGCAGACCACAGGGAGAACATTCTCAAGACGGCCAAGGCCTTGG 
TAGAAGACACGAAACTACTTGTGTCAGGAGCTGCGTCCACTCCTGACAAGCTGGCCCA 
GGCGGCCCAGTCCTCAGCAGCCACCATCACCCAGCTCGCAGAAGTGGTCAAGCTGGGG 
GCAGCCAGCCTGGGCTCCGACGACCCCGAGACCCAGGTGGATTTGATCAATGCCATCA 
AAGATGTGGCCAAGGCCCTTTCTGATCTCATCAGTGCTACCAAGGGAGCTGCCAGCAA 
GCCAGTGGACGACCCTTCCATGTACCAGCTCAAGGGGGCTGCCAAGGTGATGGTGACC 




CCAGGGCGCTTGAGGCCACAATTGAATGCATAAAGCAGGAGCTTACGGTGTTCCAGTC 
AAAAGACGTACCTGAAAAGACATCATCACCTGAAGAATCCATAAGGATGACGAAAGGC 
ATCACCATGGCAACAGCCAAAGCCGTGGCAGCTGGGAACTCATGTAGACAGGAGGACG 
TGATTGCTACTGCCAACCTGAGCCGGAAAGCCGTGTCAGATATGTTGACGGCTTGCAA 
GCAAGCATCCTTCCACCCCGATGTCAGTGACGAGGTGAGAACCAGAGCCTTGCGTTTC 
GGGACGGAGTGCACCCTTGGCTACTTGGACCTCCTGGAGCACGTCTTGGTGATTCTTC 
AGAAACCAACCCCAGAATTCAAGCAGCAGCTGGCCGCTTTCTCCAAGCGAGTCGCCGG 
CGCTGTGACAGAGCTCATCCAGGCGGCGGAAGCCATGAAAGGAACAGAGTGGGTGGAT 
CCAGAAGACCCAACTGTCATTGCAGAAACAGAGTTACTGGGGGCTGCAGCATCCATCG 

TGAGACCCTGGACTTTGAGGAACAGATCTTGGAAGCTGCTAAATCCATTGCTGCTGCC 
ACAAGCGCCCTGGTCAAATCGGCCTCAGCAGCCCAGAGGGAGCTGGTGGCCCAAGGAA 

TTCTGCTGCCCGGATGGTGGCGGCTGCGACCAGCAGTCTCTGTGAGGCGGCCAATGCC 
TCCGTTCAGGGACACGCCAGCGAGGAGAAGCTCATCTCATCTGCCAAGCAGGTCGCCG 
CTTCCACGGCTCAGCTGCTGGTGGCCTGCAAGGTGAAGGCCGACCAGGATTCAGAGGC 
CATGAGGCGGCTACAGGCGGCAGGAAATGCTGTGAAAAGAGCCTCAGACAATCTTGTC 
CGTGCAGCCCAGAAGGCAGCTTTTGGCAAAGCTGATGACGACGATGTTGTAGTGGAAA 
CCAAGTTTGTGGGGGGCATTGCTCAGATCATCGCCGCCCAGGAAGAAATGCTAAAGAA 
AGAGCGAGAACTGGAAGAAGCAAGGAAAAAACTGGCCCAAATCCGCCAGCAGCAGTAT 
AAGTTTTTACCCACCGAGCTGAGGGAAGATGAGGGCTAAAGGTGCGAGCCCAGATGGC 
GAGCCCCAGGGGATGGCCCTGGCTGAA 




ORF Start: ATG at 
58 


ORF Stop: TAA at 7693 




SEQ ID NO: 76 


2545 aa |MW at 271692.8UD 
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NOV21a, 

CG57804-01 Protein 
Sequence 



MVALSLKICVRHCNWKTMQFEPSTAVYDACRVIRERVPEAQTGQASDYGLFLSDEDP 
RKGIWLEAGRTLDYYMLRNGDILEYKKKQRPQKIRMLDGSVKTVMVDDSKTVGELLVT 
tIGITNYEEYSLIQETIEEKKEEGTGTLKKDRTLLRDERKMEKLKAKLHTDDDLN 
WLDHSRTFREQGVDENETLLLRRKFFYSDQNVDSRDPVQLKLLYVQARDDILNGSHPV 
SFEKACEFGGFQAQIQFGPHVEHKHKPGFLDLKEFLPKEYI KQRGAEKRI FQEHKNCG 
EMSEIEAKVKYVKLARSLRTYGVSFFLVKEKMKGKNKLVPRLLGITKDSVMRVDEKTK 
EVLQEWPLTTVKRWAASPKSFTLDFGEYQESYYSVQTTEGEQISQLIAGYIDIILKKG 
TYVTSVGSPHCTPHGWCSLSDQTTFPGRSTILQQQFNRTGKAEHGSVALPAVMRSGSS 
GPETFNVGSMPSPQQQVMVGQMHRGHMPPLTSAQQALMGTINTSMHAVQQAQDDLSEI, 
DSLPPLGQDMASRVWVQNKVDESKHEIHSQVDAITAGTASWNLTAGDPADTDYTAVG 
CAITTISSNLTEMSKGVKLIAALMDDEVGSGEDLLRAARTLAGAVSDLLKAVQPTSGE 
PRQTVLTAAGSIGQASGDLLRQIGENETDERFQDVLMSLAKAVANAAAMLVLKAKNVA 
QVAEDTVLQNRVIAAATQCALSTSQLVACAKWSPTISSPVCQEQLIEAGKLVDRSVE 
NCVRACQAATTDSELLKQVSAAASWSQALHDIAQHVRQFASRGEPIGRYDQATDTIM 
CVTESIFSSMGDAGEMVRQARVLAQATSDLVNAMRSDAEAE IDMENSKKLLAAAKLLA 
DSTARMVEAAKGAAANPENEDQQQRLREAAEGLRVATNAAAQNAIKKKIVNRLEVAAK 
QAAAAATQTIAASQNAAVSNKNPAAQQQLVQSCKAVADHIPQLVQGVRGSQAQAEDLS 
AQIJUjIISSQNFLQPGSKIWSSAKAAVPTVSDQAAAMQLSQCAKNLATSLAELRTASQ 
KAHEACGPMEIDSALNTVQTLKNELQDAKMAAVESQLKPLPGETLEKCAQDLGSTSKA 
VGSSMAQLLTCAAQGNEHYTGVAARETAQALKTLAQAARGVAASTTDPAAAHAMLDSA 
RDVMEGSAMLIQEAKQALIAPGDAERQQRLAQVAKAVSHSLNNCVNCLPGQKDVDVAL 



GKFSDDFGEFLDAGIEMAGQAQTKEDQIQVIGNLKNISMASSKLLLAAKSLSVDPGAP 
NAKNLLAAAARAVTESINQLITLCTQQAPGQKECDNALRELETVKGMLDNPNEPVSDL 
SYFDCIESVMENSKVLGESMAGISQNAKTGDLPAFGECVGIASKALCGLTEAAAQAAY 
LVGISDPNSQAGHQGLVDPIQFARANQAIQMACQNLVDPGSSPSQVLSAATIVAKHTS 
ALCNACRIASSKTANPVAKRHFVQSAKEVANSTANLVKTIKALDGDFSEDNRNKCRIA 
TAPLIEAVENLTAFASNPEFVSIPAQISSEGSQAQEPILVSAKTMLESSSYLIRTARS 
LAIKPKDPPTWSVLAGHSHTVSDSIKSLITSIRDKAPGQRECDYSIDGINRCIRDIEQ 
ASLAAVSQSLATRDDISVEALQEQLTSWQEIGHLIDPIATAARGEAAQLGHKVTQLA 
SYFE PLILAAVGVASK I LDHQQQMTVLDQTKTLAESALQMLYAAKEGGGNPKAQHTHD 
AITEAAQLMKEAVDDIMVTIJSEAASEVGLVGGMVDAIAEAMSKLDEGTPPEPKGTFVD 
YQTTWKYSKAIAVTAQEMMTKSVTNPEELGGLASQMTSDYGHLAFQGQMAAATAEPE 
EIGFQIRTRVQDLGHGCIFLVQKAGALQVCPTDSYTKRELIECARAVTEKVSLVLSAL 
QAGNKGTQACITAATAVSG I IADLDTTIMFATAGTLNAEKSETFADHRENILKTAKAL 
VEDTKLLVSGAASTPDKLAQAAQSSAATITQIAEWKLGAASLGSDDPETQVDLINAI 
KDVAKALSDLISATKGAASKPVDDPSMYQLKGAAKVMVTNVTSLLKTVKAVEDEATRG 
TRALEAT I ECI KQELTVFQSKDVPEKTSSPEESIRMTKGI TMATAKAVAAGNSCRQED 
VIATANLSRKAVSDMLTACKQASFHPDVSDEVRTRALRFGTECTLGYLDLLEHVLVIIi 
QKPTPEFKQQLAAFSKRVAGAVTELIQAAEAMKGTEWVDPEDPTVIAETELLGAAASI 
EAAAKKLEQLKPRAKPKQADETLDFEEQILEAAKSIAAATSALVKSASAAQRELVAOG 
KVGS I PANAADDGQWSQGLI SAARMVAAATSSLCEAANASVQGHASEEKLISSAKQVA 
ASTAQLLVACKVKADQDSEAMRRLQAAGNAVKRASDNLVRAAQKAAFGKADDDDVWE 
TKFVGGIAQIIAAQEEMLKKERELEEARKKLAQIRQQQYKFLPTELREDEG 



Further analysis of the NOV21a protein yielded the following properties shown in 
Table 21B. 



Table 21B. Protein Sequence Properties NOV21a 


PSort 
analysis: 


0.5964 probability located in mitochondrial matrix space; 0.3037 probability 
located in mitochondrial inner membrane; 0.3037 probability located in 
mitochondrial intermembrane space; 0.3037 probability located in 
mitochondrial outer membrane 


SignalP 
analysis: 


No Known Signal Sequence Predicted 



A search of the NOV21a protein against the Geneseq database, a proprietary 
database that contains sequences published in patents and patent publication, yielded 
several homologous proteins shown in Table 21C. 
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Table 21C. Geneseq Results for NOV21a 


Geneseq 
Identifier 


Protein/Organism/Length 
[Patent #, Date] 


NOV21a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Region 


Expect 
Value 


AAB41087 


Human ORFXORF851 
polypeptide sequence SEQ ID 
NO: 1702 - Homo sapiens, 2541 
aa. [W02 0005 8473 -A2, 05-OCT- 
2000] 


1..2543 
1..2540 


1913/2546 (75%) 
2231/2546 (87%) 


0.0 


AAM39312 


Human polypeptide SEQ ID NO 
2457 - Homo sapiens, 1 165 aa. 
[WO200153312-A1, 26-JUL- 
2001] 


1381. .2545 
1-1 165 


1161/1165 (99%) 
1163/1165 (99%) 


0.0 


AAM79794 


Human protein SEQ ID NO 3440 
- Homo sapiens, 1 177 aa. 
[WO200157190-A2, 09-AUG- 
2001] 


1378..2545 
10..1177 


1156/1168 (98%) 
1160/1168 (98%) 


0.0 


AAM41098 


Human polypeptide SEQ ID NO 
6029 - Homo sapiens, 1 177 aa. 
[WO200153312-A1, 26-JUL- 
2001] 


1 378..2545 
10..1177 


1156/1168 (98%) 
1160/1168 (98%) 


0.0 


AAM41079 


Human polypeptide SEQ ID NO 
6010 - Homo sapiens, 1 177 aa. 
[WO200153312-A1, 26-JUL- 
2001] 


1378..2545 
10..1177 


1156/1168 (98%) 
1160/1168 (98%) 


0.0 


In a BLAST search of public sequence databases, the NOV21a protein was found to 
have homology to the proteins shown in the BLASTP data in Table 21D. 


Table 21D. Public BLASTP Results for NOV21a 


Protein 
Accession 
Number 


Protein/Organism/Length 


NOV21a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Portion 


Expect 
Value 


Q9Y490 


Talin - Homo sapiens (Human), 
2541 aa. 


1 ..2543 
1 ..2540 


1910/2546 (75%) 
2230/2546 (87%) 


0.0 


P26039 


Talin — Mus musculus (Mouse), 1 1 ..2543 
2541 aa. J 1..2540 


1907/2546 (74%) 
2230/2546 (86%) 


0.0 


Q9UPX3 


KIAA1027 PROTEIN - Homo 


85S..2543 
1..1694 


1262/1694 (74%) 
1483/1694 (87%) 


0.0 
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(fragment). 








Q9VSL8 


CG6831 PROTEIN (TALIN) - 
Drosophila melanogaster (Fruit 
fly), 2836 aa. 


1..2532 
1..2534 


1197/2563 (46%) 
1707/2563 (65%) 


0.0 


Q9Y4G6 


KIAA0320 PROTEIN - Homo 
sapiens (Human), 949 aa 
(fragment). 


1597..2545 
1..949 


947/949 (99%) 
948/949 (99%) 


0.0 



PFam analysis predicts that the NOV21a protein contains the domains shown in the 
Table 2 IE. 
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Table 21E. Domain Analysis of NOV21a 


Pfam Domain 


NOV21a Match 
Region 


Identities/ 
Similarities 
for the Matched 
Region 


Expect 


ubicjuitin* domain 1 of 1 


64 88 


o/z/ /o) 

20/27 (74%) 




Band 41: domain 1 of 1 


123..316 


67/211 (32%) 
172/211 (82%) 


1 .3e-92 


IRS: domain 1 of 1 


312..404 


19/109(17%) 
46/109 (42%) 


1.2 


I_LWEQ: domain 1 of 5 


674..768 


31/98 (32%) 
59/98 (60%) 


11 


transport_prot: domain 1 
of 1 


667..814 


24/182(13%) 
88/182(48%) 


10 


IJLWEQ: domain 2 of 5 


852..894 


18/47 (38%) 
31/47 (66%) 


2.4e+02 


Vinculin: domain 1 of 1 


860..903 


12/48 (25%) 
30/48 (62%) 


1.3 


I_LWEQ: domain 3 of 5 


925-984 


21/62 (34%) 
37/62 (60%) 


5.9e+04 


TP methylase: domain 1 
of 1 


861.. 1036 


26/226(12%) 
105/226(46%) 


8 


Apolipoprotein: domain 
1 of 1 


981. .1229 


48/288(17%) 
141/288 (49%) 


3.5 


CAP: domain 1 of 1 


917..1354 


94/557(17%) 
209/557 (38%) 


4.4 
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IJLWEQ: domain 4 of 5 


1529.. 1545 


10/17(59%) 
13/17 (76%) 


56 


STAT: domain 1 of 1 


1660.. 1821 


35/211 (17%) 
95/211 (45%) 


8.2 


LEA: domain 1 of 1 


1768.. 1834 


15/76(20%) 
42/76 (55%) 


7 


Histone HNS: domain 1 
of 1 


2232..23S6 


29/143 (20%) 
63/143 (44%) 


3.7 


I_LWEQ: domain 5 of 5 


2345..2536 


100/202(50%) 
183/202(91%) 


2e-101 



Example 22. 

The NOV22 clone was analyzed, and the nucleotide and predicted polypeptide 
sequences are shown in Table 22A. 



Table 22A. NOV22 Sequence Analysis 




SEQIDNO:77 (2214 bp 


NOV22a, 

CG57551-01 DNA Sequence 


ATTCCTCCCTGCCCCTCGTGCAGCCGCTGCCATGGCCCAGACACTGCAGATGGAGATC 


CCGAACTTCGGCAACAGCATCCTGGAGTGCCTCAATGAACAGCGGCTGCAGGGCCTGT 
ACTGTGACGTGTCAGTGGTGGTCAAGGGCCATGCCTTCAAGGCCCACCGGGCCGTGCT 




GAGCTGCCGGCGGCTGTGCAGCCCCAGTCTTTCCAGCAGATCCTCAGCTTCTGCTACA 
CGGGCCGGCTGAGCATGAACGTGGGCGACCAGTTCCTGCTCATGTACACGGCTGGCTT 
CCTGCAGATCCAGGAGATCATGGAGAAGGGCACCGAGTTCTTCCTCAAGGTGAGCTCC 
CCGAGCTGCGACTCCCAGGGCCTGCATGCGGAGGAGGCCCCATCGTCGGAGCCCCAGA 
GCCCCGTGGCGCAGACATCGGGCTGGCCAGCCTGTAGCACCCCGCTGCCCCTCGTGTC 
GCGGGTGAAGACGGAGCAGCAGGAGTCGGACTCCGTGCAGTGCATGCCCGTGGCCAAG 
CGGCTGTGGGACAGTGGCCAGAAGGAGGCTGGGGGCGGCGGCAATGGCAGCCGCAAGA 
TGGCCAAGTTCTCCACGCCGGACCTGGCTGCCAACCGGCCTCACCAGCCCCCGCCACC 




CCAGCCGGTGGGGTGGCAGCAGCAGGGGGTGTGGTGAGTGGGCCCAGCACGTCGGAGC 
GGACCAGCCCAGGCACCTCAAGCGCCTACACCAGCGACAGCCCTGGCTCCTACCACAA 
TGAGGAGGACGAGGAGGAGGATGGTGGCGAGGAGGGCATGGATGAGCAGTACCGGCAG 
ATCTGCAACATGTACACCATGTACAGCATGATGAACGTCGGCCAGACAGCCGAGAAGG 
TGGAGGCCCTCCCGGAGCAGGTAGCCCCCGAGTCCCGAAATCGCATCCGGGTTCGGCA 
AGACCTGGCGTCTCTCCCGGCTGAACTTATCAACCAGATTGGGAACCGCTGCCACCCC 
AAGCTCTACGACGAGGGCGACCCCTCTGAGAAGCTGGAGCTGGTGACAGGCACCAACG 
TGTACATCACAAGGGCGCAGCTGATGAACTGCCACGTCAGCGCAGGCACGCGGCACAA 
GGTCCTACTGCGGCGGCTCCTGGCCTCCTTCTTTGACCGGAACACGCTGGCCAACAGC 
TGCGGCACCGGCATCCGCTCTTCTACCAACGATCCCCGTCGGAAGCCCCTGGACAGCC 
GCGTGCTCCACGCTGTCAAGTACTACTGCCAGAACTTCGCCCCCAACTTCAAGGAGAG 
CGAGATGAATGCCATCGCGGCCGACATGTGCACCAACGCCCGCCGCGTCGTGCGCAAG 
AGCTGGATGCCCAAGGTCAAGGTGCTCAAGGCTGAGGATGACGCCTACACCACCTTCA 
TCAGTGAAACGGGCAAGATCGAGCCGGACATGATGGGTGTGGAGCATGGCTTCGAGAC 
CGCCAGCCACGAGGGCGAGGCGGGTCCCATCGCTGAAGCCCTGCAGTAACCCGCCCAG 
CCTCCCGCGGGGCCGCACACTTCCCCTCCCAACACACACACACACCTGCCATCTTGGT 


CATGAGCTACTGTCTGTCCCTCCCCAGGACCCGCGGTGGGTGCTGCATGTTCCCGGCC 


CTCTGCCCCTCCTGTCCTACCCCCTTTCCCCACCGAGAGCTGGGCCGGGAGAGGACCG 


CAGGGCAGGTGGCGTGAGGTCCGTGTTGCCTTCTTTAACACACACTCGTGCAGTGGGG 


GAGTTCTGGCTCCCCAACCTAACCCCTAGCCGTCATCTCCACACTCACCAGGCCCACC 


AGGGGAGGGGGCTGGCCTGGGGGTCTTGGGAAGGCCCCTCCCCAGGCCTTAGGCCACC 


TCGCGGAAGCCTTCAGCCTCCGCCCCTCACTGCAGCCCCTTGGGACTTGAGGGGGGCC 


CCAGGGGTTCTCAGGACCCCTCCCACCACCTCCCAGTGCTTCCACGTCTCCAAAAGCG 


CCTTCCTGTCACCCTCGTCTATCCCTGCGCCTGGGGGCTGGGGTAGGCGAGGCCGTGG 


GGACTACCCATTTTATAGCTGGGGAAACAGGCTCCGAGAAATTGCACAACCGACCTCA 


GGTGGCCGGC 




ORF Start: ATG at 32 


ORF Stop: TAA at 1613 




SEQ ID NO: 78 


527 aa iMW at 57283. 8kD 
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NOV22a, 
CG57551-01 Protein Sequence 



^4AQTLQMEIPNFGNSILECLNEQRLQGLYCDVSVWKGHAFKAHRAVLAASSSYFRDL 
FNNSRSAWELPAAVQPQSFQQILSFCYTGRLSMNVGDQFLLMYTAGFLQIQEIMEKG 
TEFFLKVSSPSCDSQGLHAEEAPSSEPQSPVAQTSGWPACSTPLPLVSRVKTEQQESD 
SVQCMPVAKRLWDSGQKEAGGGGNGSRKMAKFSTPDLAANRPHQPPPPQQAPWAAAQ 
PAVAAGAGQPAGGVAAAGGWSGPSTSERTSPGTSSAYTSDSPGSYHNEEDEEEDGGE 
EGMDEQYRQICNMYTMYSWOTNVGQTAEKVEALPEQVAPESRNRIRVRQDLASLPAELI 
NQIGNRCHPKLYDEGDPSEKLELVTGTNVYITRAQLMNCHVSAGTRHKVLLRRLLASF 
FDRNTLANSCGTGIRSSTNDPRRKPLDSRVLHAVKYYCQNFAPNFKESEMNAIAADMC 
TNARRWRKSWMPKVKVLKAEDDAYTTFISETGKIEPDMMGVEHGFETASHEGEAGPI 
AEALQ 



Further analysis of the NOV22a protein yielded the following properties shown in 
Table 22B. 



Table 22B. Protein Sequence Properties NOV22a 


PSort 
analysis: 


0.6000 probability located in nucleus; 0.1000 probability located in 
mitochondrial matrix space; 0.1000 probability located in lysosome (lumen); 
0.0000 probability located in endoplasmic reticulum (membrane) 


SignalP 
analysis: 


No Known Signal Sequence Predicted 



A search of the NOV22a protein against the Geneseq database, a proprietary 
database that contains sequences published in patents and patent publication, yielded 
several homologous proteins shown in Table 22C. 



Table 22C. Geneseq Results for NOV22a 


Geneseq 
Identifier 


Protein/Organism/Length 
[Patent #, Date] 


NOV22a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Region 


Expect 
Value 


AAB41621 


Human ORFXORF1385 
polypeptide sequence SEQ ID 
NO:2770 - Homo sapiens, 228 aa. 
[WO200058473-A2, 05-OCT- 
2000] 


300..527 
1..228 


228/228(100%) 
228/228(100%) 


e-131 


ABB 171 17 


Human nervous system related 
polypeptide SEQ ID NO 5774 - 
Homo sapiens, 190 aa. 
[WO200159063-A2, 16-AUG- 
2001] 


409..501 
1..93 


64/94 (68%) 
73/94 (77%) 


7e-29 


AAG78615 


Human zinc finger transcription 
factor BioZFTF45 - Homo sapiens, 
413 aa. [CN1299825-A, 20-JUN- 
2001] 


5..159 
7.. 170 


62/164(37%) 
92/164 (55%) 


2e-25 


AAY73351 


HTRM clone 1484257 protein 


7. .291 
1.211 


83/291 (28%) 
124/291 (42%) 


8e-18 
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[W09957144-A2, ll-NOV-1999] 








AAM41058 


Human polypeptide SEQ ID NO 
5989 - Homo sapiens, 804 aa. 
[WO2001 533 1 2-A1 , 26-JUL-200 1 ] 


7..291 
2..271 


84/295 (28%) 
123/295 (41%) 


2e-17 



In a BLAST search of public sequence databases, the NOV22a protein was found to 
have homology to the proteins shown in the BLASTP data in Table 22D. 



Table 22D. Public BLASTP Results for NOV22a 


Protein 
Accession 
Number 


Protein/Organism/Length 


NOV22a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Portion 


Expect 
Value 


Q96RE7 


NAC1 PROTEIN - Homo sapiens 
(Human), 527 aa. 


1..527 
1..527 


526/527 (99%) 
526/527 (99%) 


0.0 


035260 


NAC-1 PROTEIN - Rattus 
norvegicus (Rat), 514 aa. 


1..527 
1..514 


462/530 (87%) 
475/530 (89%) 


0.0 


Q9CZ72 


493051 1N13RIK PROTEIN - 
Mus musculus (Mouse), 514 aa. 


1..527 
1..514 


462/530 (87%) 
476/530 (89%) 


0.0 


Q96BF6 


SIMILAR TO RIKEN CDNA 
0610020102 GENE - Homo 
sapiens (Human), 587 aa. 


1..501 
1..478 


289/522 (55%) 
335/522 (63%) 


e-140 


AAH22103 


RIKEN CDNA 0610020102 
GENE - Mus musculus (Mouse), 
586 aa. 


1..485 
1..459 


281/502 (55%) 
327/502 (64%) 


e-139 



PFam analysis predicts that the NOV22a protein contains the domains shown in the 
Table 22E. 



Table 22E. Domain Analysis of NOV22a 


Pfam Domain 


NOV22a Match Region 


Identities/ 
Similarities 
for the Matched Region 


Expect Value 


BTB: domain 1 of 1 


14..124 


40/143 (28%) 
88/143 (62%) 


6.2e-23 



Example 23. 



The NOV23 clone was analyzed, and the nucleotide and predicted polypeptide 
sequences are shown in Table 23A. 
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Table 23 A. NOV23 Sequence Analysis 




SEQ ID NO: 79 


1497 bp 


NOV23a, 

CG5741 1-01 DNA Sequence 


ATGGCCACTGCACAGGTGGAACTGGTGeAGGGTGGTCCCCGGGCTCCAGTAGGGGAGA 
AGCTGGAGCTCGTCCTGTCGAACCTGCAGGCAGACGTCCTGGAGTTGCTGCTGGAGTT 
TGTCTACACGGGCTCCCTGGTCATCGACTCGGCCAACGCCAAGACACTGCTGGAGGCG 
GCCAGCAAGTrCCAGTTCCACACCTTCTGCAAAGTCTGCGTGTCCTTTCTCGAGAAGC 
AGCTGACGGCCAGCAACTGCCTGGGCGTGCTGGCCATGGCCGAGGCCATGCAGTGCAG 
CGAGCTCTACCACATGGCCAAGGCCTTCGCGCTGCAGATCTTCCCCGAGGTGGCCGCC 

GCCTCAACACCAAGGCTGAGGAGCTGGTGTACGAGACAGTCATCAAGTGGATCAAGAA 
GGACCCCGCGACACGCACACAGCTGCAGTACGCGGCTGAGCTCCTGGCCGTGGTCCGC 
CTCCCCTTCATCCACCCCAGCTACCTGCTCAATGTGGTTGACAATGAAGAGCTGATCA 
AGTCATCAGAAGCCTGCCGGGACCTGGTGAACGAGGCCAAACGCTACCATATGCTGCC 
CCACGCCCGCCAGGAGATGCAGACGCCCCGAACCCGGCCGCGCGTCCCTGCAGGTGTG 
GCTGAGGTCATCGTCTTGGTTGGGGGCCGTCAGATGGTGGGGATGACCCAGCGCTCGC 
TGGTGGCCGTCACCTGCTGGAACCCGCAGAACAACAAGTGGTACCCCTrGGCCTCGCT 
GCCCTTCTATGACCGCGAGTTCTTCAGTGTAGTGAGTGCAGGGGACAACATCTACCTC 
TCAGGTGGGATGGAATCAGGGGTGACGCTGGCTGATGTCTGGTGCTACATGTCCCTGC 
TTGATAACTGGAACCTCGTCTCCAGAATGACAGTCCCCCGCTGTCGGCACAATAGCCT 
CGTCTACGATGGGAAGATTTACACCCTCGGGGGACTTGGCGTGGCAGGCAACGTGGAC 
CACGTGGAGGTCCCTGCAGGTGTGGCTGAGGTCATCGTCTTGGTTGGGGGCCGTCAGA 
TGGTGGGGATGACCCAGCGCTCGCTGGTGGCCGTCACCTGCTGGAACCCGCAGAACAA 
CAAGTGGTACCCCTTGGCCTCGCTGGGTGGGATGGAATCAGGGGTGACGCTGGCTGAT 
GTCTGGTGCTACATGTCCCTGCTTGATAACTGGAACCTCGTCTCCAGAATGACAGTCC 
CCCGCTGTCGGCACAATAGCCTCGTCTACGATGGGAAGATTTACACCCTCGGGGGACT 




ACCCTCCTCCCCCACATGCCCTGCCCTGTGTTCAGACACGGCTGCGTCGTGATAAAGA 
AATATATTCAAAGCGGCTGACATCAGCAGAAAGCCCACGATAAGACT 




ORF Start: ATG at 1 


OPvF Stop: TGA at 1468 




SEQ ID NO: 80 


489 aa [mW at 54208.2kD 


NOV23a, 

CG574 11-01 Protein Sequence 


MATAQVELVQGGPRAPVGEKLELVLSNLQADVLELLLEFVYTGSLVIDSANAKTLLEA 
ASKFQFHTFCKVCVSFLEKQLTASNCLGVLAMAEAMQCSELYHMAKAFALQIFPEVAA 
QEEILSISKDDFIAYVSNDSLNTKAEELVYETVIKWIKKDPATRTQLQYAAELLAWR 
LPFIHPSYLLNWDNEELIKSSEACEDLVNEAKRYHMLPHARQEMQTPRTRPRVPAGV 
AEVIVLVGGRQMVGMTQRSLVAVTCWNPQNNKWYPLASLPFYDREFFSWSAGDNIYL 
SGGMESGVTIJUDWCYMSLLDNWNLVSRMTVPRCRHNSLVYDGKIYTLGGLGVAGNVD 
HVEVPAGVAEVIVLVGGRQMVGMTQRSLVAVTCWNPQNNKWYPLASLGGMESGVTLAD 
VWCYMSLLDNWNLVSRMTVPRCRHNSLVYDGKIYTLGGLGVAGNVDHVEAYEPTTNTW 
TLLPHMPCPVFRHGCWIKKYIQSG 



Further analysis of the NOV23a protein yielded the following properties shown in 
Table 23B. 



Table 23B. Protein Sequence Properties NOV23a 


PSort 
analysis: 


0.6500 probability located in cytoplasm; 0.2271 probability located in 
lysosome (lumen); 0.1000 probability located in mitochondrial matrix space; 
0.0000 probability located in endoplasmic reticulum (membrane) 


SignalP 
analysis: 


No Known Signal Sequence Predicted 



A search of the NOV23a protein against the Geneseq database, a proprietary 
database that contains sequences published in patents and patent publication, yielded 
several homologous proteins shown in Table 23C. 
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Table 23C. Geneseq Results for NOV23a 


Geneseq 
Identifier 


Protein/Organism/Length [Patent 
#, Date] 


NOV23a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Region 


Expect 
Value 


AAB40940 


Human ORFX ORF704 polypeptide 
sequence SEQ ID NO: 1408 - Homo 
sapiens, 335 aa. [WO200058473- 
A2, 05-OCT-2000] 


19..351 
4..334 


317/333 (95%) 
320/333 (95%) 


e-180 


AAM3871 1 


Human polypeptide SEQ ID NO 
1 856 - Homo sapiens, 574 aa. 
[WO200153312-A1, 26-JUL-2001] 


22„472 
78..559 


151/488 (30%) 
222/488 (44%) 


2e-61 


AAB43090 


Human ORFX ORF2854 
polypeptide sequence SEQ ID 
NO:5708 - Homo sapiens, 506 aa. 
[WO200058473-A2, 05-OCT-2000] 


22..468 
9..487 


150/491 (30%) 
241/491 (48%) 


3e-59 


AAM38956 


Human polypeptide SEQ ID NO 
2101 - Homo sapiens, 587 aa. 
[WO200153312-A1, 26-JUL-2001] 


22..468 
90..568 


149/491 (30%) 
240/491 (48%) 


le-58 


AAM94018 


Human stomach cancer expressed 
polypeptide SEQ ID NO 106 - 
Homo sapiens, 568 aa. 
[WO200109317-A1, 08-FEB-2001] 


25..470 
76..553 


148/490 (30%) 
231/490(46%) 


3e-56 



In a BLAST search of public sequence databases, the NOV23a protein was found to 
have homology to the proteins shown in the BLASTP data in Table 23D. 



Table 23D. Public BLASTP Results for NOV23a 


Protein 
Accession 
Number 


Protein/Organism/Length 


NOV23a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Portion 


Expect 
Value 


Q96CT2 


HYPOTHETICAL 76.8 KDA 
PROTEIN - Homo sapiens 
(Human), 707 aa (fragment). 


19..489 
203..707 


390/507 (76%) 
406/507 (79%) 


0.0 


Q96PW7 


KIAA1921 PROTEIN - Homo 
sapiens (Human), 545 aa 
(fragment). 


19..489 
41. .545 


390/507 (76%) 
406/507 (79%) 


0.0 


Q96BF0 


SIMILAR TO HYPOTHETICAL 
PROTEIN FLJ14106 - Homo 
sapiens (Human), 503 aa. 


19..351 
172..502 


329/333 (98%) 
330/333 (98%) 


0.0 
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Q9D5K3 


4930429H24RIK PROTEIN - Mus 
musculus (Mouse), 484 aa. 


33..485 
1..477 


165/492 (33%) 
248/492 (49%) 


2e-66 


Q9UH77 


Kelch-like protein 3 - Homo 
sapiens (Human), 587 aa. 


22..468 
90..568 


150/491 (30%) 
241/491 (48%) 


le-58 



PFam analysis predicts that the NOV23a protein contains the domains shown in the 
Table 23E. 



Table 23E. Domain Analysis of NOV23a 


Pfam Domain 


NOV23a Match Region 


Identities/ 
Similarities 
for the Matched Region 


Expect Value 


BTB: domain 1 of 1 


4..79 


24/143 (17%) 
53/143 (37%) 


3.7 


Kelch: domain 1 of 4 


223..272 


9/50(18%) 
28/50 (56%) 


0.94 


Kelch: domain 2 of 4 


275..320 


11/47(23%) 
27/47 (57%) 


0.016 


Kelch: domain 3 of 4 


322.396 


14/75(19%) 
44/75 (59%) 


3.3e-05 


Kelch: domain 4 of 4 


426..471 


19/47(40%) 
35/47 (74%) 


7.2e-10 



Example 24. 

The NOV24 clone was analyzed, and the nucleotide and predicted polypeptide 
sequences are shown in Table 24A. 



Table 24A. NOV24 Sequence Analysis 




SEQIDNO:81 1 4268 bp 


NOV24a, 

CG57399-01 DNA Sequence 


ATGACCTGGGACACAGCTCTCTGGACCTCA6TTTTTCTGATTGGGCTCCTTCCTACCC 
TTGGTTTCGCTAATTGCATCCTCCAGACTTCTGGTAAAATGTGTACTTTAAGAGGTAG 
ATACCCCCAGCCCCCACAACCACCTCTCTGCTTGTCTCCCCTAGTCCACCAGCTCCGA 
CCAGCAGACATCAAAGTGGTGGCCGCCCTGGGTAATGATGAAACCTTCCAGGAAAGTG 
GTGCAGGGCAGCTAAGTGAGCCTGACCCCAGGCAGTGGTCCTGGCCACAGGCCTGCTT 
GCCTGGGGTAAAAAAGGAAATGCAAGATGTGGTAGGTGAGAGAACGCCGAGCCGTCGC 
CGCAGCCTCCGCCGCCGAGAAGCCCTTGTTCCCGCTGCTGGGAAGGAGAGTCTGTGCC 
GACAAGATATTTTCATTTCCTTGTTGGAAATTATCAAGCATTTTCCTCCCTCCCCTCA 
GGACATCAACCTGGAGAAAGACTGGAAGCTGGTCACACTCTTCATTGGGGTCAACGAC 
TTGTGTCATTACTGTCCACTTGTTCAGGGCCCCGTTATAGACCTGGGTGGGATGGATA 
CCCTCCACTCCCTGCAGCTCCCAAGGGCTTTCGTCAACGTGGTGGAGGTCATGGAGCT 
GGCTAGCCTGTACCAGGGCCAAGGCGGGAAATGTGCCATGCTGGCAGCTCAGGAAGCC 
TGGAACAGCCTCCTGGCCTCCAGCAGGTACAGTGAGCAGGAGTCCTTCACCGTGGTTT 
TCCAGCCTTTCTTCTATGAGACCACCCCATCTGACCCCCGACTCCAGGATTCTACCAC 
GCrGGCCTGGCATCTCTGGAATAGGATGATGGAGCCAGCAGGAGAGAAAGATGAGCCA 
TTGAGTGTAAAACACGGGAGGCCAATGAAGTGTCCCTCTCAGGAGAGCCCCTATCTGT 
TCAGCTACAGAAACAGCAACTACCTGACCAGACTGCAGAAACCCCAAGACAAGCTTGT 
AAGAGAAGGAGCGGAAATCAGATGTCCTGACAAAGACCCCTCCGATACGGTTCCCACC 
TCAGTTCATAGGCTGAAGCCGGCTGACATCAACGTAATTGGAGCCCTGGGTGACTCTC 
TCACGGCAGGCAATGGGGCCGGGTCCACACCTGGGAACGTCTTGGACGTCTTGACTCA 
3 - L i T I j, ITAC A 
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CTGGCAGACATCCTCCGGGAATTCAACCCTTCCCTGAAGGGCTTCTCTGTTGGCACTG 
GGAAAGAAACCAGTCCTAATGCCTTCTTAAACCAGGCTGTGGCAGGAGGCCGAGCTGA 
GCAGGCCAGGAGGCTGGTGGACCTGATGAAGAATGACACGAGGATACACTTTCAGGAA 
GACTGGAAGATAATAACCCTGTTTATAGGCGGCAATGACCTCTGTGATTTCTGCAATG 
ATCTGGTACACTATTCTCCCCAGAACTTCACAGACAACATTGGAAAGGCCCTGGACAT 
CCTCCATGCTGAGTCTCAGGTTCCTCGGGCATTTGTGAACCTGGTGACGGTGCTTGAG 
ATCGTCAACCTGAGGGAGCTGTACCAGGAGAAAAAAGTCTACTGCCCAAGGATGATCC 
TCAGGTCACTGTGTCCCTGTGTCCTGAAGTTTGATGATAACTCAACAGAACTTGCTAC 
CCTCATCGAATTCAACAAGAAGTTTCAGGAGAAGACCCACCAACTGATTGAGAGTGGG 
CGATATGACACAAGGGAAGATTTTACTGTGGTTGTGCAGCCGTTCTTTGAAAACGTGG 
ACATGCCAAAGACCCAGGAAGGATTGCCTGACAACTCTTTCTTCGCTCCTGACTGTTT 
CCACTTCAGCAGCAAGTCTCACTCCCGAGCAGCCAGTGCTCTCTGGAACAATATGCTG 
GAGCCTGTTGGCCAGAAGACGACTCGTCATAAGTTTGAAAACAAGATCAATATCACAT 
GTCCGTCACAGGTCCAGCCGTTTCTGAGGACCTACAAGAACAGCATGCAGGGTCATGG 
GACCTGGCTGCCATGCAGGGACAGAGCCCCTTCTGCCTTGCACCCTACCTCAGTGCAT 
GCCCTGAGACCTGCAGACATCCAAGTTGTGGCTGCTCTGGGGGATTCTCTGACCGCTG 
GCAATGGAATTGGCTCCAAACCAGACGACCTCCCCGATGTCACCACACAGTATCGGGG 
ACTGTCATACAGTGCAGGAGGGGACGGCTCCCTGGAGAATGTGACCACCTTACCTAGT 
TCTATCCTTCGGGAGTTTAACAGAAACCTCACAGGCTACGCCGTGGGCACGGGTGATG 
CCAATGACACGAATGCATTCCTCAATCAAGCTGTTCCCGGAGCAAAGGCTAGGGATCT 
TATGAGCCAAGrCCAAACTCTGATGCAGAAGATGAAAGATGATCATAGAGTAAATTTC 
CATGAAGACTGGAAGGTCATCACAGTGCTGATCGGAGGCAGCGATTTATGTGACTACT 
GCACAGATTCGAATCTGTATTCTGCAGCCAACTTTGTTCACCATCTCCGCAATGCCTT 
GGACGTCCTGCATAGAGAGGTGCCCAGAGTCCTGGTCAACCTCGTGGACTTCCTGAAC 
CCCACTATCATGCGGCAGGTGTTCCTGGGAAACCCAGACAAGTGCCCAGTGCAGCAGG 
CCAGCGTTTTGTGTAACTGCGTTCTGACCCTGCGGGAGAACTCCCAAGAGCTAGCCAG 
GCTGGAGGCCTTCAGCCGAGCCTACCAGAGCAGCATGCGCGAGCTGGTGGGGTCAGGC 
CGCTATGACACGCAGGAGGACTTCTCTGTGGTGCTGCAGCCCTTCTTCCAGAACATCC 
AGCTCCCTGTCCTGCAGGATGGGCTCCCAGATACGTCCTTCTTTGCCCCAGACTGCAT 
CCACCCAAATCAGAAATTCCACTCCCAGCTGGCCAGAGCCCTTTGGACCAATATGCTT 
GAACCACTTGGAAGCAAAACAGAGACCCTGGACCTGAGAGCAGAGATGCCCATCACCT 
GTCCCACTCAGAATGAGCCCTTCCTGAGAACCCCTCGGAATAGTAACTACACGTACCC 
CATCAAGCCAGCCATTGAGAACTGGGGCAGTGACTTCCTGTGTACAGAGTGGAAGGCT 
TCCAATAGTGTTCCAACCTCTGTCCACCAGCTCCGACCAGCAGACATCAAAGTGGTGG 
CCGCCCTGGGTGACTCTCTGACTACAGCAGTGGGAGCTCGACCAAACAACTCCAGTGA 
CCTACCCACATCTTGGAGGGGACTCTCTTGGAGCATTGGAGGGGATGGGAACTTGGAG 
ACTCACACCACACTGCCCAGTATTCTGAAGAAGTTCAACCCTTACCTCCTTGGCTTCT 
CTACCAGCACCTGGGAGGGGACAGCAGGACTAAATGTGGCAGCGGAAGGGGCCAGAGC 
TAGGAGGGACATGCCAGCCCAGGCCTGGGACCTGGTAGAGCGAATGAAAAACAGCCCC 
ATACACTTTCAGGAAGACTGGAAGATAATAACCCTGTTTATAGGCGGCAATGACCTCT 

CATCCTCTCTGAGGAGCTCCCAAGGGCTTTCGTCAACGTGGTGGAGGTCATGGAGCTG 
GCTAGCCTGTACCAGGGCCAAGGCGGGAAATGTGCCATGCTGGCAGCTCAGAACAACT 
GCACTTGCCTCAGACACTCGCAAAGCTCCCTGGAGAAGCAAGAACTGAAGAAAGTGAA 
CTGGAACCTCCAGCATGGCATCTCCAGTTTCTCCTACTGGCACCAATACACACAGCGT 
GAGGACTTTGCGGTTGTGGTGCAGCCTTTCTTCCAAAACACACTCACCCCACTGAACA 
GAGGGGACACTGACCTCACCTTCTTCTCCGAGGACTGTTTTCACTTCTCAGACCGCGG 
GCATGCCGAGATGGCCATCGCACTCTGGAACAACATGCTGGAACCAGTGGGCCGCAAG 
ACTACCTCCAACAACTTCACCCACAGCCGAGCCAAACTCAAGTGCCCCTCTCCTGTGA 
GTCCTTACCTCTACACCCTGCGGAACAGCCGATTGCTCCCAGACCAGGCTGAAGAAGC 
CCCCGAGGTGCTCTACTGGGCTGTCCCAGTGGCAGCGGGAGTCGGCCTTGTGGTGGGC 
ATCATCGGGACAGTGGTCTGGAGGTGCAGGAGAGGTGGCCGGAGGGAAGATCCTCCAA 
TGAGCCTGCGCACTGTGGCCCTCTAGGCCCGGGG 




ORF Start: ATG at 1 


ORF Stop: TAG at 4258 




SEQ ID NO: 82 


1419 aa ImW at 158435.1kD 


NOV24a, 

CG57399-01 Protein 
Sequence 


MTWDTALWTSVFLIGLLPTLGFANCILQTSGKMCTLRGRYPQPPQPPLCLSPLVHQLR 
PADIKWAALGNDETFQESGAGQLSEPDPRQWSWPQACLPGVKKEMQDWGERTPSRR 
RSLRRREALVPAAGKESLCRQDIFISLLEIIKHFPPSPQDINLEKDWKLVTLFIGVND 
LCHYCPLVQGPVIDLGGMDTLHSLQLPRAFVNWEVMELASLYQGQGGKCAMIAAQEA 
WNSLIASSRYSEQESFTWFQPFFYETTPSDPRLQDSTTLAWHLWNRMMEPAGEKDEP 
LSVKHGRPMKCPSQESPYLFSYRNSNYLTRLQKPQDKLVREGAEIRCPDKDPSDTVPT 
SVHRLKPADINVIGALGDSLTAGNGAGSTPGNVLDVLTQYRGLSWSVGGDENIGTVTT 
LADILREFNPSLKGFSVGTGKETSPNAFLNQAVAGGRAEQARRLVDLMKNDTRIHFQE 
DWKIITLFIGGNDLCDFCNDLVHYSPQNFTDNIGKALDILHAESQVPRAFVNLVTVLE 
IVNLRELYQEKKVYCPRMILRSLCPCVLKFDDNSTELATLIEFNKKFQEKTHQLIESG 
RYDTREDFTVWQPFFENVDMPKTQEGLPDNSFFAPDCFHFSSKSHSRAASALWNNML 
EPVGQKTTRHKFENKINITCPSQVQPFLRTYKNSMQGHGTWLPCRDRAPSALHPTSVH 
ALRPADIQWAALGDSLTAGNGIGSKPDDLPDVTTQYRGLSYSAGGDGSLENVTTLPS 
SILREFNRNLTGYAVGTGDANDTNAFLNQAVPGAKARDLMSQVQTLMQKMKDDHRVNF 
HEDWKVITVLIGGSDLCDYCTDSNLYSAANFVHHLRNALDVLHREVPRVLVNLVDFLN 
PTIMRQVFLGHPDKCPVQQASVLCNCVLTLRENSQELARLEAFSRAYQSSMRELVGSG 
RYDTQEDFSWLQPFFONIQLPVLQDGLPDTSFFAPDCIHPNQKFHSQLARALWTNML 
EPLGSKTETLDLRAEMPITCPTQNEPFLRTPRNSNYTYPIKPAIENWGSDFLCTEWKA 
SNSVPTSVHQLRPADIKWAALGDSLTTAVGARPNNSSDLPTSWRGLSWSIGGDGNLE 
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THTTLPSILKKFNPYLLGFSTSTWEGTAGLNVAAEGARARRDMPAQAWDLVERMKNSP 
IHFQEDWKI I TLF I GGNDLCDFCNDLVGEYVQH I QQALDI LSEELPRAFVNWEVMEL 
ASLYQGQGGKCAMIJ^QNNCTCLRHSQSSLEKQELKKVNWNLQHGISSFSYWHQYTQR 
EDFAVWQPFFQNTLTPLNRGDTDLTFFSEDCFHFSDRGHAEMAIALWNNMLEPVGRK 
TTSNNFTHSRAKLKCPSPVSPYLYTLRNSRLLPDQAEEAPEVLYWAVPVAAGVGLWG 
I IGTWWRCRRGGRREDPPMSLRTVAL 




SEQIDNO:83 1 1624 bp 


NOV24b, 

CG57399-02 DNA Sequence 


GCCGGCTGACATCAATGTAATTGGAGCCCTGGGTGACTCTCTCACGGCAGGCAATGGG 


GCCGGGTCCACACCTGGGAACGTCTTGGACGTCTTGACTCAGTACCGAGGCCTGTCCT 


GGAGCGTCGGCGGAGATGAGAACATCGGCACCGTTACCACCCTGGCGAACATCCTCCG 


AATGCCTTCTTAAACCAGGCTGTGGCAGGAGGCCGAGCTGAGGATCTACCTGTCCAGG 


CCAGGAGGCTGGTGGACCTGATGAAGAATGACACGAGGATACACTTTCAGGAAGACTG 
GAAGATAATAACCCTGTTTATAGGCGGCAATGACCTCTGTGATTTCTGCAATGATCTG 
GTCCACTATTCTCCCCAGAACTTCACAGACAACATTGGAAAGGCCCTGGACATCCTCC 
ATGCTGAGGTTCCTCGGGCATTTGTGAACCTGGTGACGGTGCTTGAGATCGTCAACCT 
GAGGGAGCTGTACCAGGAGAAAAAAGTCTACTGCCCAAGGATGATCCTCAGGTCTCTG 
TGTCCCTGTGTCCTGAAGTTTGATGATAACTCAACAGAACTTGCTACCCTCATCGAAT 
TCAACAAGAAGTTTCAGGAGAAGACCCACCAACTGATTGAGAGTGGGCGATATGACAC 
AAGGGAAGATTrTACTGTGGTTGTGCAGCCGTTCTTTGAAAACGTGGACATGCCAAAG 
ACCTCGGAAGGATTGCCTGACAACTCTTTCTTCGCTCCTGACTGTTTCCACTTCAGCA 
GCAAGTCTCACTCCCGAGCAGCCAGTGCTCTCTGGAACAATATGCTGGAGCCTGTTGG 
CCAGAAGACGACTCGrCATAAGTTTGAAAACAAGATCAATATCACATGTCCGAACCAG 
GTCCAGCCGTTTCTGAGGACCTACAAGAACAGCATGCAGGGTCATGGGACCTGGCTGC 
CATGCAGGGACAGAGCCCCTTCTGCCTTGCACCCTACCTCAGTGCATGCCCTGAGACC 
TGCAGACATCCAAGTTGTGGCTGCTCTGGGGGATTCTCTGACCGCTGGCAATGGAATT 
GGCTCCAAACCAGACGACCTCCCCGATGTCACCACACAGTATCGGGGACTGTCATACA 
GAGAAAGTAAACCAGGGTTCTTATCAGACTCCTGGGTCAGCAAATCCAACAGGAAATG 
CACCAGAAAAGCACCAAATCCCTGAATCTTCACCTCCCCGCTTGCATGTATACGTGTA 
CACGTGGTGTTCCTACGTCTCTGTTTACTGTCTTTATGTGTTTATTCATGTTGTCTTG 


TAGTCACACAGCTGCCTTTACATATATGTACACATCTGCACAGAAAACCTCTGAAACC 


CATCGCACACTTCGAGAGGCCATAACCAAGACACAATCACAATCAGCCATGTCTTGAA 




AGAAGCTCAGGGTGTGTCAGGCGAGCGGTTGCGTGTAGATATTCTCAAGTTTCTTTCT 
CTCCTAATAAAGTTCTCATTCCTGTAGGCTTCAAAGTAAGTGGCGAGTAGCTCAGAAT 




ORF Start: ATG at 
311 


ORF Stop: TG A at 1241 




SEQ ID NO: 84 


310 aa |MWat35240.6kD 


NOV24b, 

CG57399-02 Protein 
Sequence 


MKNDTR I HFQEDWKI ITLFIGGNDLCDFCNDLVHYSPQNFTDN IGKALDI LHAEVPRA 
FVNLVTVLEIVNLRELYQEKKVYCPRMILRSLCPCVLKFDDNSTELATLIEFNKKFQE 
KTHQLIESGRYDTREDFTVWQPFFENVDMPKTSEGLPDNSFFAPDCFHFSSKSHSRA 
ASALWNNMLEPVGQKTTRHKFENKINITCPNQVQPFLRTYKNSMQGHGTWLPCRDRAP 
SALHPTSVHAX,RPADIQWAALGDSLTAGNGIGSKPDDLPDVTTQYRGL,SYRESKPGF 

LSI i) y t i Of 




SEQ ID NO: 85 |4425 bp 


NOV24c, 

CG57399-03 DNA Sequence 


CTGGAGCATTCTGGCATGGGGCTGCGGCCAGGCATTTTCCTCCTGGAGCTGCTGCTGC 
TTCTGGGGCAAGGTACCCCTCAGATCCATACCTCTCCTAGAAAGAGTACATTGGAAGG 
GCAGCTATGGCCAGAGACAGTTCACTCTCTGAAGCCTTCTGATATTAAATTTGTGGCA 
GCCATTGGCAATCTGGAAATTGTGCCAGACCCAGGGACGGGCGATCTGGAGAAGCAAG 
ACGAAAGGCCACAGCAGGTGTGCATGGGAGTGATGACAGTCCTTTCAGACATCATCAG 
ATATTTCAGTCCTTCTGTTCCAATGCCTGTGTGCCACACTGGAAAGAGAGTCATACCC 
CACGATGGTGCTGAGGACTTGTGGATTCAGGCTCAAGAACTGGTGAGAAACATGAAAG 
AGAACCAACTTGACTTTCAATTTGACTGGAAGCTCATCAATGTGTTCTTCAGTAATGC 
AAGCCAGTGTTACCTGTGCCCCTCTGCTCAACAGAATGGGCTTGCGGCGGGCGGCGTG 
GATGAGCTGATGGGGGTGCTGGACTACCTGCAGCAGGAGGTGCCCAGAGCATTTGTAA 
ACCTGGTGGACCTCTCTGAGGTTGCAGAGGTCTCTCGTCAGTATCACGGCACTTGGCT 
CAGCCCTGCACCAGAGCCCTGTAATTGCTCAGAGGAGACCACCCGGCTGGCCAAGGTG 
GTGATGCAGTGGTCTTATCAGGAAGCCTGGAACAGCCTCCTGGCCTCCAGCAGGTACA 
GTGAGCAGGAGTCCTTCACCGTGGTTTTCCAGCCTTTCTTCTATGAGACCACCCCATC 
TGACCCCCGACTCCAGGATTCTACCACGCTGGCCTGGCATCTCTGGAATAGGATGATG 
GAGCCAGCAGGAGAGAAAGATGAGCCATTGAGTGTAAAACACGGGAGGCCAATGAAGT 
GTCCCTCTCAGGAGAGCCCCTATCTGTTCAGCTACAGAAACAGCAACTACCTGACCAG 
ACTGCAGAAACCCCAAGACAAGCTTGAGGTAAGAGAAGGAGCGGAAATCAGATGTCCT 
GACAAAGACCCCTCCGATACGGTTCCCACCTCAGTTCATAGGCTGAAGCCGGCTGACA 
TCAACGTAATTGGAGCCCTGGGTGACTCTCTCACGGCAGGCAATGGGGCCGGGTCCAC 
ACCTGGGAACGTCTTGGACGTCTTGACTCAGTACCGAGGCCTGTCCTGGAGCGTCGGC 




CTTCCCTGAAGGGCTTCTCTGTTGGCACTGGGAAAGAAACCAGTCCTAATGCCTTCTT 
AAACCAGGCTGTGGCAGGAGGCCGAGCTGAGCAGGCCAGGAGGCTGGTGGACCTGATG 
AAGAATGACACGAGGATACACTTTCAGGAAGACTGGAAGATAATAACCCTGTTTATAG 
GCGGCAATGACCTCTGTGATTTCTGCAATGATCTGGTACACTATTCTCCCCAGAACTT 
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c 





CACAGACAACATTGGAAAGGCCCTGGACATCCTCCATGCTGAGGTTCCTCGGGCATTT 
GTGAACCTGGTGACGGTGCTTGAGATCGTCAACCTGAGGGAGCTGTACCAGGAGAAAA 
AAGTCTACTGCCCAAGGATGATCCTCAGGTCACTGTGTCCCTGTGTCCTGAAGTTTGA 
TGATAACTCAACAGAACTTGCTACCCTCATCGAATTCAACAAGAAGTTTCAGGAGAAG 
ACCCACCAACTGATTGAGAGTGGGCGATATGACACAAGGGAAGATTTTACTGTGGTTG 
TGCAGCCGTTCTTTGAAAACGTGGACATGCCAAAGACCCAGGAAGGATTGCCTGACAA 
CTCTTTCTTCGCTCCTGACTGTTTCCACTTCAGCAGCAAGTCTCACTCCCGAGCAGCC 
AGTGCTCTCTGGAACAATATGCTGGAGCCTGTTGGCCAGAAGACGACTCGTCATAAGT 
TTGAAAACAAGATCAATATCACATGTCCGAACCAGGTAGAGTGGCCGTTTCTGAGGAC 
CTACAAGAACAGCATGCAGGGTCATGGGACCTGGCTGCCATGCAGGGACAGAGCCCCT 
TCTGCCTTGCACCCTACCTCAGTGCATGCCCTGAGACCTGCAGACATCCAAGTTGTGG 
CTGCTCTGGGGGATTCTCTGACCGCTGGCAATGGAATTGGCTCCAAACCAGACGACCT 
CCCCGATGTCACCACACAGTATCGGGGACTGTCATACAGTGCAGGAGGGGACGGCTCC 
CTGGAGAATGTGACCACCTTACCTGATATCCTTCGGGAGTTTAACAGAAACCTCACAG 




TCCCGGAGCAAAGGCTAGGGATCTTATGAGCCAAGTCCAAACTCTGATGCAGAAGATG 
AAAGATGATCATAGAGTAAATTTCCATGAAGACTGGAAGGTCATCACAGTGCTGATCG 
GAGGCAGCGATTTATGTGACTACTGCACAGATTCGAATCTGTATTCTGCAGCCAACTT 
TGTTCACCATCTCCGCAATGCCTTGGACGTCCTGCATAGAGAGGTGCCCAGAGTCCTG 
GTCAACCTCGTGGACTTCCTGAACCCCACTATCATGCGGCAGGTGTTCCTGGGAAACC 
CAGACAAGTGCCCAGTGCAGCAGGCCAGCGTTTTGTGTAACTGCGTTCTGACCCTGCG 
GGAGAACTCCCAAGAGCTAGCCAGGCTGGAGGCCTTCAGCCGAGCCTACCAGAGCAGC 

TGCAGCCCTTCTTCCAGAACATCCAGCTCCCTGTCCTGCAGGATGGGCTCCCAGATAC 
GTCCTTCTTTGCCCCAGACTGCATCCACCCAAATCAGAAATTCCACTCCCAGCTGGCC 
AGAGCCCTTTGGACCAATATGCTTGAACCACTTGGAAGCAAAACAGAGACCCTGGACC 
TGAGAGCAGAGATGCCCATCACCTGTCCCACTCAGAATGAGCCCTTCCTGAGAACCCC 
TCGGAATAGTAACTACACGTACCCCATCAAGCCAGCCATTGAGAACTGGGGCAGTGAC 
TTCCTGTGTACAGAGTGGAAGGCTTCCAATAGTGTTCCAACCTCTGTCCACCAGCTCC 
GACCAGCAGACATCAAAGTGGTGGCCGCCCTGGGTGACTCTCTGACTGTGGCAGTGGG 
AGCTCGACCAAACAACTCCAGTGACCTACCCACATCTTGGAGGGGACTCTCTTGGAGC 
ATTGGAGGGGATGGGAACTTGGAGACTCACACCACACTGCCCGACATTCTGAAGAAGT 
TCAACCCTTACCTCCTTGGCTTCTCTACCAGCACCTGGGAGGGGACAGCAGGACTAAA 
TGTGGCAGCGGAAGGGGCCAGAGCTAGGGACATGCCAGCCCAGGCCTGGGACCTGGTA 
GAGCGAATGAAAAACAGCCCCCAGGACATCAACCTGGAGAAAGACTGGAAGCTGGTCA 
CACTCTTCATTGGGGTCAACGACTTGTGTCATTACTGTGAGAATCCGGTAGGCGAATA 
TGTTCAGCACATCCAACAGGCCCTGGACATCCTCTCTGAGGAGCTCCCAAGGGCTTTC 
GTCAACGTGGTGGAGGTCATGGAGCTGGCTAGCCTGTACCAGGGCCAAGGCGGGAAAT 
GTGCCATGCTGGCAGCTCAGAACAACTGCACTTGCCTCAGACACTCGCAAAGCTCCCT 
GGAGAAGCAAGAACTGAAGAAAGTGAACTGGAACCTCCAGCATGGCATCTCCAGTTTC 




TCCAAAACACACTCACCCCACTGAACAGAGGGGACACTGACCTCACCTTCTTCTCCGA 
GGACTGTTTTCACTTCTCAGACCGCGGGCATGCCGAGATGGCCATCGCACTCTGGAAC 
AACATGCTGGAACCAGTGGGCCGCAAGACTACCTCCAACAACTTCACCCACAGCCGAG 
CCAAACTCAAGTGCCCCTCTCCTGAGAGCCCTTACCTCTACACCCTGCGGAACAGCCG 
ATTGCTCCCAGACCAGGCTGAAGAAGCCCCCGAGGTGCTCTACTGGGCTGTCCCAGTG 
GCAGCGGGAGTCGGCCTTGTGGTGGGCATCATCGGGACAGTGGTCTGGAGGTGCAGGA 
GAGGTGGCCGGAGGGAAGATCCTCCAATGAGCCTGCGCACTGTGGCCCTCTAGGCCCG 
GGGGTGGGTCCTCACCCTAAACTCCCTATAGCCACTCTCTTCACCGCCCTCTGCCCCA 


GCCACTCCCGGCCACCAGGACATGCTTCAATGCCTGGTGCCATAGGAAGCCCAGGGGA 


CAGTCACAACTTCTTGG 




ORF Start: ATG at 16 


ORF Stop: TAG at 4285 




SEQ ID NO: 86 


1423 aa JMW at 159352.7kD 


NOV24c, 

CG57399-03 Protein 
Sequence 


MGLRPGIFLLELLLLLGQGTPOIHTSPRKSTLEGQLWPETVHSLKPSDIKFVAAIGNL 
EIVPDPGTGDLEKQDERPQQVCMGVMTVLSDIIRYFSPSVPMPVCHTGKRVIPHDGAE 
DLWIQAQELVRNMKEKQLDFQFDWKLINVFFSNASQCYLCPSAQQNGLAAGGVDELMG 
VLDYLQQEVPRAFVNLVDLSEVAEVSRQYHGTWLSPAPEPCNCSEETTRLAKWMQWS 
YQEAWNSLLASSRYSEQESFTWFQPFFYETTPSDPRLQDSTTLAWHLWNRMMEPAGE 
KDEPLSVKHGRPMKCPSQESPYLFSYRNSNYLTRLQKPQDKLEVREGAEIRCPDKDPS 
DTVPTSVHRLKPADINVIGALGDSLTAGNGAGSTPGNVLDVLTQYRGLSWSVGGDENI 
GTVTTLADILREFNPSLKGFSVGTGKETSPNAFLNQAVAGGRAEQARRLVDLMKNDTR 
IHFQEDWKIITLFIGGNDLCDFCNDLVHYSPQNFTDNIGKALDILHAEVPRAFVNLVT 
VLEIVNLRELYQEKKVYCPRMILRSLCPCVLKFDDNSTELATLIEFNKKFQEKTHQLI 
ESGRYDTREDFTWVQPFFENVDMPKTQEGLPDNSFFAPDCFHFSSKSHSRAASALWN 
NMLEPVGQKTTRHKFENKINITCPNQVEWPFLRTYKNSMQGHGTWLPCRDRAPSALHP 
TSVHALRPADIQWAALGDSLTAGNGIGSKPDDLPDVTTQYRGLSYSAGGDGSLENVT 
TLPDILREFNRNLTGYAVGTGDANDTNAFLNQAVPGAKARDLMSQVQTLMQKMKDDHR 
VNFHEDWKVITVLIGGSDLCDYCTDSNLYSAANFVHHLRNALDVLHREVPRVLVNLVD 
FLNPTIMRQVFLGNPDKCPVQQASVLCNCVLTLRENSQELARLEAFSRAYQSSMRELV 
GSGRYDTQEDFSWLQPFFQNIQLPVLQDGLPDTSFFAPDCIHPNQKFHSQLARALWT 
NMLEPLGSKTETLDLRAEMPITCPTQNEPFLRTPRNSNYTYPIKPAIENWGSDFLCTE 
WKASNSVPTSVHQLRPADIKWAALGDSLTVAVGARPNNSSDLPTSWRGLSWSIGGDG 
NLETHTTLPDILKKFNPYLLGFSTSTWEGTAGLNVAAEGARARDMPAQAWDLVERMKN 
SPQDINLEKDWKLVTLFIGVNDLCHYCENPVGEYVQHIQQALDILSEELPRAFVNWE 
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VMELASLYQGQGGKCAMLAAQNNCTCLRHSQSSLEKQELKKVNWNLQHGISSFSYVJHQ 
YTQREDFAVWQPFFQNTLTPLNRGDTDLTFFSEDCFHFSDRGHAEMAIALWNNMLEP 
VGRKTTSKNFTHSRAKLKCPSPESPYLYTLRNSRLLPDQAEEAPEVLyWAVPVAAGVG 
LWGIIGTWWRCRRGGRREDPPMSLRTVAL 



Sequence comparison of the above protein sequences yields the following sequence 
relationships shown in Table 24B. 



Table 24B. Comparison of NOV24a against NOV24b through NOV24c. 


Protein Sequence 


NOV24a Residues/ 
Match Residues 


Identities/ 
Similarities for the Matched Region 


NOV24b 


454.-748 
1 ..293 


283/295 (95%) 
285/295 (95%) 


NOV24c 


27..1419 
23..1423 


1211/1426 (84%) 
1261/1426 (87%) 



Further analysis of the NOV24a protein yielded the following properties shown in 
Table 24C. 



Table 24C. Protein Sequence Properties NOV24a 


PSort 
analysis: 


0.6850 probability located in endoplasmic reticulum (membrane); 0.6400 
probability located in plasma membrane; 0.4600 probability located in Golgi 
body; 0.1080 probability located in nucleus 


SignalP 
analysis: 


Likely cleavage site between residues 24 and 25 



A search of the NOV24a protein against the Geneseq database, a proprietary 
database that contains sequences published in patents and patent publication, yielded 
several homologous proteins shown in Table 24D. 



Table 24D. Geneseq Results for NOV24a 


Geneseq 
Identifier 


Protein/Organism/Length 
[Patent #, Date] 


NOV24a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Region 


Expect 
Value 


AAW30751 


Rat phospholipase-B/lipase - 
Rattus rattus, 1450 aa. 
[JP09248190-A, 22-SEP-1997] 


50.. 1403 
60.. 1447 


911/1404(64%) 
1077/1404 (75%) 


0.0 


ABB 11053 


Human phospholipase B 


985. .1203 
45. .267 


205/224 (91%) 
213/224 (94%) 


e-117 
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Homo sapiens, 267 aa. 
[WO200157188-A2, 09-AUG- 
2001] 








AAM25824 


Human protein sequence SEQ ID 
NO: 1339 - Homo sapiens, 267 aa. 
[WO200153455-A2, 26-JUL- 
2001] 


985.. 1203 
45. .267 


205/224(91%) 
213/224 (94%) 


e-117 


AAM95420 


Human reproductive system 
reiateu anxigen oJc.v<> iu inc. h-u/o 
- Homo sapiens, 148 aa. 
[WO200155320-A2, 02-AUG- 
2001] 


979.. 1106 


110/130 (84%) 

117/1 "?fl fSQO/O 

11// 1 5\) \oy /o) 


3e-56 


ABB11237 


Human phospholipase homologue, 
SEQ ID NO: 1 607 - Homo sapiens, 
132 aa. [WO200157188-A2, 09- 
AUG-2001] 


393.-478 
43..132 


84/90 (93%) 
86/90 (95%) 


3e-40 



In a BLAST search of public sequence databases, the NOV24a protein was found to 
have homology to the proteins shown in the BLASTP data in Table 24E. 



Table 24E. Public BLASTP Results for NOV24a 


Protein 
Accession 
Number 


Protein/Organism/Length 


NOV24a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Portion 


Expect 
Value 


Q05017 


Phospholipase ADRAB-B precursor 
(EC 3.1.-.-) - Oryctolagus cuniculus 
(Rabbit), 1458 aa. 


6..1416 
2.. 1456 


1042/1466 (71%) 
1179/1466(80%) 


0.0 


070320 


PHOSPHOLIPASE B - Cavia 
porcellus (Guinea pig), 1463 aa. 


7..1414 
3.. 1458 


965/1474 (65%) 
1135/1474(76%) 


0.0 


054728 


PHOSPHOLIPASE B - Rattus 
norvegicus (Rat), 1450 aa. 


50..1403 
60.. 1447 


911/1404(64%) 
1077/1404 (75%) 


0.0 


Q96DP9 


CDNA FLJ30866 FIS, CLONE 
FEBRA20041 10, HIGHLY 
SIMILAR TO PHOSPHOLIPASE 
ADRAB-B PRECURSOR (EC 3.1.- 
.-) - Homo sapiens (Human), 270 
aa. 


454..714 
1..259 


257/261 (98%) 
258/261 (98%) 


e-151 


Q9N2Z4 


HYPOTHETICAL 41.4 KDA 
PROTEIN - Caenorhabditis elegans, 
377 aa. 


343..673 
37..369 


130/343 (37%) 
202/343 (57%) 


le-59 
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PFam analysis predicts that the NOV24a protein contains the domains shown in the 
Table 24F. 



Table 24F. Domain Analysis of NOV24a 




NOV24a Match 
Region 


Identities/ 
Similarities 
for the Matched 
Region 


Expect 
Value 


Lipase GDSL: domain 1 of 
3 


360..484 


54/147 (37%) 
116/147 (79%) 


4.8e-42 


Lipase GDSL: domain 2 of 
3 


705..834 


57/147 (39%) 
116/147 (79%) 


4.5e-44 


SecA_protein: domain 1 of 
1 


834..851 


10/20 (50%) 
17/20 (85%) 


4.9 


Vitellogenin N: domain 1 
of 1 


11 07.. 1124 


8/18(44%) 
17/18(94%) 


3.8 


Lipase GDSL: domain 3 of 
3 


1062.. 1185 


48/147 (33%) 
114/147 (78%) 


6.3e-37 



Example 25. 

The NOV25 clone was analyzed, and the nucleotide and predicted polypeptide 
sequences are shown in Table 25A. 



Table 25A. NOV25 Sequence Analysis 




SEQIDNO:87 1 1348 bp 


NOV25a, 

CG593 11-01 DNA Sequence 


CTGGGTCGCCCCTGTTCTACCCAGATTGGGATGGCAGCGACGCTGATCCTGGAGCCCG 
CGGGCCGCTGCTGCTGGGACGAGCCGCTGCGCATCGCAGTGCGCGGCCTGGCCCCGGA 
GCAGCCAGTCACGCTGCGCACGTCCCTGCGCGACGAAGAGGGCGCGCTCTTCCGGGCC 
CACGCGCGCTACCGTGCCGACGCCCGCGACGAGCTGGACCTGGAGCGCGCGCCCGCGC 
TGGGAGGCAGCTTCGCGGGGCTCCAGCCCATGGGGCTGCTGTGGGCGTTGGAGCCCGA 
GAAAGCCTTGGTGCGGCTGGTGAAGCGCGACGTGCGGACGCCCTTCGCCGTGGAGCTG 
GAAGTGCTGGACGGCCACGACACCGAGCCCGGGCGGCTGCTGTGCCTGGCGCAGAACA 
AGCGCGACTTTCTCCGGCCGGGGGTGCGGCGCGAGCCGGTGCGCGCGGGCCCGGTGCG 




GGGAGCAGCAGGGGCCTTTGTGAATACAGGGCCAGCCTCCTGGCCGGACATGGTTTTG 
CTGTGCTTGCCCTGGCTTATTTCAGATTTGAAGACCTCCCCGAAGATCTGAATGATGT 
ACATCTGGAGTACTTTGAAGAAGCCGTGGACTTTATGCTGCAGCATCCAAAGGTGAAA 
GGTCCTAGTATTGCGCTTCTTGGATTTTCCAAAGGAGGTGACCTGTGTCTCTCAATGG 
CTTCTTTCTTGAAGGGCATCACAGCCACTGTACTTATCAATGCCTGTGTAGCCAACAC 
AGTAGCTCCTCTACATTACAAGGATATGATTATTCCTAAACTTGTCGATGATCTAGGA 
AAAGTAAAAATCACTAAGTCAGGATTTCTCACTTTTATGGACACTTGGAGCAATCCAC 




GTTTATTGTTGGCATGGATGATCAAAGCTGGAAGAGTGAATTCTATGCTCAGATAGCC 
TCTGAAAGGCTACAAGCTCATGGGAAAGAAAGACCCCAGATAATCTGTTACCCAGAAA 
CTGGTCACTGTATTGACCCACCTTATTTTCCTCCTTCTAGAGCTTCTGTGCACGCTGT 
TTTGGGTGAGGCAATATTCTATGGAGGTGAGCCAAAGGCTCACTCAAAGGCACAGGTA 
GATGCCTGGCAGCAAATTCAAACTTTCTTCCATAAACATCTCAATGGTAAAAAATCTG 
TCAAGCACAGCAAAATATAACATTGTAGCCACAGACCAGATACCATTAATAAAAATCC 
TATTCATACAACTT 




ORF Start: ATG at 31 (ORF Stop: TAA at 1294 
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SEQ ID NO: 88 |421 aa |MW at 4681 5.4kD 


NOV25a, 

CG5931 1-01 Protein Sequence 


MAATLILEPAGRCCWDEPLRIAVRGLAPEQPVTLRTSLRDEEGALFRAHARYRADARD 
ELDLERAPALGGSFAGLQPMGLLWALEPEKALVRLVKRDVRTPFAVELEVLDGHDTEP 
GRLLCLAQNKRDFLRPGVRREPVRAGPVRAALFLPPDRGPFPGIIDLFGSSRGLCEYR 
ASLLAGHGFAVLALAYFRFEDLPEDLNDVHLEYFEEAVDFMLQHPKVKGPSIALLGFS 
KGGDLCLSMASFLKGITATVLINACVANTVAPLHYKDMIIPKLVDDLGKVKITKSGFL 
TFMDTWSNPLEEHNHQSLVPLEKAQVPFLFIVGMDDQSWKSEFYAQIASERLQAHGKE 
RPQIICYPETGHCIDPPYFPPSRASVHAVLGEAIFYGGEPKAHSKAQVDAWQQIQTFF 
HKHLNGKKSVKHSKI 




SEQ ID NO: 89 1 1021 bp 


NOV25b, 

CG5931 1-02 DNA Sequence 


AGATTGGGATGGCAGCGACGCTGATCCTGGAGCCCGCGGGCCGCTGCTGCTGGGACGA 
GCCGCTGCGCATCGCAGTGCGCGGCCTGGCCCCGGAGCAGCCAGTCACGCTGCGCACG 
TCCCTGCGCGACGAAGAGGGCGCGCTCTTCCGGGCCCACGCGCGCTACCGTGCCGACG 
CCTCTAATCCCGGCACTTTGGGGGGCCAAGGCAGGGGGCCCTTTCCTGGGATCATTGA 
TCTGTTTGGGAGCAGCAGGGGCCTTTGTGAATACAGGGCCAGCCTCCTGGCCGGACAT 
GGTTTTGCTGTGCTTGCCCTGGCTTATTTCAGATTTGAAGACCTCCCCGAAGATCTGA 
ATGATGTACATCTGGAGTACTTTGAAGAAGCCGTGGACTTTATGCTGCAGCATCCAAA 
GGTGAAAGGTCCTAGTATTGCGCTTCTTGGATTTTCCAAAGGAGGTGACCTGTGTCTC 
TCAATGGCTTCTTTCTTGAAGGGCATCACAGCCACTGTACTTATCAATGCCTGTGTAG 
CCAACACAGTAGCTCCTCTACATTACAAGGATATGATTATTCCTAAACTTGTCGATGA 
TCTAGGAAAAGTAAAAATCACTAAGTCAGGATTTCTCACTTTTATGGACACTTGGAGC 
AATCCACTGGAGGAACACAATCACCAAAGTCTTGTTCCATTGGAAAAGGCGCAGGTGC 
CCTTCTTGTTTATTGTTGGCATGGATGATCAAAGCTGGAAGAGTGAATTCTATGCTCA 
GATAGCCTCTGAAAGGCTACAAGCTCATGGGAAAGAAAGACCCCAGATAATCTGTTAC 
CCAGAAACTGGTCACTGTATTGACCCACCTTATTTTCCTCCTTCTAGAGCTTCTGTGC 
ACGCTGTTTTGGGTGAGGCAATATTCTATGGAGGTGAGCCAAAGGCTCACTCAAAGGC 
ACAGGTAGATGCCTGGCAGCAAATTCAAACTTTCTTCCATAAACATCTCAATGGTAAA 
AAATCTGTCAAGCACAGCAAAATATAACATTGTAG 




ORE Start: ATG at 9 


ORF Stop: TAA at 1011 




SEQ ID NO: 90 


334 aa ImW at 36926.0kD 


NOV25b, 

CG5931 1-02 Protein Sequence 


MAATLILEPAGRCCWDEPLRIAVRGLAPEQPVTLRTSLRDEEGALFRAHARYRADASN 
PGTLGGQGRGPFPGIIDLFGSSRGLCEYRASLLAGHGFAVLALAYFRFEDLPEDLNDV 
HLEYFEEAVDFMLQHPKVKGPSIALLGFSKGGDLCLSMASFLKGITATVLINACVANT 
VAPLHYKDMI I PKLVDDLGKVK I TKSGFLTFMDTWSNPLEEHNHQSLVPLEKAQVPFL 
FIVGMDDQSWKSEFYAQIASERLQAHGKERPQIICYPETGHCIDPPYFPPSRASVHAV 
LGEAIFYGGEPKAHSKAQVDAWQQIQTFFHKHLNGKKSVKHSKI 




SEQ ID NO: 91 |l021 bp 


NOV25c, 

CG5931 1-03 DNA Sequence 


AGATTGGGATGGCAGCGACGCTGATCCTGGAGCCCGCGGGCCGCTGCTGCTGGGACGA 
GCCGCTGCGCATCGCAGTGCGCGGCCTGGCCCCGGAGCAGCCAGTCACGCTGCGCACG 
TCCCTGCGCGACGAAGAGGGCGCGCTCTTCCGGGCCCACGCGCGCTACCGTGCCGACG 
CCTCTAATCCCGGCACTTTGGGAGGCCAAGGCAGGGGGCCCTTTCCTGGGATCATTGA 
TCTGTTTGGGAGCAGCAGGGGCCTTTGTGAATACAGGGCCAGCCTCCTGGCCGGACAT 
GGTTTTGCTGTGCTTGCCCTGGCTTATTTCAGATTTGAAGACCTCCCCGAAGATCTGA 
ATGATGTACATCTGGAGTACTTTGAAGAAGCCGTGGACTTTATGCTGCAGCATCCAAA 
GGTGAAAGGTCCTAGTATTGCGCTTCTTGGATTTTCCAAAGGAGGTGACCTGTGTCTC 
TCAATGGCTTCTTTCTTGAAGGGCATCACAGCCACTGTACTTATCAATGCCTGTGTAG 
CCAACACAGTAGCTCCTCTACATTACAAGGATATGATTATTCCTAAACTTGTCGATGA 
TCTAGGAAAAGTAAAAATCACTAAGTCAGGATTTCTCACrTTTATGGACACTTGGAGC 
AATCCACTGGAGGAACACAATCACCAAAGTCTTGTTCCATTGGAAAAGGCGCAGGTGC 
CCTTCTTGTTTATTGTTGGCATGGATGATCAAAGCTGGAAGAGTGAATTCTATGCTCA 
GATAGCCTCTGAAAGGCTACAAGCTCATGGGAAAGAAAGACCCCAGATAATCTGTTAC 
CCAGAAACTGGTCACTGTATTGACCCACCTTATTTTCCTCCTTCTAGAGCTTCTGTGC 
ACGCTGTTTTGGGTGAGGCAATATTCTATGGAGGTGAGCCAAAGGCTCACTCAAAGGC 
ACAGGTAGATGCCTGGCAGCAAATTCAAACTTTCTTCCATAAACATCTCAATGGTAAA 
AAATCTGTCAAGCACAGCAAAATATA/ - I i 




ORE Start: ATG at 9 


ORF Stop: TAA at 1011 




SEQ ID NO: 92 


334 aa ]MW at 36926.0kD 


NOV25c, 

CG5931 1-03 Protein Sequence 


MAATLILEPAGRCCWDEPLRIAVRGLAPEQPVTLRTSLRDEEGALFRAHARYRADASN 
PGTLGGQGRGPFPGIIDLFGSSRGLCEYRASLLAGHGFAVLALAYFRFEDLPEDLNDV 
HLEYFEEAVDFMLQHPKVKGPSIALLGFSKGGDLCLSMASFLKGITATVLINACVANT 
VAPLHYKDMI IPKLVDDLGKVKITKSGFLTFMDTWSNPLEEHNHQSLVPLEKAQVPFL 
FIVGMDDQSWKSEFYAQIASERLQAHGKERPQIICYPETGHCIDPPYFPPSRASVHAV 
LGEAIFYGGEPKAHSKAQVDAWQQIQTFFHKHLNGKKSVKHSKI 
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Sequence comparison of the above protein sequences yields the following sequence 
relationships shown in Table 25B. 



Table 25B. Comparison of NOV25a against NOV25b through NOV25c. 


Protein Sequence 


NOV25a Residues/ 
Match Residues 


Identities/ 
Similarities for the Matched Region 


NOV25b 


154..421 
67.334 


268/268 (100%) 
268/268 (100%) 


NOV25c 


154..421 
67..334 


268/268(100%) 
268/268 (100%) 



Further analysis of the NOV25a protein yielded the following properties shown in 
Table 25C. 



Table 25C. Protein Sequence Properties NOV25a 


PSort 
analysis: 


0.4500 probability located in cytoplasm; 0.3630 probability located in 
microbody (peroxisome); 0.1958 probability located in lysosome (lumen); 
0.1000 probability located in mitochondrial matrix space 


SignalP 
analysis: 


No Known Signal Sequence Predicted 



A search of the NOV25a protein against the Geneseq database, a proprietary 
database that contains sequences published in patents and patent publication, yielded 
several homologous proteins shown in Table 25D. 



Table 25D. Geneseq Results for NOV25a 


Geneseq 
Identifier 


Protein/Organism/Length 
[Patent #, Date] 


NOV25a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Region 


Expect 
Value 


AAM41490 


Human polypeptide SEQ ID NO 
6421 - Homo sapiens, 494 aa. 
[WO200153312-A1, 26-JUL- 
2001] 


1..421 
74..494 


288/421 (68%) 
347/421 (82%) 


e-175 


AAM39704 


Human polypeptide SEQ ID NO 
2849 - Homo sapiens, 483 aa. 
[WO200153312-A1, 26-JUL- 
2001] 


1..421 
63..483 


288/421 (68%) 
346/421 (81%) 


e-175 
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AAY71112 


Human Hydrolase protein- 10 
(HYDRL-10) - Homo sapiens, 483 
aa. [WO200028045-A2, 18-MAY- 
2000] 


1..421 
63. .483 


288/421 (68%) 
346/421 (81%) 


e-175 


AAB93479 


Human protein sequence SEQ ID 
NO: 12766 - Homo sapiens, 483 aa. 
[EP1074617-A2, 07-FEB-2001] 


1..421 
63..483 


287/421 (68%) 
346/421 (82%) 


e-175 


AAY07932 


Human secreted protein fragment 
encoded from gene 81 - Homo 
sapiens, 182 aa. [WO9918208-A1, 
15-APR-1999] 


241. .421 
1..181 


181/181 (100%) 
181/181 (100%) 


e-105 



In a BLAST search of public sequence databases, the NOV25a protein was found to 
have homology to the proteins shown in the BLASTP data in Table 25E. 



Table 25E. Public BLASTP Results for NOV25a 


Protein 
Accession 
Number 


Protein/Organism/Length 


NOV25a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Portion 


Expect 
Value 


P49753 


Peroxisomal acyl-coenzyme A 
thioester hydrolase 2 (EC 3.1 .2.2) 
(Peroxisomal long-chain acyl-coA 
thioesterase 2) (ZAP 128) - Homo 
sapiens (Human), 421 aa. 


1..421 
1..421 


288/421 (68%) 
347/421 (82%) 


e-175 


Q9QYR7 


Peroxisomal acyl-coenzyme A 
thioester hydrolase 2 (EC 3.1.2.2) 
(Peroxisomal long-chain acyl-coA 
thioesterase 2) (PTE-la) - Mus 
musculus (Mouse), 432 aa. 


1..421 
1 2..432 


264/421 (62%) 
331/421 (77%) 


e-157 


088267 


Cytosolic acyl coenzyme A thioester 
hydrolase, inducible (EC 3.1.2.2) 
(Long chain acyl-CoA thioester 
hydrolase) (Long chain acyl-CoA 
hydrolase) (CTE-I) (LACH2) (ACH2) 
- Rattus norvegicus (Rat), 419 aa. 


1..421 
1..419 


268/421 (63%) 
318/421 (74%) 


e-153 


Q9QYR9 


Acyl coenzyme A thioester hydrolase, 
mitochondrial precursor (EC 3.1.2.2) 
(Very-long-chain acyl-CoA 
thioesterase) (MTE-I) - Mus musculus 
(Mouse), 453 aa. 


3..413 
44..452 


264/411 (64%) 
321/411 (77%) 


e-153 


055137 


Cytosolic acyl coenzyme A thioester 


1..4I3 
1..411 


262/413(63%) 
319/413 (76%) 


e-153 



180 





(Long chain acyl-CoA thioester 
hydrolase) (Long chain acyl-CoA 
hydrolase) (CTE-I) - Mus musculus 
(Mouse), 419 aa. 









PFam analysis predicts that the NOV25a protein contains the domains shown in the 
Table 25F. 



Table 25F. Domain Analysis of NOV25a 



Pfam Domain 



NOV25a Match Region 



Identities/ 
Similarities 
for the Matched Region 



Expect Value 



No Significant Matches Found 



Example 26. 

The NOV26 clone was analyzed, and the nucleotide and predicted polypeptide 
sequences are shown in Table 26A. 



Table 26A. NOV26 Sequence Analysis 




SEQIDNO:93 1 1375 bp 


NOV26a, 

CG5 93 09-01 DNA Sequence 


GGGACGCCGGACGCCGTCCGGACATTCGGCGCGCTTGCCACGATCTTGGACGGGTCTC 


GGGCCTCGACCTTTGAATTCCCCGCTCCGGCTCCAAGATGTCAGCAACGCTGATCCTG 
GAGCCCCCAGGCCGCTGCTGCTGGAACGAGCCGGTGCGCATTGCCGTGCGCGGCCTGG 
CCCCGGAGCAGCGGGTTACGCTGCGCGCGTCCCTGCGCGACGAGAAGGGCGCGCTCTT 
CCGGGCCCACGCGCGCTACTGCGCCGACGCCCGCGGCGAGCTGGACCTGGAGCGCGCA 
CCCGCGCTGGGCGGCAGCTTCGCGGGACTCGAGCCCATGGGGCTGCTCTGGGCCCTGG 
AACCCGAGAAGCCTTTTTGGCGCTTCCTGAAGCGGGACGTACAGATTCCTTTTGTCGT 
GGAGTTGGAGGTGCTGGACGGCCACGACCCCGAGCCTGGACGGCTGCTGTGCCAGGCG 
CAGCACGAGCGCCACTTCCTCCCGCCAGGGGTGCGGCGCCAGTCGGTGCGAGCGGGCC 

TGACATCTTTGGTATTGGAGGGGGCCTCTTGGAATATCGAGCCAGCCTCCTTGCTGGC 
CATGGCTTTGCCACGTTGGCTCTAGCTTATTATAACTTTGAAGATCTCCCCAATAACA 
TGGACAACATATCCCTGGAGTACTTCGAAGAAGCCGTATGCTACATGCTTCAACATCC 
CCAGGTTAAAGGCCCAGGCATTGGGCTTTTGGGCATTTCTCTAGGAGCTGATATTTGT 
CTCTCAATGGCCTCATTCTTGAAGAATGTCTCAGCCACAGTTTCCATCAATGGATCTG 
GGATCAGTGGGAACACAGCCATCAACTATAAGCACAGTAGCATTCCACCATTGGGCTA 
TGACCTGAGGAGAATCAAGGTAGCTTTCTCAGGCCTCGTGGACATTGTGGATATAAGG 
AATGCTCTCGTAGGAGGGTACAAGAACCCCAGCATGATTCCAATAGAGAAGGCCCAGG 
GGCCCATCCTGCTCATTGTTGGTCAGGATGACCATAACTGGAGAAGTGAGTTGTATGC 
CCAAACAGTCTCTGAACGGTTACAGGCCCATGGAAAGGAAAAACCCCAGATCATCTGT 
TACCCTGGGACTGGGCATTACATCGAGCCTCCTTACTTCCCCCTGTGCCCAGCTTCCC 
TTCACAGATTACTGAACAAACATGTTATATGGGGTGGGGAGCCCAGGGCTCATTCTAA 
GGCCCAGGAAGATGCCTGGAAGCAAATTCTAGCCTTCTTCTGCAAACACCTGGGAGGT 
ACC CAGAAAACAG CTGT CCCTAAATTGTAATG CATTTGTCT 




ORF Start: ATG at 96 


ORF Stop:TAAat 1362 




SEQ ID NO: 94 


422 aa |MWat46455.1kD 


NOV26a, 

CG59309-01 Protein Sequence 


MSATLILEPPGRCCWNEPVRIAVRGLAPEQRVTLRASLRDEKGALFRAHARYCADARG 
ELDLERAPALGGSFAGLEPMGLLWALEPEKPFWRFLKRDVQIPFWELEVLDGHDPEP 
GRLLCQAQHERHFLPPGVRRQSVRAGRVRATLFLPPGEPGPFPGIIDIFGIGGGLLEY 
RASLLAGHGFATIALAYYNFEDLPNNMDNISLEYFEEAVCYMLQHPQVKGPGIGLLGI 
SLGADICLSMASFLKNVSATVSINGSGISGNTAINYKHSSIPPLGYDLRRIKVAFSGL 
VDIVDIRNALVGGYKNPSMIPIEKAQGPILLIVGQDDHNWRSELYAQTVSERLQAHGK 
EKPQIICYPGTGHYIEPPYFPLCPASLHRLLNKHVIWGGEPRAHSKAQEDAWKQILAF 
FCKHLGGTQKTAVPKL 
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Further analysis of the NOV26a protein yielded the following properties shown in 
Table 26B. 



Table 26B. Protein Sequence Properties NOV26a 


PSort 

analysis: 


0.4500 probability located in cytoplasm; 0.2585 probability located in 
lysosome (lumen); 0.1940 probability located in microbody (peroxisome); 
0.1000 probability located in mitochondrial matrix space 


SignalP 
analysis: 


No Known Signal Sequence Predicted 



A search of the NOV26a protein against the Geneseq database, a proprietary 
database that contains sequences published in patents and patent publication, yielded 
several homologous proteins shown in Table 26C. 



Table 26C. Geneseq Results for NOV26a 


Geneseq 
Identifier 


Protein/Organism/Length [Patent 
#,Date] 


NOV26a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Region 


Expect 
Value 


AAM41490 


Human polypeptide SEQ ID NO 
6421 - Homo sapiens, 494 aa. 
[WO200153312-A1, 26-JUL-2001] 


1..422 
74..494 


296/422 (70%) 
341/422 (80%) 


e-179 


AAM39704 


Human polypeptide SEQ ID NO 
2849 - Homo sapiens, 483 aa. 
[WO200153312-AI, 26-JUL-2001] 


1..422 
63..483 


296/422 (70%) 
341/422(80%) 


e-179 


AAY71112 


Human Hydrolase protein- 10 
(HYDRL-10) - Homo sapiens, 483 
aa. [WO200028045-A2, 18-MAY- 
2000] 


1..422 
63..483 


296/422 (70%) 
341/422(80%) 


e-179 


AAB93479 


Human protein sequence SEQ ID 
NO: 12766 - Homo sapiens, 483 aa. 
[EP1074617-A2, 07-FEB-2001] 


1..422 
63..483 


295/422 (69%) 
340/422 (79%) 


e-178 


AAY07932 


Human secreted protein fragment 
encoded from gene 81 - Homo 
sapiens, 182 aa. [WO9918208-A1, 
15-APR-1999] 


242. .422 
1..181 


93/181 (51%) 
123/181 (67%) 


2e-48 



In a BLAST search of public sequence databases, the NOV26a protein was found to 
have homology to the proteins shown in the BLASTP data in Table 26D. 
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Table 26D. Public BLASTP Results for NOV26a 


Protein 
Accession 
Number 


Protein/Organism/Length 


NOV26a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Portion 


Expect 
Value 


Q9QYR8 


PEROXISOMAL LONG CHAIN 
ACYL-COA THIOESTERASE IB - 
Mus musculus (Mouse), 421 aa. 


1..422 
1..421 


312/422 (73%) 
362/422 (84%) 


0.0 


P49753 


Peroxisomal acyl-coenzyme A 
thioester hydrolase 2 (EC 3.1.2.2) 
(Peroxisomal long-chain acyl-coA 
thioesterase 2) (ZAP128) - Homo 
sapiens (Human), 421 aa. 


1..422 
1..421 


296/422 (70%) 
341/422 (80%) 


e-178 


Q9QYR7 


Peroxisomal acyl-coenzyme A 
thioester hydrolase 2 (EC 3.1 .2.2) 
(Peroxisomal long-chain acyl-coA 
thioesterase 2) (PTE-la) - Mus 
musculus (Mouse), 432 aa. 


1..422 
12..432 


281/424 (66%) 
333/424 (78%) 


e-163 


055137 


Cytosolic acyl coenzyme A thioester 
hydrolase, inducible (EC 3.1.2.2) 
(Long chain acyl-CoA thioester 
hydrolase) (Long chain acyl-CoA 
hydrolase) (CTE-I) - Mus musculus 
(Mouse), 419 aa. 


1..422 
1..419 


275/423 (65%) 
330/423 (78%) 


e-162 


088267 


Cytosolic acyl coenzyme A thioester 
hydrolase, inducible (EC 3.1.2.2) 
(Long chain acyl-CoA thioester 
hydrolase) (Long chain acyl-CoA 
hydrolase) (CTE-I) (LACH2) (ACH2) 
- Rattus norvegicus (Rat), 419 aa. 


1..422 
1..419 


276/423 (65%) 
329/423 (77%) 


e-162 



PFam analysis predicts that the NOV26a protein contains the domains shown in the 
Table 26E. 



Table 26E. Domain Analysis of NOV26a 


Pfam Domain 


NOV26a Match Region 


Identities/ 
Similarities 
for the Matched Region 


Expect Value 


DLH: domain 1 of 2 


144.. 188 


17/52 (33%) 
32/52 (62%) 


63 


DLH: domain 2 of 2 


394..411 


9/18(50%) 
13/18(72%) 


2.6 
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Example 27. 

The NOV27 clone was analyzed, and the nucleotide and predicted polypeptide 
sequences are shown in Table 27A. 



Table 27A. NOV27 Sequence Analysis 




SEQ ID NO: 95 


1333 bp 


NOV27a, 

CG57364-01 DNA Sequence 


CCTGGCCCCCAAGCTCCCCACTCTGGTGCCCCGAGCAGCCCTGTGGGCAAGCAGCCGC 


CGCCATGGCCGAGCACCTGGAGCTGCTGGCAGAGATGCCCATGGTGGGCAGGATGAGC 
ACACAGGAGCGGCTGAAGCATGCCCAGAAGCGGCGCGCCCAGCAGGTGAAGATGTGGG 
CCCAGGCTGAGAAGGAGGCCCAGGGCAAGAAGGGTCCTGGGGAGCGTCCCCGGAAGGA 
GGCAGCCAGCCAAGGGCTCCTGAAGCAGGTCCTCTTCCCTCCCAGTGTTGTCCTTCTG 
GAGGCCGCTGCCCGAAATGACCTGGAAGAAGTCCGCCAGTTCCTTGGGAGTGGGGTCA 
GCCCTGACTTGGCCAACGAGGACGGCCTGACGGCCCTGCACCAGTGCTGCATTGATGA 
TTTCCGAGAGATGGTGCAGCAGCTCCTGGAGGCTGGGGCCAACATCAATGCCTGTGAC 
AGTGAGTGCTGGACGCCTCTGCATGCTGCGGCCACCTGCGGCCACCTGCACCTGGTGG 
AGCTGCTCATCGCCAGTGGCGCCAATCTCCTGGCGGTCAACACCGACGGGAACATGCC 
CTATGACCTGTGTGATGATGAGCAGACGCTGGACTGCCTGGAGACTGCCATGGCCGAC 
CGTGGCATCACCCAGGACAGCATCGAGGCCGCCCGGGCCGTGCCAGAACTGCGCATGC 
TGGACGACATCCGGAGCCGGCTGCAGGCCGGGGCAGACCTCCATGCCCCCCTGGACCA 
CGGGGCCACGCTGCTGCACGTCGCAGCCGCCAACGGGTTCAGCGAGGCGGCTGCCCTG 
CTGCTGGAACACCGAGCCAGCCTGAGCGCTAAGGACCAAGACGGCTGGGAGCCGCTGC 
ACGCCGCGGCCTACTGGGGCCAGGTGCCCCTGGTGGAGCTGCTCGTGGCGCACGGGGC 
CGACCTGAACGCAAAGTCCCTGATGGACGAGACGCCCCTTGATGTGTGCGGGGACGAG 




AGAGCCGCCAGCGCTCCTTGCTGCGCCGCCGCACCTCCAGCGCCGGCAGCCGCGGGAA 
GGTGGrGAGGCGGGATGAGCCTAACCCAGCGCAGCGGCTGACGCATGTCCCAGAAGCG 
GCGCGCCCAGCAGGTGAAGATGTGGGCCCAGGCTGAGAAGGAGGCCCAGGGCAAGAAG 
GGTCCTGGGGAGCGTCCCCGGAAGGAGGCAGCCAGCCAAGGGCTCCTGAAGCAGGTCC 








ORF Start: ATG at 63 


ORF Stop: TGA at 1194 




SEQ ID NO: 96 


377 aa [MW at 41019.9kD 


NOV27a, 

CG5 73 64-01 Protein Sequence 


MMHLELLAEMPMVGRMSTQERLKHAQKRRAQQVKMWAQAEKEAQGKKGPGERPRKEA 
ASQGLLKQVLFPPSWLLEAAARNDLEEVRQFLGSGVSPDLANEDGLTALHQCCIDDF 
REMVQQLLEAGANINACDSECWTPLHAAATCGHLHLVELLIASGANLLAVNTDGNMPY 
DLCDDEQTLDCLETAMADRGITQDSIEAARAVPELRMLDDIRSRLQAGADLHAPLDHG 
ATLLHVAAANGFSEAAALLLEHRASLSAKDQDGWEPLHAAAYWGQVPLVELLVAHGAD 
IiNAKSLMDETPLDVCGDEEVRAKLIiELKHKHDALLRAQSRQRSLLRRRTSSAGSRGKV 
VRRDEPNPAQRLTHVPEAARPAGEDVGPG 



Further analysis of the NOV27a protein yielded the following properties shown in 
Table 27B. 



Table 27B. Protein Sequence Properties NOV27a 


PSort 
analysis: 


0.3000 probability located in microbody (peroxisome); 0.3000 probability 
located in nucleus; 0.1547 probability located in lysosome (lumen); 0.1000 
probability located in mitochondrial matrix space 


SignalP 
analysis: 


No Known Signal Sequence Predicted 



A search of the NOV27a protein against the Geneseq database, a proprietary 
database that contains sequences published in patents and patent publication, yielded 
several homologous proteins shown in Table 27C. 
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Table 27C. Geneseq Results for NOV27a 


Geneseq 
Identifier 


Protein/Organism/Length 
[Patent #, Date] 


NOV27a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Region 


Expect 
Value 


AAM40636 


Human polypeptide SEQ ID NO 
5567 - Homo sapiens, 440 aa. 
[WO200153312-A1, 26-JUL- 
2001] 


89..351 
1..263 


262/263 (99%) 
263/263 (99%) 


e-151 


AAM38850 


Human polypeptide SEQ ID NO 
1995 - Homo sapiens, 410 aa. 
[WO200153312-A1, 26-JUL- 
2001] 


11 9..35 1 
1..233 


233/233 (100%) 
233/233 (100%) 


e-132 


AAM78864 


Human protein SEQ ID NO 1526 - 
Homo sapiens, 567 aa. 
[WO200157190-A2, 09-AUG- 
2001] 


1..351 
1..348 


209/351 (59%) 
265/351 (74%) 


e-118 


ABB11817 


Human KIAA0823 protein 
homologue, SEQ ID NO:2187 - 
Homo sapiens, 536 aa. 
[WO200157188-A2, 09-AUG- 
2001] 


45..354 
3..318 


173/316(54%) 
226/316(70%) 


3e-94 


AAM79848 


Human protein SEQ ID NO 3494 - 
Homo sapiens, 536 aa. 
[WO200157190-A2, 09-AUG- 
2001] 


45..354 
3. .318 


173/316(54%) 
226/316(70%) 


3e-94 



In a BLAST search of public sequence databases, the NOV27a protein was found to 
have homology to the proteins shown in the BLASTP data in Table 27D. 



Table 27D. Public BLASTP Results for NOV27a 


Protein 

Accession 
Number 


Protein/ O rgan ism/Length 


NOV27a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Portion 


Expect 
Value 


Q96I34 


UNKNOWN (PROTEIN FOR 
MGC:14333) - Homo sapiens 
(Human), 528 aa. 


1..351 
1..351 


351/351 (100%) 
351/351 (100%) 


0.0 


Q923M0 


MYOSIN PHOSPHATASE 
TARGETING SUBUNIT 3 
MYPT3 - Mus musculus (Mouse), 
524 aa (fragment). 


1..351 
1..351 


301/351 (85%) 
320/351 (90%) 


e-171 
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AAL62093 


PROTEIN PHOSPHATASE 1 
REGULATORY SUBUNIT 16B - 
Mus musculus (Mouse), 568 aa. 


1..351 
1..348 


210/351 (59%) 
266/351 (74%) 


e-118 


Q95N27 


CAAX BOX PROTEIN TIMAP - 
Bos taurus (Bovine), 568 aa. 


1..351 
1..348 


210/351 (59%) 
266/351 (74%) 


e-118 


Q96T49 


CAAX BOX PROTEIN TIMAP - 
Homo sapiens (Human), 567 aa. 


1..351 
1..348 


209/351 (59%) 
265/351 (74%) 


e-117 



PFam analysis predicts that the NOV27a protein contains the domains shown in the 
Table 27E. 



Table 27E. Domain Analysis of NOV27a 


Pfam Domain 


NOV27a Match Region 


Identities/ 
Similarities 
for the Matched Region 


Expect Value 


ank: domain 1 of 5 


70.. 102 


8/33 (24%) 
20/33 (61%) 


99 


ank: domain 2 of 5 


103..135 


16/33 (48%) 
26/33 (79%) 


7.1e-08 


ank: domain 3 of 5 


136..168 


15/33 (45%) 
26/33 (79%) 


2.9e-07 


ank: domain 4 of 5 


231. .263 


16/33(48%) 
24/33 (73%) 


2e-06 


ank: domain 5 of 5 


264..296 


16/33 (48%) 
27/33 (82%) 


2.7e-08 



Example 28. 

The NOV28 clone was analyzed, and the nucleotide and predicted polypeptide 
sequences are shown in Table 28A. 



Table 28A. NOV28 Sequence Analysis 




SEQIDNO:97 1 171 9 bp 


NOV28a, 

CG59348-01 DNA Sequence 


CGGGCACAGGCTCACCCTCGAGTGGCACAGGAATCCCAGGTAGATGACGGCGGCCGCG 


GCGCACCCTCAGGGTCGCAGGGGGTGCTGATCGGGGACAGGCTGTACTCCGGGGTGCT 
CATCACCTTGGAGAACTGCCTCCTGCCTGACGACAAGCTCCGTTTCACGCCGTCCATG 
TCGAGCGGCCTCGACACCGACACAGAGACCGACCTCCGCGTGGTGGGCTGCGAGCTCA 
TCCAGGCGGCCGGTATCCTGCTCCGCCTGCCGCAGGTGGCCATGGCTACCGGGCAGGT 
GTTGTTCCAGCGGTTCTTTTATACCAAGTCCTTCGTGAAGCACTCCATGGAGCATGTG 
TCAATGGCCTGTGTCCACCTGGCTTCCAAGATAGAAGAGGCCCCAAGACGCATACGGG 
ACGTCATCAATGTGTTTCACCGCCTTCGACAGCTGAGAGACAAAAAGAAGCCCGTGCC 
TCTACTACTGGATCAAGATTATGTTAATTTAAAGAACCAAATTATAAAGGCGGAAAGA 
CGAGTTCTCAAAGAGTTGGGTTTCTGCGTCCATGTGAAGCATCCTCATAAGATAATCG 
TTATGTACCTTCAGGTGTTAGAGTGTGAGCGTAACCAACACCTGGTCCAGACCTCATG 
GAATTACATGAACGACAGCCTTCGCACCGACGTCTTCGTGCGGTTCCAGCCAGAGAGC 
ATCGCCTGTGCCTGCATTTATCTTGCTGCCCGGACGCTGGAGATCCCTTTGCCCAATC 
GTCCCCATTGGTTTCTTTTGTTTGGAGCAACTGAAGAAGAAATTCAGGAAATCTGCTT 
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AAAGATCTTGCAGCTTTATGCTCGGAAAAAGGTTGATCTCACACACCTGGAGGGTGAA 
GTGGAAAAAAGAAAGCACGCTATCGAAGAGGCAAAGGCCCAAGCCCGGGGCCTGTTGC 




GGAATCCCCCAAAGAAGGTAAAGGGAGCAAGCCTTCCCCACTGTCTGTGAAGAACACC 
AAGAGGAGGCTGGAGGGCGCCAAGAAAGCCAAGGCGGACAGCCCCGTGAACGGCTTGC 
CAAAGGGGCGAGAGAGTCGGAGTCGGAGCCGGAGCCGTGAGCAGAGCTACTCGAGGTC 
CCCATCCCGATCAGCGTCTCCTAAGAGGAGGAAAAGTGACAGCGGCTCCACATCTGGT 
GGGTCCAAGTCGCAGAGCCGCTCCCGGAGCAGGAGTGACTCCCCACCGAGACAGGCCC 
CCCGCAGCGCTCCCTACAAAGGCTCTGAGATTCGGGGCTCCCGGAAGTCCAAGGACTG 
CAAGTACCCCCAGAAGCCACACAAGTCTCGGAGCCGGAGTTCTTCCCGTTCTCGAAGC 
AGGTCACGGGAGCGGGCGGATAATCCGGGAAAATACAAGAAGAAAAGTCATTACTACA 
GAGATCAGCGACGAGAGCGCTCGAGGTCGTATGAACGCACAGGCCGTCGCTATGAGCG 
GGACCACCCTGGGCACAGCAGGCATCGGAGGTGACACGTGCTTCAGACCGGTCTGGGG 
TGCGGCGCACACCTGGGCCCGTGCAGGGCTCAGCTCGGCAGCAGCTCTGAGGGCAGCT 


CAATGAAAAAGTGAATGCACACGCCCTTGTTGGCGTG 




ORF Start: ATG at 44 


ORF Stop: TGA at 1598 




SEQ ID NO: 98 


518 aa IMW at 58034.5kD 


NOV28a, 

CG59348-01 Protein Sequence 


MTAAAAGAAGSAAPAAAAGAPGSGGAPSGSQGVLIGDRLYSGVLITLENCLLPDDKLR 
FTPSMSSGLDTDTETDLRWGCELIQAAGILLR1.PQVAMATGQVLFQRFFYTKSFVKH 
SMI3IVSMACVHLASKIEEAPRRIRDVINVFHRLRQLRDKKKPVPLLLDQDYVNLKNQI 
IKAERRVLKELGFCVHVKHPHKIIVMYLQVLECERNQHLVQTSWNYMNDSLRTDVFVR 
FQPESIACACIYLAARTLEIPLPNRPHWFLLFGATEEEIQEICLKILQLYARKKVDLT 
HLEGEVEKRKHAIEEAKAQARGLLPGGTQVLDGTSGFSPAPKLVESPKEGKGSKPSPL 
SVKNTKRRLEGAKKAKADSPVNGLPKGRESRSRSRSREQSYSRSPSRSASPKRRKSDS 
GSTSGGSKSQSRSRSRSDSPPRQAPRSAPYKGSEIRGSRKSKDCKYPQKPHKSRSRSS 
SRSRSRSRERADNPGKYKKKSHYYRDQRRERSRSYERTGRRYERDHPGHSRHRR 



Further analysis of the NOV28a protein yielded the following properties shown in 
Table 28B. 



Table 28B. Protein Sequence Properties NOV28a 


Psort 
analysis: 


0.5500 probability located in endoplasmic reticulum (membrane); 0.2400 
probability located in nucleus; 0.1900 probability located in lysosome 
(lumen); 0.1000 probability located in endoplasmic reticulum (lumen) 


SignalP 
analysis: 


No Known Signal Sequence Predicted 



A search of the NOV28a protein against the Geneseq database, a proprietary 
database that contains sequences published in patents and patent publication, yielded 
several homologous proteins shown in Table 28C. 



Table 28C. Geneseq Results for NOV28a 


Geneseq 
Identifier 


Protein/Organism/Length 
[Patent #, Date] 


NOV28a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Region 


Expect 
Value 


AAM94028 


Human stomach cancer expressed 
polypeptide SEQ ID NO 126 - 
Homo sapiens, 298 aa. 
[WO200109317-A1, 08-FEB- 
2001] 


221..518 
1..298 


298/298(100%) 
298/298(100%) 


e-172 
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AAG64403 


Human paneth cell enhanced 
expression-like protein - Homo 
sapiens, 298 aa. [WO200138372- 
A1,31-MAY-2001] 


221..518 
I. .298 


298/298(100%) 
298/298(100%) 


e-172 


AAB94641 


Human protein sequence SEQ ID 
NO: 15526 - Homo sapiens, 298 aa. 
[EP1074617-A2, 07-FEB-2001] 


221. .5 18 
L.298 


298/298(100%) 
298/298(100%) 


e-172 


AAM78533 


Human protein SEQ ID NO 1 195 - 
Homo sapiens, 526 aa. 
[WO200157190-A2, 09-AUG- 
2001] 


2..518 
8..526 


316/526 (60%) 
390/526 (74%) 


e-168 


AAB94371 


Human protein sequence SEQ ID 
NO: 14909 - Homo sapiens, 526 aa. 
[EP1074617-A2, 07-FEB-2001] 


2..518 
8..526 


316/526(60%) 
390/526 (74%) 


e-168 


In a BLAST search of public sequence databases, the NOV28a protein was found to 
have homology to the proteins shown in the BLASTP data in Table 28D. 


Table 28D. Public BLASTP Results for NOV28a 


Protein 
Accession 
Number 


Protein/Organism/Length 


NOV28a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Portion 


Expect 
Value 


Q96S94 


HYPOTHETICAL 58.1 KDA 
PROTEIN - Homo sapiens (Human), 
520 aa. 


3..518 
5..520 


516/516(100%) 
516/516(100%) 


0.0 


Q9JJA7 


BRAIN CDNA, CLONE MNCB- 
5160, SIMILAR TO MUS 
MUSCULUS PANETH CELL 
ENHANCED EXPRESSION PCEE 
MRNA - Mus musculus (Mouse), 
518 aa. 


1..518 
1..518 


466/519(89%) 
482/519(92%) 


0.0 


Q9UK58 


CYCLIN L ANIA-6A - Homo 
sapiens (Human), 526 aa. 


2..518 
8..526 


316/526 (60%) 
390/526 (74%) 


e-167 


Q9R1Q2 


CYCLIN ANIA-6A - Rattus 
norvegicus (Rat), 527 aa. 


2..518 
9..527 


312/526 (59%) 
391/526 (74%) 


e-165 


Q9WV44 


CYCLIN ANIA-6A - Mus musculus 
(Mouse), 531 aa. 


3..518 
15..531 


314/526 (59%) 
385/526 (72%) 


e-162 



PFam analysis predicts that the NOV28a protein contains the domains shown in the 
Table 28E. 
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Table 28E. Domain Analysis of NOV28a 


Pfam Domain 


NOV28a Match 
Region 


Identities/ 
Similarities 
for the Matched 
Region 


Expect 
Value 


cyclin: domain 1 of 1 


46..190 


28/163 (17%) 
86/163 (53%) 


0.0022 


Srg: domain 1 of 1 


221. .230 


4/10 (40%) 
10/10(100%) 


6.7 


transcript fac2: domain 1 
of 1 


235..253 


12/19 (63%) 
15/19(79%) 


0.86 


cyclinC: domain 1 of 1 


196-31 1 


22/139(16%) 
65/139 (47%) 


2.6 



Example 29. 



The NOV29 clone was analyzed, and the nucleotide and predicted polypeptide 
□ sequences are shown in Table 29A. 



Table 29 A. NOV29 Sequence Analysis 




SEQ ID NO: 99 


1069 bp 


NOV29a, 

CG59245-01 DNA Sequence 


CGGGGCCTGGTCGGCAGCTGGGCCGCCATGGAGTCCACGCTGGGCGCGGGCATCGTGA 


TAGCCGAGGCGCTACAGAACCAGCTAGCCTGGCTGGAGAACGTGTGGCTCTGGATCAC 
CTTTCTGGGCGATCCCAAGATCCTCTTTCTGTTCTACTTCCCCGCGGCCTACTACGCC 
TCCCGCCGTGTGGGCATCGCGGTGCTCTGGATCAGCCTCATCACCGAGTGGCTCAACC 
TCATCTTCAAGTGGTTTCTTTTTGGAGACAGGCCCTTTTGGTGGGTCCATGAGTCTGG 
TTACTACAGCCAGGCTCCAGCCCAGGTTCACCAGTTCCCCTCTTCTTGTGAGACTGGT 
CCAGGTGGCAGCCCTTCTGGACACTGCATGATCACAGGAGCAGCCCTCTGGCCCATAA 
TGACGGCCCTGTCTTCGCAGGTGCGCTGGGTAAGGGTGATGCCTAGCCTGGCTTATTG 
CACCTTCCTTTTGGCGGTTGGCTTGTCGCGAATCTTCATCTTAGCACATTTCCCTCAC 




GGGAGCTAAGCTTCTATGGGTTGACTGCACTGGCCCTCATGCTAGGCACCAGCCTCAT 
CTATTGGACCCTCTTTACACTGGGCCTGGATCTTTCTTGGTCCATCAGCCTAGCCTTC 

GCCGTGACTCAGGGGCTGCCCTGGGCCTGGGCATTGCCTTGCACTCTCCCTGCTATGC 






CTGGGCAGTGCACATGTTCAGTGCCCAGGAAGCACCGCCCATCCACTCTTCCTGACTT 
CTTGTGTGCCTCCCTTTCCTTTCCC 




ORF Start: ATG at 28 


ORF Stop: TGAat 1039 




SEQ ID NO: 100 


337 aa |MW at 37808.0kD 


NOV29a, 

CG59245-01 Protein Sequence 


MESTLGAGIVIAEALQNQLAWLENVWLWITFLGDPKILFLFYFPAAYYASRRVGIAVL 
WISLITEWLNLIFKWFLFGDRPFWWVHESGYYSQAPAQVHQFPSSCETGPGGSPSGHC 
MITGAALWPIMTALSSQVRWVRVMPSLAYCTFLLAVGLSRIFILAHFPHQVLAGLITG 
WLMTPRVPMERELSFYGLTALAIiMLGTSLIYWTLFTLGLDLSWSISLiAFKWCERPEWI 
HVDSRPFASLSRDSGAALGLGIALHSPCYAQVRRAQLGNGQKIACLVLAMGLLGPLDW 
LGHPPQISLFY IFNFLKYTLWPCLVLALVPWAVHMFSAQEAPP XHSS 




SEQ ID NO: 101 


1386 bp 


NOV29b, 

CG59245-02 DNA Sequence 


TGAGTCTGTACTTTCCGCCCTGGAGCAAGCCGGGGCCTGGTCGGCAGCTGGGCCGCCA 


TGGAGTCCACGCTGGGCGCGGGCATCGTGATAGCCGAGGCGCTACAGAACCAGCTAGC 
CTGGCTGGAGAACGTGTGGCTCTGGATCACCTTTCTGGGCGATCCCAAGATCCTCTTT 
CTGTTCTACTTCCCCGCGGCCTACTACGCCTCCCGCCGTGTGGGCATCGCGGTGCTCT 
GGATCAGCCTCATCACCGAGTGGCTCAACCTCATCTTCAAGTGGTTTCTTTTTGGAGA 
CAGGCCCTTTTGGTGGGTCCATGAGTCTGGTTACTACAGCCAGGCTCCAGCCCAGGTT 
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TCACAGGAGCAGCCCTCTGGCCCATAATGACGGCCCTGTCTTCGCAGGTGGCCACTCG 

GCGGTTGGCTTGTCGCGAATCTTCATCTTAGCACATTTCCCTCACCAGGTGCTGGCTG 
GCCTAATAACTGGCGCTGTCCTGGGCTGGCTGATGACTCCCCGAGTGCCTATGGAGCG 
GGAGCTAAGCTTCTATGGGTTGACTGCACTGGCCCTCATGCTAGGCACCAGCCTCATC 
TArTGGACCCTCTTTACACTGGGCCTGGATCTTTCTTGGTCCATCAGCCTAGCCTTCA 
AGTGGTGTGAGCGGCCTGAGTGGATACACGTGGATAGCCGGCCCTTTGCCTCCCTGAG 
CCGTGACTCAGGGGCTGCCCTGGGCCTGGGCATTGCCTTGCACTCTCCCTGCTATGCC 
CAGGTGCGTCGGGCACAGCTGGGAAATGGCCAGAAGATAGCCTGCCTTGTGCTGGCCA 
TGGGGCTGCTGGGCCCCCTGGACTGGCTGGGCCACCCCCCTCAGATCAGCCTCTTCTA 
CATTTTCAATTTCCTCAAGTACACCCTCTGGCCATGCCCAGTCCTGGCCCTCGTGCCC 
TGGGCAGTGCACATGTTCAGTGCCCAGGAAGCACCGCCCATCCACTCTTCCTGACTTC 
TTGTGTGCCTCCCTTTCCTTTCCCTCCCACAAAGCCAACACTCTGTGACCACCACACT 


CCAGGAGGCAGCCCCATCCCCTTCCAGCCCCTAAGTAGGCCCTCCCCTCCCTAAATCr 


GCTTCCGCACCACCTGGTCTTAGCCCCAAAGATGGGCCTTCTCTCTCCCAGATAAGTT 


GGTCCTCCCTCTGCCTTTCCTCTCAAGCCCCCAAAGAGCAAAGGCAACAGCAAGACCA 






ORF Start: ATG at 58 


ORF Stop: TGA at 1 096 




SEQIDNO: 102 


346 aa ImW at 38718.0kD 


NOV29b, 

CG59245-02 Protein Sequence 


MESTLGAGIVIAEALQNQIAWLENVWLWITFLGDPKILFLFYFPAAYYASRRVGIAVL 
WI SLI TEWLNLI FKWFLFGDRP FWWVHESGYYSQAPAQVHQFP SSCE TGPGSPSGHCM 
ITGARLWPIMTALSSQVATRARSRWVRVMPSLAYCTFLLAVGLSRI F ILAHFPHQVLA 
GLITGAVLGWLMTPRVPMERELSFYGLTALALMLGTSLIYWTLFTLGLDLSWSISLAF 
KWCERPEWIHVDSRPFASLSRDSGAALGLGIALHSPCYAQVRRAQLGNGQKIACLVLA 
MGLLGPLDWLGHPPQISLFYIFNFLKYTLWPCPVIALVPWAVHMFSAQEAPPIHSS 



Sequence comparison of the above protein sequences yields the following sequence 
relationships shown in Table 29B. 



Table 29B. Comparison of NOV29a against NOV29b. 


Protein Sequence 


NOV29a Residues/ 
Match Residues 


Identities/ 
Similarities for the Matched Region 


NOV29b 


1..337 
1..346 


335/347 (96%) 
335/347 (96%) 



Further analysis of the NOV29a protein yielded the following properties shown in 
Table 29C. 



Table 29C. Protein Sequence Properties NOV29a 


PSort 
analysis: 


0.6850 probability located in endoplasmic reticulum (membrane); 0.6400 
probability located in plasma membrane; 0.4600 probability located in Golgi 
body; 0.1000 probability located in endoplasmic reticulum (lumen) 


SignalP 
analysis: 


Likely cleavage site between residues 41 and 42 



A search of the NOV29a protein against the Geneseq database, a proprietary 
database that contains sequences published in patents and patent publication, yielded 
several homologous proteins shown in Table 29D. 
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Table 29D. Geneseq Results for NOV29a 


Geneseq 
Identifier 


Protein/Organism/Length 
[Patent #, Date] 


NOV29a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Region 


Expect 
Value 


AAM79500 


Human protein SEQ ID NO 3146 - 
Homo sapiens, 382 aa. 
[WO200157190-A2, 09-AUG- 
2001] 


1..337 
37.382 


336/347 (96%) 
336/347 (96%) 


0.0 


AAB42637 


Human ORFX ORF2401 
polypeptide sequence SEQ ID 
NO:4802 - Homo sapiens, 377 aa. 
[WO200058473-A2, 05-OCT- 
2000] 


1..337 
31. .377 


328/348 (94%) 
328/348 (94%) 


0.0 


AAB85355 


Human phosphatase (PP) (clone ID 
1269556CD1) - Homo sapiens, 385 
aa. [WO2001 53469- A2, 26-JUL- 
2001] 


1..305 
1.314 


297/315 (94%) 
298/315 (94%) 


e-174 


AAM78516 


Human protein SEQ ID NO 1 178 - 
Homo sapiens, 404 aa. 
[WO200157190-A2, 09-AUG- 
2001] 


1..337 
125..404 


266/341 (78%) 
272/341 (79%) 


e-146 


AAB25679 


Human secreted protein sequence 
encoded by gene 15 SEQ ID 
NO: 68 - Homo sapiens, 141 aa. 
[WO200043495-A2, 27-JUL-2000] 


198..337 
1..140 


140/140 (100%) 
140/140(100%) 


6e-81 



In a BLAST search of public sequence databases, the NOV29a protein was found to 
have homology to the proteins shown in the BLASTP data in Table 29E. 



Table 29E. Public BLASTP Results for NOV29a 


Protein 
Accession 
Number 


Protein/Organism/Length 


NOV29a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Portion 


Expect 
Value 


AAH21574 


HYPOTHETICAL 38.7 KDA 
PROTEIN - Homo sapiens 
(Human), 346 aa. 


1..337 
1..346 


336/347 (96%) 
336/347 (96%) 


0.0 


Q9BUM1 


HYPOTHETICAL 40.1 KDA 
PROTEIN - Homo sapiens 
(Human), 360 aa (fragment). 


1..337 
15..360 


336/347 (96%) 
336/347 (96%) 


0.0 
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042153 


Glucose-6-phosphatase (EC 
3.1 .3.9) (G6Pase) (G-6-Pase) - 
Haplochromis nubilus, 352 aa. 


8..323 
8..339 


127/333 (38%) 
184/333 (55%) 


le-59 


Q98UF8 


GLUCOSE-6-PHOSPHATASE - 
Sparus aurata (Gilthead sea 
bream), 350 aa. 


8..323 
8..337 


123/333 (36%) 
185/333 (54%) 


2e-57 


Q9Z186 


GLUCOSE-6-PHOSPHATASE - 
Mus musculus (Mouse), 355 aa. 


7..325 
7..345 


128/343 (37%) 
188/343 (54%) 


5e-56 



PFam analysis predicts that the NOV29a protein contains the domains shown in the 
Table 29F. 



Table 29F. Domain Analysis of NOV29a 


Pfam Domain 


NOV29a Match Region 


Identities/ 
Similarities 
for the Matched Region 


Expect Value 


PAP2: domain 1 of 1 


51..190 


38/175 (22%) 
95/175 (54%) 


0.00037 



Example 30. 



The NOV30 clone was analyzed, and the nucleotide and predicted polypeptide 
sequences are shown in Table 3 OA. 



Table 30A. NOV30 Sequence Analysis 




SEQ ID NO: 103 |l624 bp 


NOV30a, 

CG59241-01 DNA Sequence 


ATGGAACTGAAGGCCGAGGAGGAGGAGGTGGGTGGCGTCCAGCCGGTGGACTTGGTGG 
CCTTTGCCAACAGCTGCACCCTCCATGGCACCAACCACATTTTTGTGGAGGGGGGTCC 


TGCCAGGTAGGGGACCGCGTTGCTTATTACCTCAGCrACCCACACGTGACCCTTCTAA 
ACGAAGTGGCCACCACGGAGCTGGCCTTCCCGGCAGTCACCCTCTGCAACACTAATGC 
TGTGCGGCTGTCCCAGCTCAGCTACCCTGACTTGCTTTATTTGGCCCCCATGCTGGGA 

CTGGGGAGCCCTTTAACCTGCACCGCTTCTACAATCGCTCCTGCCACCGGCTGGAGGA 
CATGCTGCTCTATTGCTCCrACCAAGGGGGACCCTGCGGCCCTCACAACTTCTCAGTG 
GTGTTCACACGCTATGGAAAGTGCTACACGTTCAACTCGGGCCGAGATGGGCGGCCGC 
GGCTGAAGACCATGAAGGGTGGGACGGGCAATGGGCTGGAAATCATGCrGGACATCCA 
GCAGGACGAGTACCTGCCTGTGTGGGGGGAGACTGACGAGACGTCCTTCGAAGCAGGC 
ATCAAAGTGCAGATCCATAGTCAGGATGAACCTCCTTTCATCGACCAGCTGGGCTTTG 
GCGTGGCCCCAGGCTTCCAGACCTTTGTGGCCTGCCAGGAGCAGCGGATCTACCTGCC 
CCCACCCTGGGGCACCTGCAAAGCTGTTACCATGGACTCGGATTTCTTCGACTCCTAC 
AGCATCACTGCCTGCCGCATCGACTGTGAGACGCGCTACCTGGTGGAGAACTGCAACT 
GCCGCATGGTGCACATGCCAGGTGATGCCCCATACTGTACTCCAGAGCAGTACAAGGA 

GAAATGCCTTGCAACCTGACCCGCTATGGCAAAGAGCTGTCCATGGTCAAGATCCCCA 
GCAAAGCCTCAGCCAAGTACCTGGCCAAGAAGTTCAACAAATCTGAGCAATACATAGG 
GGAGAACATCCTGGTGCTGGACATTTTCTTTGAAGTCCTCAACTATGAGACCATTGAA 
CAGAAGAAGGCCTATGAGATTGCAGGGCTCCTGGGTGACATCGGGGGCCAGATGGGGC 
TGTTCATCGGGGCCAGCATCCTCACGGTGCTGGAGCTCTTTGACTACGCCTACGAGGT 
AGTCATTAAGCACAAGCTGTGCCGACGAGGAAAATGCCAGAAGGAGGCCAAAAGGAGC 
AGTGCGGACAAGGGCGTGGCCCTCAGCCTGGACGACGTCAAAAGACACAACCCGTGCG 
AGAGCCTTCGGGGCCACCCTGCCGGGATGACATACGCTGCCAACATCCTACCTCACCA 
TCCGGCCCGAGGCACGTTCGAGGACTTTACCTGCTGAGCCCCGCAGGCCGCTGAACCA 
AAGGCCTAGATGC 5 T \G? AGGG ZCC - T '"CTCCTCACATC 
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ORF Start: ATG at 1 


ORF Stop: TGA at 1543 




SEQ ID NO: 104 


514 aa |m\V at 57221. 7kD 


NOV30a, 

CG59241-01 Protein Sequence 


MELKAEEEEVGGVQPVDLVAFANSCTLHGTNHIFVEGGPGPRQVLWAVAFVLALGAFL 
CQVGDRVAYYLSYPHVTLI^EVATTEIAFPAVTLCNTNAVRLSQLSYPDLLYIAPMI^ 
LDESDDPGVPLAPPGPEAFSGEPFNLHRFYNRSCHRLEDMLLYCSYQGGPCGPHNFSV 
V FTRYGKCYTFNSGRDGRPRLKTMKGGTGNGLE I MUD I QQDEYLPVWGETDETSFEAG 
I KVQIHSQDEPPF IDQLGFGVAPGFQTFVACQEQRI YLPPPWGTCKAVTMDSDFFDSY 
SITACRIDCETRYLVENCNCRMVHMPGDAPYCTPEQYKECADPALDFLVEKDQEYCVC 
EMPCNLTRYGKELSMVKIPSKASAKYLAKKFNKSEQYIGENILVLDIFFEVLNYETIE 
QKKAYEIAGLLGDIGGQMGLFIGASILTVLELFDYAYEWIKHKLCRRGKCQKEAKRS 
AT ;GVALSLDD KRHNPCESLRGHPAGMTYAANILPHHPARGTFEDFTC 



Further analysis of the NOV30a protein yielded the following properties shown in 
Table 30B. 



Table 30B. Protein Sequence Properties NOV30a 


PSort 
analysis: 


0.7900 probability located in plasma membrane; 0.3000 probability located in 
Golgi body; 0.2000 probability located in endoplasmic reticulum (membrane); 
0.1000 probability located in mitochondrial inner membrane 


SignalP 
analysis: 


Likely cleavage site between residues 60 and 61 



A search of the NOV30a protein against the Geneseq database, a proprietary 
database that contains sequences published in patents and patent publication, yielded 
several homologous proteins shown in Table 30C. 



Table 30C. Geneseq Results for NOV30a 


Geneseq 
Identifier 


Protein/Organism/Length 
{Patent #, Date] 


NOV30a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Region 


Expect 
Value 


AAY69178 


A rat acid-sensitive cationic 
channel IB (rASIClB) -Rattus sp, 
559 aa. [WO200008149-A2, 17- 
FEB-2000] 


1..514 

47..559 


488/515(94%) 
497/515(95%) 


0.0 


AAY03186 


Rat Acid sensitive ion channel 
protein sequence - Rattus sp, 513 
aa. [W09911784-A1, 11-MAR- 
1999] 


1..514 
1..513 


488/515 (94%) 
498/515(95%) 


0.0 


AAW68507 


Rat acid sensing ionic channel IB - 
Rattus sp, 559 aa. [WO9835034- 
Al, 13-AUG-1998] 


1..514 
47..559 


488/515 (94%) 
497/515 (95%) 


0.0 


AAY69175 


A rat acid-sensitive cationic 


1-514 
1..526 


416/527(78%) 
445/527 (83%) 


0.0 
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526 aa. [WO200008149-A2, 17- 
FEB-2000] 








AAY03188 


Rat Acid sensitive ion channel 
alpha protein sequence - Rattus sp, 
526 aa. [W09911784-A1, 11- 
MAR-1999] 


1..514 
1..526 


416/527 (78%) 
445/527 (83%) 


0.0 



In a BLAST search of public sequence databases, the NOV30a protein was found to 
have homology to the proteins shown in the BLASTP data in Table 30D. 



Table 30D. Public BLASTP Results for NOV30a 


Protein 
Accession 
Number 


Protein/Organism/Length 


NOV30a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Portion 


Expect 
Value 


Q91YB8 


ION CHANNEL - Rattus norvegicus 
(Rat), 559 aa. 


1-514 
47..559 


489/515(94%) 
498/515 (95%) 


0.0 


088762 


ASIC-BETA - Rattus norvegicus 
(Rat), 513 aa. 


L.514 
1..513 


488/515 (94%) 
498/515 (95%) 


0.0 


P55926 


Amiloride-sensitive brain sodium 
channel BNaC2 (Amiloride-sensitive 
cation channel neuronal 2) (Proton 
gated cation channel ASIC1) - Rattus 
norvegicus (Rat), 526 aa. 


1..514 
1..526 


416/527 (78%) 
445/527 (83%) 


0.0 


P78348 


Amiloride-sensitive brain sodium 
channel BNaC2 (Amiloride-sensitive 
cation channel neuronal 2) - Homo 
sapiens (Human), 574 aa. 


1..514 
1..574 


421/575 (73%) 
447/575 (77%) 


0.0 


Q99NA1 


PROTON-GATED CATION 
CHANNEL SUBUNIT ASIC- 
BETA2 - Rattus norvegicus (Rat), 
425 aa. 


175..514 
86..425 


334/341 (97%) 
337/341 (97%) 


0.0 



PFam analysis predicts that the NOV30a protein contains the domains shown in the 
Table 30E. 



194 



Table 30E. Domain Analysis of NOV30a 


Pfam Domain 


NOV30a Match Region 


Identities/ 
Similarities 
for the Matched Region 


Expect Value 


ASC: domain 1 of 2 


21..118 


34/106 (32%) 
79/106 (75%) 


1.6e-29 


ASC: domain 2 of 2 


145..442 


133/351 (38%) 
281/351 (80%) 


2.1e-139 



The NOV31 clone was analyzed, and the nucleotide and predicted polypeptide 
sequences are shown in Table 31 A. 
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Table 31 A. NOV31 Sequence Analysis 




SEQ ID NO: 105 1 1949 bp 


NOV31a, 

CG58602-01 DNA Sequence 


TGCCTGGCTATSGCCCGACTGCTCAGGTCTGCAACCTGGflAnrTRTTrrrrTnnannr; 
GCTACTGCTCCCAGTCCCTGCAGGGAGAGCTCTGCAGGGACTTCGTAGAGGCTCTGAA 
GGCCGTGGTGGGCGGCTCCCACGTGTCCACTGCCGCGGTGGTCCGAGAGCAGCACGGG 
CGCGATGAGTCGGTGCACAGGTGCGAACCTCCTGATGCTGTGGTGTGGCCCCAGAACG 
TGGAGCAGGTCAGCCGGCTGGCAGCCCTGTGCTATCGCCAAGGTGTGCCCATCATCCC 
ATTCGGCACCGGCACCGGGCTTGAGGGTGGCGTCTGTGCTGTGCAGGGCGGCGTCTGC 
GTTAACCTGACGCATATGGACCGAATCCTGGAGCTGAACCAGGAGGACTTCTCTGTGG 
TGGTGGAGCCAGGTGTCACCCGCAAAGCCCTCAACGCCCACCTGCGGGACAGCGGCCT 
CTGGTTTCCTCCAGACCCAGGCGCGGACGCCTCTCTCTGTGGCATGGCGGCCACCGGG 
GCGTCGGGGACCAACGCGGTCCGCTACGGCACCATGCGGGACAACGTGCTCAACCTGG 




TTCCACTGCTGCATTCCCCCACATCAGCCCCACTGAGTGCTTTTCCCAGGGGCCAGGG 
CCTCATGTCAATTCTCCTCACCCTGCCCCTGAGGCCACAGTGGCCGCCACGTGTGCGT 
TCCCCAGTGTCCAGGCTGCTGTGGACAGCACTGTACACATCCTCCAGGCTGCAGTGCC 
CGTAGCCCGCATTGAGTTCCTGGATGAAGTCATGATGGATGCCTGCAACAGGTACAGC 
AAGCTGAATTGCTTAGTGGCGCCCACACrCTTCCTGGAGTTCCATGGCTCCCAGCAGG 
CACTGGAGGAGCAGCTGCAGCGCACAGAGGAGATAGrCCAGCAGAACGGAGCCTCTGA 
CTTCTCCTGGGCCAAGGAGGCCGAGGAGCGCAGCCGGCTTTGGACAGCACGGCACAAT 
GCCTGGTACGCAGCCCTGGCCACGCGGCCAGGCTGCAAGGGCTACTCCACGGATGTGT 
GTGTGCCCATCTCCCGGCTGCCGGAGATCGTGGTGCAGACCAAGGAGGATCTGAATGC 
CTCAGGACTCACAGGAAGCATTGTCGGGCATGTGGGTGACGGCAACTTCCACTGCATC 
CTGCTGGTCAACCCTGATGACGCCGAGGAACTGGGCAGGGTCAAGGCTTTTGCAGAAC 
AGCTGGGCAGGCGGGCACTGGCTCTCCACGGAACGTGCACGGGGGAGCATGGCATCGG 
AATGGGCAAGCGGCAGCTGCTGCAGGAGGAGGTGGGCGCCGTGGGCGTGGAGACCATG 
CGGCAGCTCAAGGCCGTGCTAGACCCCCAAGGCCTCATGAATCCAGGCAAAGTGCTGT 
GAAGGGGGTCTGAGCACTTAGCCCACAAGTTCCCTGACTACGGAGCCGGTTCTGGAAC 


TTTTCTTCATGCCACGGCCCCTGCAAGGAAATAGATGCTGAGGCAGTCTTCCTGCCAG 


CGAGCCCACTGTATCTGGGCCCAAGGCCAGAGGGCCCAGAGAGAAGCCTGAGCACCGT 


GTTACCTCCCTGGCCCTCTGGCTGGCCCCAGGAGCCTTTGGTTCAGTAAACGACCCAG 


GGTGGTTCCCAGCAAAGCTGCTTCCTCTCTGCTCCTACGCATCCTGTCCTGGCGGGAA 


GAGAGCGTCTGGGTCCATTrA, 2TCTGATGACAC FCC AGGl C -c 


CCGGGGTCCCAGGACCCTTCCCCCTTCACCTGGTGACAGGAACACTCCTTTCCTGGTA 


TGGAACGTGAGCTCCCGTGACATGATGATAGGTCTTCTCCTTGGGGCCTCCCCCAATA 


»AT TGTA iAP rGA, \C CAC TACt ;tA£ 




ORF Start: ATG at 10 


ORF Stop:TGAat 1450 




SEQ ID NO: 106 


480 aa ImW at 5 1629.1 kD 


NOV31a, 

CG58602-01 Protein Sequence 


MARLLRSATWELFPWRGYCSQSLQGELCRDFVEALKAWGGSHVSTAAWREQHGRDE 
SVHRCEPPDAWWPQNVEQVSRLAALCYRQGVPlrPFGTGTGLEGGVCAVQGGVCVNL 
THMDRILELNQEDFSVWEPGVTRKALNAHLRDSGLWFPPDPGADASLCGMAATGASG 
TNAVRYGTMRDNVLNLEWLPDGRLLHTAGRGLITDSTAAFPHISPTECFSQGPGPHV 
NSPHPAPEATVAATCAFPSVQAAVDSTVHILQAAVPVARIEFLDEVMMDACNRYSKLN 
CLVAPTLFLEFHGSQQALEEQLQRTEEIVQQNGASDFSWAKEAEERSRLWTARHNAWY 
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AAIATRPGCKGYSTDVCVPISRLPEIWQTKEDLNASGLTGSIVGHVGDGNFHCILLV 
NPDDAEELGRVKAFAEQLGRRAIALHGT CTGEHG I GMGKRQLLQE E VGAVGVETMRQL 
KAVLDPQGLMNPGKVL 

Further analysis of the NOV31a protein yielded the following properties shown in 
Table 3 IB. 



Table 31B. Protein Sequence Properties NOV31a 


PSort 
analysis: 


0.6574 probability located in mitochondrial matrix space; 0.3502 probability 
located in mitochondrial inner membrane; 0.3502 probability located in 
mitochondrial intermembrane space; 0.3502 probability located in 
mitochondrial outer membrane 


SignalP 
analysis: 


Likely cleavage site between residues 20 and 21 



A search of the NOV3 la protein against the Geneseq database, a proprietary 
database that contains sequences published in patents and patent publication, yielded 
several homologous proteins shown in Table 31C. 



Table 31C. Geneseq Results for NOV31a 


Geneseq 
Identifier 


Protein/Organism/Length [Patent 
#, Date] 


NOV31a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Region 


Expect 
Value 


ABB 10446 


Human cDNA SEQ ID NO: 754 - 
Homo sapiens, 115 aa. 
[WO200154474-A2, 02-AUG-2001] 


1..96 
15..110 


91/96 (94%) 
92/96 (95%) 


8e-49 


AAE09597 


Human gene 5 encoded novel 
protein HDPMT22, SEQ ID NO:33 - 
Homo sapiens, 1 15 aa. 
[WO20015531 1-A2, 02-AUG-2001] 


1..96 
15..110 


91/96 (94%) 
92/96 (95%) 


8e-49 


AAM52368 


GIP12-C4 protein - Arabidopsis 
thaliana, 159 aa. [FR2806095-A1, 
14-SEP-2001] 


66..203 
3..140 


69/138(50%) 
98/138 (71%) 


9e-34 


AAG92286 


C glutamicum protein fragment SEQ 
ID NO: 6040 - Corynebacterium 
glutamicum, 948 aa. [EP1 108790- 
A2, 20-JUN-2001] 


46..477 
25..502 


108/486(22%) 
186/486 (38%) 


2e-22 


AAB79309 


Corynebacterium glutamicum SMP 
protein sequence SEQ ID NO: 134 - 
Corynebacterium glutamicum, 945 
aa. [WO200100844-A2, 04-JAN- 
2001] 


46..477 
22..499 


108/486(22%) 
186/486 (38%) 


2e-22 
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In a BLAST search of public sequence databases, the NOV3 la protein was found to 
have homology to the proteins shown in the BLASTP data in Table 3 ID. 



Table 31D. Public BLASTP Results for NOV31a 


Protein 
Accession 
Number 


Protein/Organism/Length 


NOV31a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Portion 


Value 


Q9D635 


473340 1P21RIK PROTEIN - Mus 
musculus (Mouse), 481 aa. 


1..480 
1..481 


394/483 (81%) 
423/483 (87%) 


0.0 


Q 19965 


F32D8.4 PROTEIN - Caenorhabditis 
elegans, 912 aa. 


20..480 
445..909 


221/466(47%) 
307/466 (65%) 


e-121 


CAD16371 


PUTATIVE D-LACTATE 
DEHYDROGENASE 
(CYTOCHROME) 
OXIDOREDUCTASE PROTEIN 
(EC 1.1.2.4) -Ralstonia 
solanacearum (Pseudomonas 
solanacearum), 472 aa. 


32..479 
20..469 


226/454 (49%) 
300/454 (65%) 


e-119 


A89201 


protein F32D8.4 [imported] - 
Caenorhabditis elegans, 870 aa. 


30..480 
399..867 


214/469(45%) 
296/469 (62%) 


e-115 


AAL51780 


D-LACTATE DEHYDROGENASE 
(CYTOCHROME) (EC 1.1.2.4) - 
Brucella melitensis, 468 aa. 


41. .480 
28..467 


209/444 (47%) 
286/444 (64%) 


e-114 



PFam analysis predicts that the NOV3 la protein contains the domains shown in the 
Table 3 IE. 



Table 31 E. Domain Analysis of NOV31a 


Pfam Domain 


NOV31a Match 
Region 


Identities/ 
Similarities 
for the Matched 
Region 


Expect 
Value 


FAD binding 4: domain 1 
of 1 


33..2I4 


70/208 (34%) 
154/208 (74%) 


3.7e-56 


FAD-oxidase C: domain 1 
of 1 


206..479 


91/307 (30%) 
210/307 (68%) 


1.3e-58 



Example 32. 



The NOV32 clone was analyzed, and the nucleotide and predicted polypeptide 
sequences are shown in Table 32A. 



Table 32A. NOV32 Sequence Analysis 




SEQ ID NO: 1 07 |698 bp 


NUViza, 

CG58468-01 DNA Sequence 


CTCCTTCCTGTGCTCTTTATATGGACCAACAACTTCTCGTCCTTORORTrTrTRTRPa 
AATATCAATTTTTTCACArTATCTTTTCTCCACAGACATGAGAGGGAAGGCATTTATT 
TTCCCTCAAGAATCAGCTACAGTCTATGTGTCCCTGATCCCCAAGGTGAAGAAGCCCC 
TGAAGAACTTCAAGCTTTGCCTGAAAACCTTCACAGACTTCACCTGCCCTTATAGCCT 
CTTCTACAGCACTCGGTCCCAGGACAATGAGCTGCTTCTCCTTGTCAACAAAATGGGA 
ATGTATCTGCTGCACATTGGAAATGCTGCGGTCACTTTCAATGGCCCCACCCCCTGCC 
CTCGATCTCCTTATGCTTCGACCCATGTCAATGTGAGCTGGGAGTCTGCCTCTGGAAT 
TGCTACACTCTGGGCAAATGGGAAGCTGGTGGGGAGGAAGGGTGTGTGGAAGGGGTAC 
TCTGTGGGAGAAGAGGCTAAGATCATCCTGGGACAAGAGCAGGATTCCTTTGGGGGAC 
ATTTTGATGAAAATCAATCCTTTGTTGGGGTGATATGGGATGTGTTTTTGTGGGATCA 
TGTGCTCCCTCCAAAGGAGATGTGTGACTCCTGTTACAGCGGCAGCCTCCTGAATCGG 
CATACCCTGACTTATGAAGATAATGGCTATGTGGTAACTAAGCCCAAGGTGTGGGCTT 
AA 




ORF Start: ATG at 21 


ORF Stop: TAA at 696 




SEQ ID NO: 108 


225 aa |MW at 25265.8kD 


NOV32a, 

CG5 8468-01 Protein Sequence 


MDQQLLVLGVSVQISIFSHYLFSTDMRGKAFIFPQESATVYVSLIPKVKKPLKNFKLC 
LKTFTDFTCPYSLFYSTRSQDKELLLLVNKMGMYLLHIGNAAVTFNGPTPCPRSPYAS 
THVNVSWESASGIATLWANGKLVGRKGVWKGYSVGEEAKIILGQEQDSFGGHFDENQS 
FVGVIWDVFLWDHVLPPKEMCDSCYSGSLLNRHTLTYEDNGYWTKPKVWA 



Further analysis of the NOV32a protein yielded the following properties shown in 
Table 32B. 



Table 32B. Protein Sequence Properties NOV32a 


PSort 
analysis: 


0.5500 probability located in endoplasmic reticulum (membrane); 0.3200 
probability located in microbody (peroxisome); 0.2368 probability located in 
lysosome (lumen); 0.1000 probability located in endoplasmic reticulum 
(lumen) 


SignalP 
analysis: 


No Known Signal Sequence Predicted 



A search of the NOV32a protein against the Geneseq database, a proprietary 
database that contains sequences published in patents and patent publication, yielded 
several homologous proteins shown in Table 32C. 



Table 32C. Geneseq Results for NOV32a 


Geneseq 
Identifier 


Protein/Organism/Length {Patent 
#,Date] 


NOV32a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Region 


Expect 
Value 


AAR74763 


Sermun amyloid P component, 
promoter sapm - Homo sapiens, 
204 aa. [WO9505394-A, 23-FEB- 
1995] 


24..224 
2..203 


98/207 (47%) 
136/207 (65%) 


4e-48 
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AAR29923 


SAP - Homo sapiens, 223 aa. 
[W09221364-A, 10-DEC-1992] 


7..224 
5..222 


101/224(45%) 
143/224 (63%) 


3e-47 


AAR29922 


CRP - Homo sapiens, 225 aa. 


14..224 
1 1..224 


100/218(45%) 
132/218 (59%) 


2e-43 


AAR74769 


Female hamster protein, Ifhp - 
Cricetus cricetus, 210 aa. 
[WO9505394-A, 23-FEB-1995] 


24..222 
1.-199 


95/206 (46%) 
132/206 (63%) 


6e-43 


AAY76844 


Human C reactive protein (CRP) 
sequence - Homo sapiens, 206 aa. 
[JP2000014388-A, 18-JAN-2000] 


24..224 
2..205 


98/208 (47%) 
128/208 (61%) 


le-42 



In a BLAST search of public sequence databases, the NOV32a protein was found to 
have homology to the proteins shown in the BLASTP data in Table 32D. 



Table 32D. Public BLASTP Results for NOV32a 


Protein 
Accession 
Number 


Protein/Organism/Length 


NOV32a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Portion 


Expect 
Value 


Q9D8J8 


1 8 1 0030J 1 4RIK PROTEIN - Mus 
musculus (Mouse), 219 aa. 


6..224 
4..218 


130/220 (59%) 
166/220 (75%) 


5e-72 


Q9D8V2 


1 8 1 0030J 1 4RIK PROTEIN - Mus 
musculus (Mouse), 200 aa. 


6..190 
20..200 


110/186 (59%) 
139/186 (74%) 


2e-58 


Q63913 


SERUM AMYLOID P - Cricetulus 
migratorius (Armenian hamster), 
223 aa. 


1..224 
1..222 


109/231 (47%) 
152/231 (65%) 


4e-51 


P23680 


Serum amyloid P-component 
precursor (SAP) - Rattus norvegicus 
(Rat), 228 aa. 


6..224 
4..223 


105/224(46%) 
145/224 (63%) 


7e-50 


PI 5697 


Female protein precursor (FP) 
(Serum amyloid P-component) - 
Cricetulus migratorius (Armenian 
hamster), 23 1 aa. 


1..222 
1..220 


108/229 (47%) 
151/229 (65%) 


7e-50 



PFam analysis predicts that the NOV32a protein contains the domains shown in the 
Table 32E. 
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Table 32E. Domain Analysis of NOV32a 


Pfam Domain 


NOV32a Match 
Region 


Identities/ 
Similarities 
for the Matched 
Region 


Expect 
Value 


pentaxin: domain 1 of 
1 


29..221 


103/214 (48%) 
156/214(73%) 


8e-76 



Example 33. 

The NOV33 clone was analyzed, and the nucleotide and predicted polypeptide 
sequences are shown in Table 33 A. 



Table 33A. NOV33 Sequence Analysis 




SEQIDNO:109 [3350 bp 


NOV33a, 

CG5 8 183-01 DNA Sequence 


TAATGAGGAGACTGAGTTTGTGGTGGCTGCTGAGCAGGGTCTGTCTGCTGTTGCCGCC 
GCCCTGCGCACTGGTGCTGGCCGGGGTGCCCAGCTCCTCCTCGCACCCGCAGCCCTGC 
CAGATCCTCAAGCGCATCGGGCACGCGGTGAGGGTGGGCGCGGTGCACTTGCAGCCCT 
GGACCACCGCCCCCCGCGCGGCCAGCCGCGCTCCGGACGACAGCCGAGCAGGAGCCCA 
GAGGGATGAGCCGGAGCCAGGGACTAGGCGGTCCCCGGCGCCCTCGCCGGGCGCACGC 
TGGTTGGGGAGCACCCTGCATGGCCGGGGGCCGCCGGGCTCCCGTAAGCCCGGGGAGG 
GCGCCAGGGCGGAGGCCCTGTGGCCACGGGACGCCCTCCTATTTGCCGTGGACAACCT 
GAACCGCGTGGAAGGGCTGCTACCCTACAACCTGTCTTTGGAAGTAGTGATGGCCATC 
GAGGCAGGCCTGGGCGATCTGCCACTTTTGCCCTTCTCCTCCCCTAGTTCGCCATGGA 
GCAGTGACCCTTTCTCCTTCCTGCAAAGTGTGTGCCATACCGTGGTGGTGCAAGGGGT 
GTCGGCGCTGCTCGCCTTCCCCCAGAGCCAGGGCGAAATGATGGAGCTCGACTTGGTC 
AGCTTAGTCCTGCACATTCCAGTGATCAGCATCGTGCGCCACGAGTTTCCACGGGAGA 
GTCAGAATCCCCTTCACCTACAACTGAGTTTAGAAAATTCATTAAGTTCTGATGCTGA 
TGTCACTGTCTCAATCCTGACCATGAACAACTGGTACAATTTTAGCTTGTTGCTGTGC 
CAGGAAGACTGGAACATCACCGACTTCCTCCTCCTTACCCAGAATAATTCCAAGTTCC 
ACCTTGGTTCTATCATCAACATCACCGCTAACCTCCCCTCCACCCAGGACCTCTTGAG 
CTTCCTACAGATCCAGCTTGAGAGTATTAAGAACAGCACACCCACAGTGGTGATGTTT 
GGCTGCGACATGGAAAGTATCCGGCGGATTTTCGAAATTACAACCCAGTTTGGGGTCA 
TGCCCCCTGAACTTCGTTGGGTGCTGGGAGATTCCCAGAATGTGGAGGAACTGAGGAC 
AGAGGGTCTGCCCTTAGGGCTCATTGCTCATGGAAAAACAACACAGTCTGTCTTTGAG 
CACTACGTACAAGATGCTATGGAGCTGGTCGCAAGAGCTGTAGCCACAGCCACCATGA 
TCCAACCAGAACTTGCTCTCATTCCCAGCACGATGAACTGCATGGAGGTGGAAACrAC 
AAATCTCACTTCAGGACAATATTTATCAAGGTTTCTAGCCAATACCACTTTCAGAGGC 
CTCAGTGGTTCCATCAGAGTAAAAGGTTCCACCATCGTCAGCTCAGAAAACAACTTTT 
TCATCTGGAATCTTCAACATGACCCCATGGGAAAGCCAATGTGGACCCGCTTGGGCAG 
CTGGCAGGGGGGAAAGATTGTCATGGACTATGGAATATGGCCAGAGCAGGCCCAGAGA 
CACAAAACCCACTTCCAACATCCAAGTAAGCTACACTTGAGAGTGGTTACCCTGATTG 
AGCATCCTTTTGTCTTCACAAGGGAGGTAGATGATGAAGGCTTGTGCCCTGCTGGCCA 
ACTCTGTCTAGACCCCATGACTAATGACTCTTCCACATTGGACAGCCTTTTTAGCAGC 
CTCCATAGCAGTAATGATACAGTGCCCATTAAATTCAAGAAGTGCTGCTATGGATATT 
GCATTGATCTGCTGGAAAAGATAGCAGAAGACATGAACTTTGACTTCGACCTCTATAT 
TGTAGGGGATGGAAAGTATGGAGCATGGAAAAATGGGCACTGGACTGGGCTAGTGGGT 
GATCTCCTGAGAGGGACTGCCCACATGGCAGTCACTTCCTTTAGCATCAATACTGCAC 
GGAGCCAGGTGATAGATTTCACCAGCCCTTTCTTCTCCACCAGCTTGGGCATCTTAGT 
GAGGACCCGAGATACAGCAGCTCCCATTGGAGCCTTCATGTGGCCACTCCACTGGACA 
ATGTGGCTGGGGATTTTTGTGGCTCTGCACATCACTGCCGTCTTCCTCACTCTGTATG 
AATGGAAGAGTCCATTTGGTTTGACTTCCAAGGGGCGAAATAGAAGTAAAGTCTTCTC 
CTTTTCTTCAGCCTTGAACATCTGTTATGCCCTCrTGTTTGGCAGAACAGTGGCCATC 
AAACCTCCAAAATGTTGGACTGGAAGGTTTCTAATGAACCTTTGGGCCATTTTCTGTA 
TGTTTTGCCTTTCCACATACACGGCAAACTTGGCTGCTGTCATGGTAGGTGAGAAGAT 
CTATGAAGAGCTTTCTGGAATACATGACCCCAAGTTACATCATCCTTCCCAAGGATTC 
CGCTTTGGAACTGTCCGAGAAAGCAGTGCTGAAGATTATGTGAGACAAAGTTTCCCAG 
AGATGCATGAATATATGAGAAGGTACAATGTTCCAGCCACCCCTGATGGAGTGGAGTA 
TCTGAAGAATGATCCAGAGAAACTAGACGCCTTCATCATGGACAAAGCCCTTCTGGAT 
TATGAAGTGTCAATAGATGCTGACTGCAAACTTCTCACTGTGGGGAAGCCATTTGCCA 
TAGAAGGTTACGGCATTGGCCTCCCACCCAACTCTCCATTGACCGCCAACATATCCGA 
GCTAATCAGTCAATACAAGTCACATGGGTTTATGGATATGCTCCATGACAAGTGGTAC 
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AGGGTGGTTCCCTGTGGCAAGAGAAGTTTTGCTGTCACGGAGACTTTGCAAATGGGCA 
TCAAACACTTCTCTGGGCTCTTTGTGCTGCTGTGCATTGGATTTGGTCTGTCCATTTT 
GACCACCATTGGTGAGCACATAGTATACAGGCTGCTGCTACCACGAATCAAAAACAAA 
TCCAAGCTGCAATACTGGCTCCACACCAGCCAGAGATTACACAGAGCAATAAATACAT 
CATTTATAGAGGAAAAGCAGCAGCATTTCAAGACCAAACGTGTGGAAAAGAGATCTAA 
TGTGGGACCCCGTCAGCTTACCGTATGGAATACTTCCAATCTGAGTCATGACAACCGA 
CGGAAATACATCTTTAGTGATGAGGAAGGACAAAACCAGCTGGGCATCCGGATCCACC 
AGGACATCCCCCTCCCTCCAAGGAGAAGAGAGCTCCCTGCCTTGCGGACCACCAATGG 
GAAAGCAGACTCCCTAAATGTATCTCGGAACTCAGTGATGCAGGAACTCTCAGAGCTC 
GAGAAGCAGATTCAGGTGATCCGTCAGGAGCTGCAGCTGGCTGTGAGCAGGAAAACGG 
AGCTGGAGGAGTATCAAAGGACAAGTCGGACTTGTGAGTCCTAG 




ORF Start: ATG at 3 


ORF Stop: TAG at 3348 




SEQ ID NO: 110 


1115aa iMWat 1 25453. 7kD 


NOV33a, 

CG58 183-01 Protein Sequence 


MRRLSLWWLLSRVCLLLPPPCALVLAGVPSSSSHPQPCQILKRIGHAVRVGAVHLQPW 
TTAPRAASRAPDDSRAGAQRDEPEPGTRRSPAPSPGARWLGSTLHGRGPPGSRKPGEG 
ARAEALWPRDALLFAVDNLNRVEGLLPYNLSLEWMAIEAGLGDLPLLPFSSPSSPWS 
SDPFSFLQSVCHTVWQGVSALLAFPQSQGEMMELDLVSLVLHIPVISIVRHEFPRES 
QNPLHLQLSLENSLSSDADVTVSILTMNNWYNFSLLLCQEDWNITDFLLLTQNNSKFH 
LGSIINITANLPSTQDLLSFLQIQLESIKNSTPTWMFGCDMESIRRIFEITTQFG\7M 
PPELRWVLGDSQNVEELRTEGLPLGLIAHGKTTQSVFEHYVQDAMELVARAVATATMI 
QPELALIPSTMNCMEVETT1TLTSGQYLSRFLANTTFRGLSGSIRVKGSTIVSSENNFF 
IWNLQHDPMGKPMWTRLGSWQGGKIVMDYGIWPEQAQRHKTHFQHPSKLHLRWTLIE 
HPFVFTREVDDEGLCPAGQLCLDPMTNDSSTLDSLFSSLHSSNDTVPIKFKKCCYGYC 
IDLLEKIAEDMNFDFDLYIVGDGKYGAWKNGHWTGLVGDLLRGTAHMAVTSFSINTAR 
SQVIDFTS PFFSTSLGI LVRTRDTAAP I GAFMWPLHWTMWLGI FVALHI TAVFLTLYE 
WKSPFGLTSKGRNRSKVFSFSSALNICYALLFGRTVAIKPPKCWTGRFLMNLWAIFCM 
FCLSTYTANLAAVMVGEKIYEELSGIHDPKLHHPSQGFRFGTVRESSAEDYVRQSFPE 
MHEYMRRYNVPATPDGVEYLKNDPEKLDAFIMDKALLDYEVSIDADCKLLTVGKPFAI 
EGYGIGLPPNSPLTANISELISQYKSHGFMDMLHDKWYRWPCGKRSFAVTETLQMGI 
KHFSGLFVLLCIGFGLSILTTIGEHIVYRLLLPRIKNKSKLQYWLHTSQRLHRAINTS 
FIEEKQQHFKTKRVEKRSNVGPRQLTVWNTSNLSHDNRRKYIFSDEEGQNQLGIRIHQ 
DIPLPPRRRELPALRTTNGKADSLNVSRNSVMQELSELEKQIQVIRQELQLAVSRKTE 
LEEYQRTSRTCES 



Further analysis of the NOV33a protein yielded the following properties shown in 
Table 33B. 



Table 33B. Protein Sequence Properties NOV33a 


PSort 
analysis: 


0.6400 probability located in plasma membrane; 0.4600 probability located in 
Golgi body; 0.3700 probability located in endoplasmic reticulum (membrane); 
0.1000 probability located in endoplasmic reticulum (lumen) 


SignalP 
analysis: 


Likely cleavage site between residues 34 and 35 



A search of the NOV33a protein against the Geneseq database, a proprietary 
database that contains sequences published in patents and patent publication, yielded 
several homologous proteins shown in Table 33C. 
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Table 33C . Genescq Results for NOV33a 


Geneseq 
Identifier 


Protein/Organism/Length [Patent 
#, Date] 


NOV33a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Region 


Expect 
Value 


AAU02199 


Human glutamate receptor-like 
protein, MEM4 - Homo sapiens, 
1043 aa. [WO200144473-A2, 21- 
JUN-2001] 


95..1103 
6..1007 


508/1047 (48%) 
680/1047 (64%) 


0.0 


AAB42494 


Human ORFX ORF2258 
polypeptide sequence SEQ ID 
NO:4516 - Homo sapiens, 901 aa. 
[WO200058473-A2, 05-OCT- 
2000] 


95. .985 
6..885 


484/912(53%) 
635/912 (69%) 


0.0 


AAU02198 


Human glutamate receptor-like 
protein, MEM3 - Homo sapiens, 
971 aa. [WO2001 44473 -A2, 21- 
JUN-2001] 


532..1103 
362..935 


361/579 (62%) 
448/579 (77%) 


0.0 


AAU02197 


Human glutamate receptor-like 
protein, MEM2 - Homo sapiens, 
965 aa. [WO200144473-A2, 21- 
JUN-2001] 


532..1103 
362..929 


352/579 (60%) 
437/579 (74%) 


0.0 


AAR44192 


Rat NMDA receptor subunit, 
NR2A - Rattus rattus, 1464 aa. 
[DE4216321-A, 18-NOV-1993] 


175.. 1023 
77..911 


245/873 (28%) 
425/873 (48%) 


2e-83 



In a BLAST search of public sequence databases, the NOV33a protein was found to 
have homology to the proteins shown in the BLASTP data in Table 33D. 



Table 33D. Public BLASTP Results for NOV33a 


Protein 
Accession 
Number 


Protein/Organism/Length 


NOV33a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Portion 


Expect 
Value 


AAL40734 


N-METHYL-D-ASPARTATE 
RECEPTOR 3A - Homo sapiens 
(Human), 1 1 1 5 aa. 


1..1115 
1..1115 


1110/1115(99%) 
1112/1115(99%) 


0.0 


Q62800 


IONOTROPIC GLUTAMATE 
RECEPTOR - Rattus norvegicus 
(Rat), 1115 aa. 


1-1 115 
1..1115 


1032/1115 (92%) 
1083/1115 (96%) 


0.0 
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Q9R1M7 


N-METHYL-D-ASPARTATE 
RECEPTOR SPLICE VARIANT 
NR3A-2 - Rattus norvegicus 
(Rat), 1135 aa. 


1..1115 
1-1 135 


1032/1135 (90%) 
1083/1135 (94%) 


0.0 


CAC69380 


SEQUENCE 7 FROM PATENT 
WOO 144473 - Homo sapiens 
(Human), 1043 aa. 


95..1103 
6..1007 


508/1047(48%) 
680/1047(64%) 


0.0 


Q91ZU9 


NMDA-TYPE GLUTAMATE 
RECEPTOR SUBUNIT NR3B 
PRECURSOR - Mus musculus 
(Mouse), 1003 aa. 


112..1103 
34..980 


510/1001 (50%) 
669/1001 (65%) 


0.0 



PFam analysis predicts that the NOV33a protein contains the domains shown in the 
Table 33E. 



Table 33E. Domain Analysis of NOV33a 


Pfam Domain 


NOV33a Match 
Region 


Identities/ 
Similarities 
for the Matched 
Region 


Expect 
Value 


lig chan: domain 1 of 
1 


6 '4..952 


81/323 (25%) 
232/323 (72%) 


4e-95 



Example 34. 



The NOV34 clone was analyzed, and the nucleotide and predicted polypeptide 
sequences are shown in Table 34A. 



Table 34A. NOV34 Sequence Analysis 




SEQIDNO:lll 1 1253 bp 


NOV34a, 

CG593 1 5-01 DNA Sequence 


CCCAGCGCCATGGGGGAGTGGGCGTTCCTGGGCTCGCTGCTGGACGCCGTGCAGCTGC 
AGTCGCCGCTCGrGGGCCGCCTCTGGCTGGTGGTCATGCTGATCrTCCGCATCCTGGT 
GCTGGCCACGGTGGGCGGCGCCGTGTTCGAGGACGAGCAAGAGGAGTTCGTGTGCAAC 
ACGCTGCAGCCGGGCTGTCGCCAGACCTGCTACGACCGCGCCTTCCCGGTCTCCCACT 




CTACTCCATGCACCGGGCAGGCAAGGAGGCGGGCGGCGCTGAGGCGGCGGCGCAGTGC 
GCCCCCGGACTGCCCGAGGCCCAGTGCGCGCCGTGCGCCCTGCGCGCCCGCCGCGCGC 




CGGCCAGGCGCTGCTCTACGGCTTCCGCGTGGCCCCGCACTTCGCGTGCGCCGGTCCG 
CCCTGCCCGCACACGGrCGACTGCTTCGTGAGCCGGCCCACCGAGAAGACCGTCTTCG 
TGCTCTTCTATTTCGCGGTGGGGCTGCTGTCGGCGCTGCTCAGCGTAGCCGAGCTGGG 
CCACCTGCTCTGGAAGGGCCGCCCGCGCGCCGGGGAGCGTGACAACCGCTGCAACCGT 
GCACACGAAGAGGCGCAGAAGCTGCTCCCGCCGCCGCCGCCGCCACCTCCGCCACCGG 
CCCTGCCCTCCCGGCGCCCCGGCCCCGAGCCGTGCGCCCCGCCGGCCTATGCGCACCC 
GGCGCCGGCCAGCCTCCGCGAGTGCGGCAGCGGCCGCGGCAGGAATGCGCCAATGGCT 
CCCAGATGTGGACGCCACCGCTTAACCCCTTACCCCCCAGCCGCGCTCCCCCAAGGGC 
CTTCCAGCCTGAGCCCCGCCAACAGCAGGGAGCTCTGCCCAGGTGAGAACCAGCCCAG 
GACTGGAGTCAGCGCCAGCCCGCCCCTAGTGCCCACGGACACCTCCCAACCTAGATCC 
TACCTGTCTTCCTTCCTTGAGGCTGGAGGGGAAGGCTCATGGACACAAGAATGCAAGC 
ATGCATGCACACAGCTACACTGCCTCCCATCCCCTCCCGCCGACGCTGCCAGGGTGCC 
CCTCCCTCGCTCCCCATCCTGGCAGGGCGGGCGGCGCAGAGCGCTCCACTCCGGATTC 
CCCACGCCCCCGAGCCGTTCGCAGGCTCGCACAAG 
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ORF Start: ATG at 10 


ORF Stop: AG at 1252 




SEQ ID NO: 112 


414 aa IMW at 44773.0kD 


NOV34a, 

CG59315-01 Protein Sequence 


MGEWAFLGSLLDAVQLQS PLVGRLWLWMLI FRI LVLATVGGAVFEDEQEE FVCNTLQ 
PGCRQTCYDRAFPVSHYRFWLFHILLLSAPPVLFWySMHRAGKEAGGAEAAAQCAPG 
LPEAQCAPCALRARRARRCYLLSVALRLLAELTFLGGQALLYGFRVAPHFACAGPPCP 
HTVDCFVSRPTEKTVFVLFYFAVGLLSALLSVAELGHLLWKGRPRAGERDNRCNRAHE 
EAQKLLPPPPPPPPPPALPSRRPGPEPCAPPAYAHPAPASLRECGSGRGRNAPMAPRC 
GRHRLTPYPPAALPQGPSSLSPANSRELCPGENQPRTGVSASPPLVPTDTSQPRSYLS 
SFLEAGGEGSWTQECKHACTQLHCLPSPPADAARVPLPRSPSWQGGRRRALHSGFPTP 
PSRSQART 



Further analysis of the NOV34a protein yielded the following properties shown in 
Table 34B. 



Table 34B. Protein Sequence Properties NOV34a 


PSort 
analysis: 


0.6000 probability located in plasma membrane; 0.4000 probability located in 
Golgi body; 0.3000 probability located in endoplasmic reticulum (membrane); 
0.0300 probability located in mitochondrial inner membrane 


SignalP 
analysis: 


Likely cleavage site between residues 39 and 40 



A search of the NOV34a protein against the Geneseq database, a proprietary 
database that contains sequences published in patents and patent publication, yielded 
several homologous proteins shown in Table 34C. 



Table 34C. Geneseq Results for NOV34a 


Geneseq 
Identifier 


Protein/Organism/Length 
[Patent #, Date] 


NOV34a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Region 


Expect 
Value 


AAW49009 


Mouse alpha 3 connexin protein - 
Mus sp, 417 aa. [WO9830677-A1, 
16-JUL-1998] 


1..296 
1..327 


121/334 (36%) 
169/334 (50%) 


5e-52 


AAW23968 


Connexin protein Cx40 - Homo 
sapiens, 358 aa. [WO9802150-A1, 
22-JAN-1998] 


1..215 
1..232 


93/233 (39%) 
133/233 (56%) 


9e-46 


AAW23970 


Connexin protein Cx45 - Homo 
sapiens, 396 aa. [WO9802150-A1, 
22-JAN-1998] 


4..212 
3..253 


93/252 (36%) 
137/252 (53%) 


3e-43 


AAW23969 


Connexin protein Cx43 - Homo 
sapiens, 382 aa. [WO9802150-A1, 
22-JAN-1998] 


1..216 
1..235 


86/235 (36%) 
130/235(54%) 


le-42 


AAM93194 


Human polypeptide, SEQ ID NO: 


7..384 
7.. 360 


129/409 (31%) 
169/409 (40%) 


8e-38 
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In a BLAST search of public sequence databases, the NOV34a protein was found to 
have homology to the proteins shown in the BLASTP data in Table 34D. 



Table 34D. Public BLASTP Results for NOV34a 


Protein 
Accession 
Number 


Protein/Organism/Length 


NOV34a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Portion 


Expect 
Value 


Q91YD1 


CONNEXIN30.2 - Mus musculus 
(Mouse), 278 aa. 


1..283 
1..265 


228/283 (80%) 
240/283 (84%) 


e-129 


146053 


connexin44 - bovine, 402 aa. 


1..397 
1..396 


151/418(36%) 
207/418(49%) 


le-62 


P41987 


Gap junction alpha-3 protein 
(Connexin 44) (Cx44) - Bos 
taurus (Bovine), 401 aa. 


2..397 
1..395 


150/417(35%) 
206/417(48%) 


4e-62 


AAA50954 


CONNEXIN44 - Bos taurus 
(Bovine), 407 aa. 


1..398 
1..402 


154/429 (35%) 
214/429 (48%) 


le-60 


Q9TU17 


GAP JUNCTION PROTEIN 
(CONNEXIN) - Ovis aries 
(Sheep), 413 aa. 


1..398 
1..408 


147/415 (35%) 
204/415 (48%) 


le-60 



PFam analysis predicts that the NOV34a protein contains the domains shown in the 
Table 34E. 



Table 34E. Domain Analysis of NOV34a 


Pfam Domain 


NOV34a Match 
Region 


Identities/ 
Similarities 
for the Matched 
Region 


Expect 
Value 


DUF26: domain 1 of 1 


107.. 152 


12/56 (21%) 
27/56 (48%) 


1.4 


connexin: domain 1 of 
1 


1..212 


101/247 (41%) 
150/247 (61%) 


6.5e-75 



Example 35. 



The NOV35 clone was analyzed, and the nucleotide and predicted polypeptide 
sequences are shown in Table 3 5 A. 
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Table 35A. NOV35 Sequence Analysis 




SEQIDNO:113 1 724 bp 


NOV35a, 

CG59203-01 DNA Sequence 


TAAATTCGCGGCCGCGTCGACCTTCCGCAGACTCAACTGAGAAGTCAGCCTCTGCGGC 


AGGCACCAGGAATCTGCCTTTTCAGTTCTGTCTCCGGCAGGCTTTGAGGATGAAGGCT 


GCGGGCATTCTGACCCTCATTGGCTGCCTGGTCACAGGCGCCGAGTCCAAAATCTACA 
CTCGTTGCAAACTGGCAAAAATATTCTCGAGGGCTGGCCTGGACAATTACTGGGGCTT 
CAGCCTTGGAAACTGGATCTGCATGGCGTATTATGAGAGCGGCTACAACACCACAGCC 
CAGACGGTCCTGGATGACGGCAGCATCGACTACGGCATCTTCCAGATCAACAGCTTCG 
CGTGGTGCAGACGCGGAAAGCTGAAGGAGAACAACCACTGCCACGTCGCCTGCTCAGC 
CTTGGTCACTGATGACCTCACAGATGCGATTATCTGTGCCAAGAAAATTGTTAAAGAG 
ACACAAGGAATGAACTATTGGCAAGGCTGGAAGAAACACTGTGAGGGGAGAGACCTGT 
CCGACTGGAAAAAAGACTGTGAGGTTTCCTAAACTGGAACTGGACCCAGGATGCTTTG 
CAGCAACGCCCTAGGGTTTGCAGTGAATGTCCAAATGCCTGTGTCATCTTGTCCCGTT 


TCCTCCCAATATTCCTTCTCAAACTTGGAGAGGGAAAATTAAGCTATACTTTTAAGAA 


AATAAATATTTCCATTTAAATGTCAAAA 




ORF Start: ATG at 108 


ORF Stop: TAA at 552 




SEQIDNO: 114 


148 aa ImW at 16655. 9kD 


LSkJ V jJd, 

CG59203-01 Protein Sequence 


MKAAGILTLIGCLVTGAESKIYTRCKLAKIFSRAGLDNYWGFSLGNWICMAYYESGYN 
TTAQTVLDDGSIDYGIFQINSFAWCRRGKLKENNHCHVACSALVTDDLTDAIICAKKI 
VKETQGMNYWQGWKKHCEGRDLSDWKKDCEVS 




SEQIDNO: 115 1 453 bp 


NOV35b 

CG59203-02 DNA Sequence 


CATTCTGACCCTCATTGGCTGCCTGGTCACAGGCGCCGAGTCCAASATCTACACTCGT 


T JCAAACTGGCAAAAATATTCTCGAGGGCT JGCCTGGACAATTACTGGGGCTTCAGCC 


TTGGAAACTGGATCTGCATGGCGTATTATGAGAGCGGCTACAACACCACAGCCCAGAC 
GGTCCTGGATGACGGCAGCATCGACTACGGCATCTTCCAGATCAACAGCTTCGCGTGG 
TGCAGACGCGGAAAGCTGAAGGAGAACAACCACTGCCACGTCGCCTGCTCAGCCTTGG 
TCACTGATGACCTCACAGATGCAATTATCTGTGCCAGGAAAATTGTTAAAGAGACACA 
AGGAATGAATTATTGGCAAGGCTGGAAGAAACATTGTGAGGGCAGAGACCTGTCCGAC 
TGGAAAAAAGGCTGTGAGGTTTCCTAAACTGGAACTGGACCCAGGAT 




ORF Start: ATG at 134 


ORF Stop: TAA at 431 




SEQIDNO: 116 


99 aa jMW at 1 1288.6kD 


NOV35b, 

CG59203-02 Protein Sequence 


MAYYESGYNTTAQTVLDDGSIDYGIFQINSFAWCRRGKLKENNHCHVACSALVTDDLT 
DAI I CARKIVKETQGMNYWQGWKKHCEGRDLSDWKKGCEVS 



Sequence comparison of the above protein sequences yields the following sequence 
relationships shown in Table 35B. 



Table 35B. Comparison of NOV35a against NOV35b. 


Protein Sequence 


NOV35a Residues/ 
Match Residues 


Identities/ 
Similarities for the Matched Region 


NOV35b 


50..148 
1..99 


97/99 (97%) 
98/99 (98%) 



Further analysis of the NOV35a protein yielded the following properties shown in 
Table 35C. 
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Table 35C. Protein Sequence Properties NOV35a 


Psort 
analysis: 


0.3700 probability located in outside; 0.1697 probability located in microbody 
(peroxisome); 0.1000 probability located in endoplasmic reticulum 
(membrane); 0.1000 probability located in endoplasmic reticulum (lumen) 


SignalP 
analysis: 


Likely cleavage site between residues 20 and 21 



A search of the NOV35a protein against the Geneseq database, a proprietary 
database that contains sequences published in patents and patent publication, yielded 
several homologous proteins shown in Table 35D. 



Table 35D. Geneseq Results for NOV35a 


Geneseq 
Identifier 


Protein/Organism/Length [Patent 
#, Date] 


Residues/ 

Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Region 


Expect 
Value 


AAY57399 


Human lysoenzyme LYC2 
polypeptide - Homo sapiens, 1 48 aa. 
[WO200012722-A1, 09-MAR- 
2000] 


1..148 
1..I48 


143/148 (96%) 
147/148 (98%) 


3e-86 


AAU29169 


Human PRO polypeptide sequence 
#146 - Homo sapiens, 148 aa. 
[WO200168848-A2, 20-SEP-2001] 


1..I48 
1..148 


143/148 (96%) 
146/148 (98%) 


6e-86 


AAB66145 


Protein of the invention #57 - 
Unidentified, 148 aa. 
[WO200078961-A1, 28-DEC-2000] 


1..148 
1..148 


143/148 (96%) 
146/148 (98%) 


6e-86 


AAY99396 


Human PR01278 (UNQ648) amino 
acid sequence SEQ ID NO:203 - 
Homo sapiens, 148 aa. 
[WO200012708-A2, 09-MAR- 
2000] 


1..148 
1..148 


143/148 (96%) 
146/148 (98%) 


6e-86 


AAY71109 


Human Hydrolase protein-7 
(HYDRL-7) - Homo sapiens, 194 
aa. [WO200028045-A2, 18-MAY- 
2000] 


1-148 
47.. 194 


142/148 (95%) 
146/148 (97%) 


le-85 



In a BLAST search of public sequence databases, the NOV35a protein was found to 
have homology to the proteins shown in the BLASTP data in Table 35E. 
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Table 35E. Public BLASTP Results for NOV35a 


Protein 
Accession 
Number 


Protein/Organism/Length 


NOV35a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Portion 


Expect 
Value 


Q96LF2 


BA14C22.1 (NOVEL PROTEIN 
SIMILAR TO LYSOZYME) - 
Homo sapiens (Human), 148 aa. 


1..148 
1..148 


148/148 (100%) 
148/148 (100%) 


7e-88 


Q9H1R9 


BA534G20.1.1 (NOVEL PROTEIN 
SIMILAR TO LYSOZYME C-l 
(1,4-BETA-N- 

ACYLMURAMIDASE C, EC 
3.2.1.17) (ISOFORM 1)) - Homo 
sapiens (Human), 148 aa. 


1..148 
1..148 


144/148(97%) 
147/148 (99%) 


4e-86 


AAH21730 


HYPOTHETICAL 21 .6 KDA 
PROTEIN - Homo sapiens 
(Human), 194 aa. 


1..148 
47.. 194 


143/148 (96%) 
146/148 (98%) 


2e-85 


Q9CPX3 


1700038F02RIK PROTEIN - Mus 
musculus (Mouse), 148 aa. 


1..148 
1..148 


110/148 (74%) 
127/148 (85%) 


3e-66 


Q9H1R8 


BA534G20.1 .2 (NOVEL PROTEIN 
SIMILAR TO LYSOZYME C-l 
(1,4-BETA-N- 

ACYLMURAMIDASE C, EC 
3.2.1.17) (ISOFORM 2)) - Homo 
sapiens (Human), 106 aa (fragment). 


20..125 
1..106 


104/106 (98%) 
106/106 (99%) 


le-59 



PFam analysis predicts that the NOV35a protein contains the domains shown in the 



Table 35F. Domain Analysis of NOV35a 


Pfam Domain 


NOV35a Match Region 


Identities/ 
Similarities 
for the Matched Region 


Expect Value 


lys: domain 1 of 1 


20..145 


68/129 (53%) 
107/129 (83%) 


8e-58 



Example 36. 



The NOV36 clone was analyzed, and the nucleotide and predicted polypeptide 
sequences are shown in Table 36A. 
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Table 36A. NOV36 Sequence Analysis 




SEQIDNO:117 \l\2 bp 


NOV36a, 

CG5 8662-01 DNA Sequence 


GCAGCTATTGCACTTAATCGCGGCTGCTAGCACCATGTCCCGCGTTTTGGTGCCTTGC 


CATGTGAAAGGCACCGTAGCCCTGCAGGTGGGGGACGTATGGACCTCCCAAGGCCGGC 
CTAGTGTGCTGGTCATTGATGTCACCTTCCCCTGTGTCACTCCGTTCGAGGGGATCAC 
ATTTAAGAATTATTACACAGCGTTTTTTGAGCATCCTGTCTGTCAGCACACCTCAGCA 
CACACACCGGCCAAGTGGGTGACCTGCCTGTGGGACTACTGTCTGATGCCCGACCCAC 
ACAGTGAGGAGGGAGCCCAGGAGTATGTGTCGCTGTTCAAGCAACAGATACTGTGTGA 
CATGGCCAGAATATCGGAGCTACACCrGATTCTGCAGCAGCCATCACCACTGTGGCTG 
TCTTTCACAGTGGAGGAGCTGCAGATCTATCAGCAGGGACCAAAGAGCCCCTCCATGA 
TCTTCCCCAAGTGGCTCTCCCACCCAGTGCCCTGTGAGCAACCTGCACTCCTCCATGA 
GGGTCTCCCAGACCCCAGCAGGGTATCCTCTGAGGTGCAGCAGATGTGGGCACTGACA 
GAGATGATCCGGGCCAGTCACACCTCCGCGAGGATAGGCCACTTTGATGTAGATGGCT 
GTTATGACCTGAACTTACTCTCCTACACTTGAGTGGTGGCTCCTAGCCAAGATGTTGG 
CCTTTCTGTGCCCACT 




ORF Start: ATG at 35 


ORF Stop: TGA at 668 




SEQIDNO: 118 


211 aa ImW at 23932.3kD 


NOV36a, 

CG58662-01 Protein Sequence 


MSRVLVPCHVKGTVALQVGDVWTSQGRPSVLVIDVTFPCVTPFEGITFKNYYTAFFEH 
PVCQHTSAHTPAKWVTCLWDYCLMPDPHSEEGAQEYVSLFKQQILCDMARISELHLIL 
QQPSPLWLSFTVEELQIYQQGPKSPSMIFPKWLSHPVPCEQPALLHEGLPDPSRVSSE 
VQQMWALTEMIRASHTSARIGHFDVDGCYDLNLLSYT 




SEQIDNO: 119 j 843 bp 


NOV36b, 

L-ojoooz-Uz UJNA Sequence 


CTGGCCTGAAGCCATGTCCCGCGTTCTAGCACCATGTCCCGCGTCTAGCACCATGTCC 


CGCGTCTAGCACCATGTCCCGCGTTCTAGCACCATGTCCCGCGTTCTAGCACCATGTC 


CCGCGTTCTAGCACCATGTCCCGCGTTTTGGTGCCTTGCCATGTGAAAGGCTCCGTAG 
CCCTCCAGGTGGGCGACGTGCGGACCTCCCAAGGCCGGCCTGGCGTGCTGGTCATCGA 
TGTCACCTTCCCCAGCGTCGCTCCCTTCGAGTTGCAGGAAATCACGTrTAAGAATTAC 
TACACAGCTTTTTTGAGCATCCGTGTCCGTCAGTACACCTCAGCACACACACCTGCCA 




AGCCCAGGAGTATGTATCGCTGTTCAAGCATCAGATGCTATGTGACATGGCTAGAATA 
TCGGAGCTACGCCTGATTCTGCGGCAGCCATCACCACTGTGGCTGTCTTTCACAGTGG 
AGGAGCTGCAGATCTATCAGCAGGGACCAAAGAGCCCCTCCGTGACCTTTCCCAAGTG 
GCTCTCCCACCCAGTGCCCTGTGAGCAACCTGCACTCCTCCGTGAGGGTTTCCCAGAC 
CCCAGCAGGGTATCCTCCGAGGTGCAGCAGATGTGGGCACTGACAGAGATGATCCGGG 
CCAGTCACACCTCCGCAAGGATCGGCCGCTTTGATGTGGATGGCTGTTATGACCTGAC 
CTTGCTCTCCTACACTTGAATGGTTGCTCTTAGCCAAGATGTTGGCCTTTTTGTGGGC 


ACAGAAAGGCCAACGCGGGACATGGTGCTAG 




ORF Start: ATG at 132 


ORF Stop: TGA at 771 




SEQ ID NO: 120 


213 aa |MW at 24222.6kD 


NOV36b, 

CG5 8662-02 Protein Sequence 


MSRVLVPCHVKGSVALQVGDVRTSQGRPGVLVIDVTFPSVAPFELQEITFKNYYTAFL 
SIRVRQYTSAHTPAKWVTCLRDYCLMPDPHSEEGAQEYVSLFKHQMLCDMARISELRL 
ILRQPSPLWLSFTVEELQIYQQGPKSPSVTFPKWLSHPVPCEQPALLREGFPDPSRVS 
SEVQQMWALTEMIRASHTSARIGRFDVDGCYDLTLLSYT 



Sequence comparison of the above protein sequences yields the following sequence 
relationships shown in Table 36B. 



Table 36B. Comparison of NOV36a against NOV36b. 


Protein Sequence 


NOV36a Residues/ 


Identities/ 


Match Residues 


Similarities for the Matched Region 


NOV36b 


1..21 1 


188/213 (88%) 




1..213 


193/213 (90%) 



Further analysis of the NOV36a protein yielded the following properties shown in 
Table 36C. 
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Table 36C. Protein Sequence Properties NOV36a 


PSort 
analysis: 


0.5666 probability located in microbody (peroxisome); 0.4500 probability 
located in cytoplasm; 0.1562 probability located in lysosome (lumen); 0.1000 
probability located in mitochondrial matrix space 


SignalP 
analysis: 


No Known Signal Sequence Predicted 



A search of the NOV36a protein against the Geneseq database, a proprietary 
database that contains sequences published in patents and patent publication, yielded 
several homologous proteins shown in Table 36D. 



Table 36D. Geneseq Results for NOV36a 


Geneseq 
Identifier 


Protein/Organism/Length [Patent 
#,Date] 


NOV36a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Region 


Expect 
Value 


AAG04038 


Human secreted protein, SEQ ID 
NO: 81 19 - Homo sapiens, 1 15 aa. 
[EP1 03340 1-A2, 06-SEP-2000] 


1..103 
1-105 


82/105 (78%) 
85/105 (80%) 


le-39 



In a BLAST search of public sequence databases, the NOV36a protein was found to 
have homology to the proteins shown in the BLASTP data in Table 36E. 



Table 36E. Public BLASTP Results for NOV36a 


Protein 
Accession 
Number 


Protein/Organism/Length 


NOV36a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Portion 


Expect 
Value 


Q9BSH3 


SIMILAR TO RIKEN CDNA 
1500032A17 GENE - Homo 
sapiens (Human), 213 aa. 


1.-21 1 
1..213 


190/213 (89%) 
195/213 (91%) 


e-107 


Q9CQM0 


1500032A17RIK PROTEIN - 
Mus musculus (Mouse), 213 aa. 


1..211 
1.-213 


174/213 (81%) 
183/213 (85%) 


4e-97 



PFam analysis predicts that the NOV36a protein contains the domains shown in the 
Table 36F. 
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Table 36F. Domain Analysis of NOV36a 



Pfam Domain 



NOV36a Match Region 



Identities/ 
Similarities 
for the Matched Region 



Expect Value 



No Significant Matches Found 



Example 37. 

The NOV37 clone was analyzed, and the nucleotide and predicted polypeptide 
sequences are shown in Table 37A. 



Table 37A. NOV37 Sequence Analysis 



SEQIDNO: 121 



1 520 bp 



NOV37a, 

CG58584-01 DNA Sequence 



TGTTCAAACCGGCACGGTCTGATCCGGAAATAT 
GGCCTCAATATGTGCCGCCAGTGTTTCCGTCAGTACGCGAAGGATATCGGTTTCATTA 
AGAAAGACCTGAGCTGTCTTCCTTGGCACTGCCTATGGAGGTGA CACCCATCTCCTCC 
ATCATGGCCATCCTGAGACCGCTCGCGAAGCCCAAGATCATCAAAAAGAGCACCAAGT 



TCACTGGGAACCAGTCAGACTGATATGTCAAAATTAAGGGTAACTGGTGGAAACACAG 
AGGTATTGACAACAGGGTTCATAGAAGGTTTGAGGGCCAGATCTATGCCCAACATTGG 
TTATGGGAGAAAC AAA AAGftCAAAGCACATACl-GCCCAGTGGC TT CTGGAAGTTCCTG 



GTCCACAACG1TAA GGA GCTGGAAGTACT GC TGGTGA GCAGAGGAG ACAG CAAATG 



ORF Start: TTT at 3 ORF Stop: TG A at 2 1 6 



SEQIDNO: 122 



71 aa MWat8461.8kD 



NOV37a, 
CG5 85 84-01 Protein Sequence 



Further analysis of the NOV37a protein yielded the following properties shown in 
Table 37B. 



Table 37B. Protein Sequence Properties NOV37a 


PSort 

analysis: 


0.6400 probability located in microbody (peroxisome); 0.4500 probability 
located in cytoplasm; 0.1000 probability located in mitochondrial matrix 
space; 0.1000 probability located in lysosome (lumen) 


SignalP 
analysis: 


No Known Signal Sequence Predicted 



A search of the NOV37a protein against the Geneseq database, a proprietary 
database that contains sequences published in patents and patent publication, yielded 
several homologous proteins shown in Table 37C. 
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Table 37C. Geneseq Results for NOV37a 


Geneseq 
Identifier 


Protein/Organism/Length [Patent 
#, Date] 


NOV37a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities 
for the 
Matched 
Region 


Expect 
Value 


AAG76128 


Human colon cancer antigen protein 
SEQ ID NO:6892 - Homo sapiens, 
80 aa. [WO200122920-A2, 05-APR- 
2001] 


7..60 

2..55 


46/54 (85%) 
48/54 (88%) 


4e-24 


AAM79084 


Human protein SEQ ID NO 1746 - 
Homo sapiens, 56 aa. 
[WO200157190-A2, 09-AUG-2001] 


7..60 
3..56 


39/54 (72%) 
43/54 (79%) 


2e-18 


AAG39921 


Arabidopsis thaliana protein 
fragment SEQ ID NO: 49464 - 
Arabidopsis thaliana, 637 aa. 
[EP1033405-A2, 06-SEP-2000] 


7..63 
3..58 


40/57 (70%) 
45/57 (78%) 


2e-18 


AAM80068 


Human protein SEQ ID NO 3714 - 
Homo sapiens, 74 aa. 
[WO200157190-A2, 09-AUG-2001] 


7„58 
22..73 


38/52 (73%) 
42/52 (80%) 


5e-18 


AAG34802 


Arabidopsis thaliana protein 
fragment SEQ ID NO: 42406 - 
Arabidopsis thaliana, 56 aa. 
[EP1033405-A2, 06-SEP-2000] 


7..58 
3..54 


37/52 (71%) 
42/52 (80%) 


le-17 



In a BLAST search of public sequence databases, the NOV37a protein was found to 
have homology to the proteins shown in the BLASTP data in Table 37D. 
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Table 37D. Public BLASTP Results for NOV37a 


Protein 
Accession 
Number 


Protein/Organism/Length 


NOV37a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Portion 


Expect 
Value 


BAB79485 


RIBOSOMAL PROTEIN S29 - 
Homo sapiens (Human), 56 aa. 


7..60 
3..56 


53/54 (98%) 
53/54 (98%) 


le-27 


P30054 


40S ribosomal protein S29 - Homo 
sapiens (Human)„ 55 aa. 


7..60 
2..55 


53/54 (98%) 
53/54 (98%) 


le-27 


Q90YP2 


40S RIBOSOMAL PROTEIN S29 
- Ictalurus punctatus (Channel 
catfish), 56 aa. 


7..60 
3..56 


52/54 (96%) 
53/54 (97%) 


2e-27 


AAL62474 


RIBOSOMAL PROTEIN S29 - 
Spodoptera frugiperda (Fall 
armyworm), 56 aa. 


7..60 

3..56 


41/54 (75%) 
48/54 (87%) 


6e-21 


Q9VH69 


CG8495 PROTEIN - Drosophila 
melanogaster (Fruit fly), 56 aa. 


10..60 
6..56 


41/51 (80%) 
46/51 (89%) 


3e-20 



PFam analysis predicts that the NOV37a protein contains the domains shown in the 
Table 37E. 



Table 37E. Domain Analysis of NOV37a 


Pfam Domain 


NOV37a Match 
Region 


Identities/ 
Similarities 
for the Matched 
Region 


Expect 
Value 


Ribosomal SI 4: domain 1 
of 1 


7..61 


17/60(28%) 
51/60 (85%) 


7.5e-20 



Example 38. 

The NOV38 clone was analyzed, and the nucleotide and predicted polypeptide 
sequences are shown in Table 38A. 



Table 38A. NOV38 Sequence Analysis 




SEQIDNO:123 |2039 bp 


NOV38a, 

CG58538-01 DNA Sequence 


GCAGCACACCTGCTCTGTGACTGACACTCTTGCAGAAGTGGGGCCACTTCAGGGACAT 


GGACAAGGTGTTGTACCTGCTGTCACAGAGCCTGTTATCTGTTCAGAATGACCGAAGA 
AGCATGCCGAACACGGAGTCAGAAACGAGCGCTTGAACGGGACCCAACAGAGGACGAT 
GTGGAGAGCAAGAAAATAAAAATGGAGAGAGGATTGTTGGCTTCAGATTTAAACACTG 
ACGGAGACATGAGGGTGACACCTGAGCCGGGAGCAGGTCCAACCCAAGGATTGCTGAG 
GGCAACAGAGGCCACGGCCATGGCCATGGGCAGAGGCGAAGGGCTGGTGGGCGATGGG 
CCCGTGGACATGCGCACCTCACACAGTGACATGAAGTCCGAGAGGAGACCCCCCTCAC 




GACCACGGTGGCCTTGAAGGAGACTAGCACCGAGGCCCTCATGAAAAGCAGTCCTGAA 
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GAACGAGAAAGGATGATCAAGCAGCTGAAGGAAGAATTGAGGTTAGAAGAAGCAAAAC 
TCGTGTTGTTGAAAAAGTTGCGGCAGAGTCAAATACAAAAGGAAGCCACCGCCCAGAA 
GCCCACAGGTTCTGTTGGGAGCACCGTGACCACCCCTCCCCCGCTTGTTCGGGGCACT 
CAGAACATTCCTGCTGGCAAGCCATCACTCCAGACCTCTTCAGCTCGGATGCCCGGCA 

ACAGGCGAGCTCACAGGTCGTCATGCCCCCACTCGTCAGGGGGGCTCAGCAAATCCAC 
AGCATTAGGCAACATTCCAGCACAGGGCCACCGCCCCTCCTCCTGGCCCCCCGGGCGT 
CGGTGCCCAGTGTGCAGATTCAGGGACAGAGGATCATCCAGCAGGGCCTCATCCGCGT 
CGCCAATGTTCCCAACACCAGCCTGCTCGTCAACATCCCACAGCCCACCCCAGCATCA 
CTGAAGGGGACAACAGCCACCTCCGCTCAGGCCAACTCCACCCCCACTAGTGTGGCCT 
CTGTGGTCACCTCTGCCGAGTCTCCAGCAAGCCGACAGGCGGCCGCCAAGCTGGCGCT 
GCGCAAACAGCTGGAGAAGACGCTACTCGAGATCCCCCCACCCAAGCCCCCAGCCCCA 
GAGATGAACTTCCTGCCCAGCGCCGCCAACAACGAGTTCATCTACCTGGTCGGCCTGG 
AGGAGGTGGTGCAGAACCTACTGGAGACACAAGCAGGCAGGATGTCGGCCGCCACTGT 
GCTGTCCCGGGAGCCCTACATGTGTGCACAGTGCAAGACGGACTTCACGTGCCGCTGG 

AGGCGCTCAAGGTGGAGCACACCAGCCGGCTGAAGGCCGCCTTTGTGAAGGCGCTGCA 
GCAGGAACAGGAGATTGAGCAGCGGCTCCTGCAGCAGGGCACGGCCCCTGCACAGGCC 
AAGGCCGAGCCCACCGCTGCCCCACACCCCGTGCTGAAGCAGGCCTCCAGCCAGCTGT 
CCCGGGGTTCGGCCACGACGCCCCGAGGTGTCCTGCACACGTTCAGTCCGTCACCCAA 
ACTGCAGAACTCAGCCTCGGCCACAGCCCTGGTCAGCAGGACCGGCAGACATTCTGAG 
AGAACCGTGAGCGCCGGCAAGGGCAGCGCCACCTCCAACTGGAAGAAGACGCCCCTCA 
GCACAGGCGGGACCCTTGCGTTTGTCAGCCCAAGCCTGGCGGTGCACAAGAGCTCCTC 
GGCCGTGGACCGCCAGCGAGAGTACCTCCTGGACATGATCCCACCCCGCTCCATCCCC 
CAGTCAGCCACGTGGAAATAGTGCGAGCCAGGCCCCGTGGAAGACGGGCTCCCTCCTC 


CCCCACCTGGCCCCTGGTCTAGAAGGACCCACTGCACCACCCTCCGCTGGCTCGGGAA 


GACACCGTG 




ORF Start: ATG at 106 


ORF Stop: TAG at 1933 




SEQIDNO: 124 


609 aa |MW at 65295.8kD 


NOV38a, 

CG5 853 8-01 Protein Sequence 


MTEEACRTRSQKRALERDPTEDDVESKKIKMERGLLASDLNTDGDMRVTPEPGAGPTQ 
GLLRATEATAMAMGRGEGLVGDGPVDMRTSHSDMKSERRPPSPDVIVLSDNEQPSSPR 
VKGLTTVALKETSTEALMKSSPEERERMIKQLKEELRLEEAKLVLLKKLRQSQIQKEA 
TAQKPTGSVGSTVTTPPPLVRGTQNIPAGKPSLQTSSARMPGSVIPPPLVRGGQQASS 
KLGPQASSQWMPPLVRGAQQIHSIRQHSSTGPPPLLLAPRASVPSVQIQGQRIIQQG 
LI RVANVPNTSLLVNI PQPTPASLKGTTATSAQANSTPTSVASWTSAESPASRQAAA 
KLALRKQLEKTLLE I PPPKPPAPEMNFLPSAAMNEFIYLVGLEEWQNLLETQAGRMS 
AATVLSREPYMCACCKTDFTCRWREEKSGAIMCENCMTTNQKKALKVEHTSRLKAAFV 
KALQQEQEIEQRLLQQGTAPAQAKAEPTAAPHPVLKQASSQLSRGSATTPRGVLHTFS 
PSPKLQNSASATALVSRTGRHSERTVSAGKGSATSNWKKTPLSTGGTLAFVSPSLAVH 
KSSSAVDRQREYLLDMIPPRSI PQSATWK 



Further analysis of the NOV38a protein yielded the following properties shown in 
Table 38B. 



Table 38B. Protein Sequence Properties NOV38a 


PSort 

analysis: 


0.4404 probability located in mitochondrial matrix space; 0.3000 probability 
located in microbody (peroxisome); 0.1257 probability located in 
mitochondrial inner membrane; 0.1257 probability located in mitochondrial 
intermembrane space 


SignalP 
analysis: 


No Known Signal Sequence Predicted 



A search of the NOV38a protein against the Geneseq database, a proprietary 
database that contains sequences published in patents and patent publication, yielded 
several homologous proteins shown in Table 38C. 
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Table 38C. Geneseq Results for NOV38a 


Geneseq 
Identifier 


Protein/Organism/Length [Patent 
#, Date] 


NOV38a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Region 


Expect 
Value 


AAM00991 


Human bone marrow protein, SEQ 
ID NO: 492 - Homo sapiens, 502 
aa. [WO200153453-A2, 26-JUL- 
2001] 


1..471 
4..473 


217/504 (43%) 
290/504 (57%) 


2e-87 


AAM00944 


Human bone marrow protein, SEQ 
ID NO: 420 - Homo sapiens, 546 
aa. [WO200153453-A2, 26-JUL- 
2001] 


1..471 
48..517 


217/504(43%) 
290/504 (57%) 


2e-87 


AAM00831 


Human bone marrow protein, SEQ 
ID NO: 194 - Homo sapiens, 266 
aa. [WO200153453-A2, 26-JUL- 
2001] 


1..197 
47..262 


84/217(38%) 
110/217(49%) 


le-23 


AAM85818 


Human immune/haematopoietic 
antigen SEQ ID NO: 1341 1 - Homo 
sapiens, 84 aa. [WO200157182-A2, 
09-AUG-2001] 


41 7..47 1 
1..55 


41/55 (74%) 
49/55 (88%) 


7e-19 



In a BLAST search of public sequence databases, the NOV38a protein was found to 
have homology to the proteins shown in the BLASTP data in Table 38D. 



Table 38D. Public BLASTP Results for NOV38a 



Protein 
Accession 
Number 



Protein/Organism/Length 



NOV38a 
Residues/ 

Match 
Residues 



Identities/ 
Similarities for 
the Matched 
Portion 



Expect 
Value 



No Significant Matches Found 



PFam analysis predicts that the NOV38a protein contains the domains shown in the 
Table 38E. 



Table 38E. Domain Analysis of NOV38a 


Pfam Domain 


NOV38a Match Region 


Identities/ 
Similarities 
for the Matched Region 


Expect Value 


GATA: domain 1 of 1 


414..453 


12/43 (28%) 
17/43 (40%) 


1.1 
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Example 39. 

The NOV39 clone was analyzed, and the nucleotide and predicted polypeptide 
sequences are shown in Table 39A. 



Table 39 A. NOV39 Sequence Analysis 




SEQIDNO:125 1 1421 bp 


NOV39a 

CG59371-01 DNA Sequence 


ACCATTTCAGAGATGTCTTCCAGAAGTACCAAAGATTTAATTAAAAGTAAGTGGGGAT 
CGAAGCCTAGTAACTCCAAATCCGAAACTACATTAGAAAAATTAAAGGGAGAAATTGC 
ACACTTAAAGACATCAGTGGATGAAATCACAAGTGGGAAAGGAAAGCTGACTGATAAA 
GAGAGACACAGACTTTTGGAGAAAATTCGAGTCCTTGAGGCTGAGAAGGAGAAGAATG 
CTTATCAACTCACAGAGAAGGACAAAGAAATACAGCGACTGAGAGACCAACTGAAGGC 
CAGATATAGTACTACCACATTGCTTGAACAGCTGGAAGAGACAACGAGAGAAGGAGAA 
AGGAGGGAGCAGGTGTTGAAAGCCTTATCTGAAGAGAAAGACGTATTGAAACAACAGT 
TGTCTGCTGCAACCTCACGAATTGCTGAACTTGAAAGCAAAACCAATACACTCCGTTT 
ATCACAGACTGTGGCTCCAAACTGCTTCAACTCATCAATAAATAATATTCATGAAATG 
GAAATACAGCTGAAAGATGCTCTGGAGAAAAATCAGCAGTGGCTCGTGTATGATCAGC 
AGCGGGAAGTCTATGTAAAAGGACTTTTAGCAAAGATCTTTGAGTTGGAAAAGAAAAC 
GGAAACAGCTGCTCATTCACTCCCACAGCAGACAAAAAAGCCTGAATCAGAAGGTTAT 
CTTCAAGAAGAGAAGCAGAAATGTTACAACGATCTCTTGGCAAGTGCAAAAAAAGATC 
TTGAGGTTGAACGACAAACCATAACTCAGCTGAGTTTTGAACTGAGTGAATTTCGAAG 
AAAATATGAAGAAACCCAAAAAGAAGTTCACAATTTAAATCAGCTGTTGTATTCACAA 
AGAAGGGCAGATGTGCAACATCTGGAAGATGATAGGCATAAAACAGAGAAGATACAAA 
AACTCAGGGAAGAGAATGATATTGCTAGGGGAAAACTTGAAGAAGAGAAGAAGAGATC 
CGAAGAGCTCTTATCTCAGGTCCAGTCTCTTTACACATCTCTGCTAAAGCAGCAAGAA 
GAACAAACAAGGGTAGCTCTGTTGGAACAACAGATGCAGGCATGTACTTTAGACTTTG 
AAAATGAAAAACTCGACCGTCAACATGTGCAGCATCAATTGCATGTAATTCTTAAGGA 

TTTGCCATCACAGAGCCATTAGTCACTTTCCAAGGAGAGACTGAAAACAGAGAAAAAG 
TTGCCGCCTCACCAAAAAGTCCCACTGCTGCACTCAATGGAAGCCTGGTGGAATGTCC 




TACTGTTCAAAGTAGCAAAATAAGTATTT 




ORF Start: ATG at 1 3 


ORE Stop: TAG at 1405 




SEQ ID NO: 126 


464 aa |MW at 54045.6kD 


NOV39a, 

CG59371-01 Protein Sequence 


MSSRSTKDLIKSKWGSKESNSKSETTLEKLKGEIAHLKTSVDEITSGKGKLTDKERHR 
LLEKIRVLEAEKEKNAYQLTEKDKEIQRLRDQLKARYSTTTLLEQLEETTREGERREQ 
VLKALSEEKDVLKQQLSAATSRIAELESKTNTLRLSQTVAPNCFNSSINNIHEMEIQL 
KDALEKNQQWLVYDQQREVYVKGLLAKIFELEKKTETAAHSLPQQTKKPESEGYLQEE 
KQKCYNDLLASAKKDLEVERQTITQLSFELSEFRRKYEETQKEVHNLNQLLYSQRRAD 
VQHLEDDRHKTEKIQKLREENDIARGKLEEEKKRSEELLSQVQSLYTSLLKQQEEQTR 
VALLEQQMQACTLDFENEKLDRQHVQHQLHVILKELRKARKNITQLESLKQLHEFAIT 
EPI ' rt iGETEI "iASPKSPTAALNGSLVECPKCNIQYPATEHRDLLVHVEYCSK 



Further analysis of the NOV39a protein yielded the following properties shown in 
Table 39B. 



Table 39B. Protein Sequence Properties NOV39a 


PSort 
analysis: 


0.4500 probability located in cytoplasm; 0.3000 probability located in 
microbody (peroxisome); 0.1000 probability located in mitochondrial matrix 
space; 0.1000 probability located in Iysosome (lumen) 


SignalP 
analysis: 


No Known Signal Sequence Predicted 
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A search of the NOV39a protein against the Geneseq database, a proprietary 
database that contains sequences published in patents and patent publication, yielded 
several homologous proteins shown in Table 39C. 



Table 39C. Geneseq Results for NOV39a 


Geneseq 
Identifier 


Protein/Organism/Length [Patent 
#, Date] 


NOV39a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Region 


Expect 
Value 


AAB92925 


Human protein sequence SEQ ID 
NO: 1 1576 - Homo sapiens, 231 aa. 
[EP1074617-A2, 07-FEB-2001] 


170..392 
1..223 


222/223 (99%) 
222/223 (99%) 


e-122 


AAG75490 


Human colon cancer antigen 
protein SEQ ID NO:6254 - Homo 
sapiens, 165 aa. [WO200 122920- 
A2, 05-APR-2001J 


1..67 
99..165 


64/67 (95%) 
64/67 (95%) 


le-28 


AAM78520 


Human protein SEQ ID NO 1 1 82 - 
Homo sapiens, 990 aa. 
[WO200 1571 90- A2, 09-AUG- 
2001] 


6..394 
515..929 


96/421 (22%) 
182/421 (42%) 


3e-12 


AAM41000 


Human polypeptide SEQ ID NO 
5931 - Homo sapiens, 1988 aa. 
[WO200153312-A1, 26-JUL-2001] 


70..420 
852.. 1203 


90/384 (23%) 
161/384 (41%) 


3e-12 


AAM40999 


Human polypeptide SEQ ID NO 
5930 - Homo sapiens, 1988 aa. 
[WO200153312-A1, 26-JUL-2001] 


70..420 
852.. 1203 


90/384(23%) 
161/384(41%) 


3e-12 



In a BLAST search of public sequence databases, the NOV39a protein was found to 
have homology to the proteins shown in the BLASTP data in Table 39D. 



Table 39D. Public BLASTP Results for NOV39a 


Protein 
Accession 
Number 


Protein/Organism/Length 


NOV39a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Portion 


Expect 
Value 


Q96H32 


SIMILAR TO RIKEN CDNA 
1200008012 GENE - Homo 
sapiens (Human), 464 aa. 


1..464 
1..464 


458/464 (98%) 
458/464 (98%) 


0.0 


Q9DBZ8 


1 2000080 12RIK PROTEIN - 
Mus musculus (Mouse), 462 aa. 


1..464 
1..462 


348/464 (75%) 
401/464 (86%) 


0.0 
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Q9NVS7 


CDNA FLJ10540 FIS, CLONE 
NT2RP2001245 - Homo sapiens 
(Human), 231 aa. 


170..392 
1..223 


222/223 (99%) 
222/223 (99%) 


e-122 


Q9CZP8 


2700032M20RIK PROTEIN - 
Mus musculus (Mouse), 1 89 aa. 


1..176 
1..176 


121/176 (68%) 
150/176(84%) 


3e-63 


Q9VJE5 


CLIP- 190 PROTEIN - Drosophila 
meianogaster (Fruit fly), 1690 aa. 


4..439 
675..1118 


108/461 (23%) 
203/461 (43%) 


2e-16 



PFam analysis predicts that the NOV39a protein contains the domains shown in the 
Table 39E. 



Table 39E. Domain Analysis of NOV39a 



Pfam Domain 



NOV39a Match Region 



Identities/ 
Similarities 
for the Matched Region 



Expect Value 



No Significant Matches Found 



Example 40. 

The NOV40 clone was analyzed, and the nucleotide and predicted polypeptide 
5 sequences are shown in Table 40A. 



Table 40A. NOV40 Sequence Analysis 




SEQ ID NO: 127 1 395 5 bp 


NOV40a, 




CG59346-01 DNA Sequence 


CTACAATAATGGTCGCTGTCCCCGGAATTCTCTCTACAGTGACTGCATTATTGAGGAG 
AAGACGGTGGTCCTGCAGAAAAAAGACAATGAGGGCTTTGGATTCGTGCTTCGAGGGG 

GTACCTGGAGTCCGTGGATGAAGGTGGGGTGGCGTGGCAAGCCGGACTAAGGACCGGG 
GACTTCTTGATTGAGGTTAACAATGAGAATGTTGTCAAAGTCGGCCACAGGCAGGTGG 
TGAACATGATCCGGCAGGGAGGGAATCACCTGGTCCTTAAGGTGGTCACGGTGACCAG 
GAATCTGGACCCCGACGACACCGCCAGGAAGAAAGCTCCCCCGCCTCCAAAGCGGGCA 
CCGACCACAGCCCTCACCCTGCGCTCCAAGTCCATGACCTCGGAGCTGGAGGAGCTCG 




GGCTCAGACATGAACGTGAGTGGCCGTACCTTGGGACCACGAGGGCGGGGGCCGACGG 




ATGCGAAGGCAGAAATCAATAGGAATAACAGAGGAAGAGCGGCAGTTTCTGGCTCCTC 
CAATGCTGAAGTTCACCAGAAGCCTGTCCATGCCGGACACCTCTGAGGACATCCCCCC 
TCCACCGCAGTCTGTGCCCCCGTCCCCACCACCACCTTCCCCAACCACTTACAACTGC 
CCCAAGTCCCCAACTCCAAGAGTCTACGGGACGATTAAGCCTGCGTTCAATCAGAATT 
CTGCCGCCAAGGTGTCCCCCGCCACCAGGTCCGACACCGTGGCCACCATGATGAGGGA 
GAAGGGGATGTACTTCAGGAGAGAGCTGGACCGCTACTCCTTGGACTCTGAAGACCTC 
TACAGTCGGAATGCCGGCCCGCAAGCCAACTTCCGCAACAAGAGAGGCCAGATGCCAG 
AAAACCCATACTCAGAGGTGGGGAAGATCGCCAGCAAAGCCGTCTACGTCCCCGCCAA 
GCCCGCCAGGCGGAAGGGGATGCTGGTGAAGCAGTCCAACGTGGAGGACAGCCCCGAG 
AAGACGTGCTCCATCCCTATCCCGACCATCATCGTGAAGGAGCCGTCCACCAGCAGCA 
GCGGCAAGAGCAGCCAGGGCAGCAGCATGGAGATCGACCCCCAGGCCCCGGAGCCACC 
GAGCCAGCTGCGGCCTGACGAAAGCCTGACCGTCAGCAGCCCCTTTGCCGCCGCCATC 
GCCGGAGCCGTCCGCGACCGTGAGAAGCGGCrGGAAGCCAGGAGGAACTCCCCGGCCT 
TCCTCTCCACAGACCTGGGGGATGAGGATGTGGGCCTGGGGCCACCCGCCCCCAGGAC 
GCGGCCCTCCATGTTCCCCGAGGAGGGGGATTTTGCTGACGAGGACAGCGCTGAGCAG 
CTGTCATCCCCCATGCCGAGTGCCACGCCCAGGGAGCCCGAAAACCATTTCGTGGGTG 
GCGCCGAGGCCAGTGCTCCGGGTGAGGCTGGGAGGCCGCTGAATTCCACGTCCAAAGC 
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CCAGGGGCCCGAGAGCAGCCCAGCAGTGCCCTCCGCGAGCAGCGGCACAGCCGGCCCC 
GGGAATTATGTCCACCCACTCACAGGGCGGCTGCTTGATCCCAGCTCCCCGCTGGCCC 
TGGCACTCTCCGCAAGGGACCGAGCCATGAAGGAGTCTCAACAGGGACCCAAAGGGGA 
GGCCCCCAAGGCCGACCTCAACAAACCTCTTTACATTGATACCAAAATGCGGCCCAGC 
CTGGATGCCGGCTTCCCTACGGTCACCAGGCAGAACACCCGGGGACCCCTGAGGCGGC 
AGGAGACGGAGAACAAGTACGAGACCGACCTGGGCCGAGACCGGAAAGGCGATGACAA 
GAAGAACATGCTGATCGACATCATGGACACGTCCCAGCAGAAGTCGGCTGGCCTGCTG 
ATGGTGCACACCGTGGACGCCACTAAGCTGGACAACGCCCTGCAGGAAGAGGACGAGA 
AGGCAGAGGTGGAGATGAAGCCAGACAGCTCGCCGTCCGAGGTGCCAGAAGGTGTTTC 
CGAAACCGAAGGTGCTTTACAGATCTCCGCTGCCCCCGAGCCCACCACCGTGCCCGGC 
AGAACCATCGTCGCGGTGGGCTCCATGGAAGAGGCGGTGATTTTGCCATTCCGCATCC 
CTCCTCCCCCTCTGGCATCCGTGGACTTGGATGAGGATTTTATTTTTACAGAGCCATT 
GCCTCCTCCCCTGGAATTTGCAAATAGTTTTGATATCCCCGATGACCGGGCAGCTTCT 
GTCCCGGCTCTCTCAGACTTAGTGAAGCAGAAGAAAAGCGACACCCCTCAGTCCCCTT 
CGTTGAACTCCAGCCAACCAACCAACTCTGCAGACAGCAAGAAGCCAGCCAGTCTTTC 
AAACTGTCTGCCTGCCTCATTCCTGCCACCCCCTGAAAGCTTTGACGCCGTCGCCGAC 
TCTGGGATCGAGGAGGTGGACAGCCGGAGTAGCAGCGACCACCACCTCGAGACGACCA 
GCACTATCTCCACCGTGTCTAGCATCTCCACCCTGTCTTCCGAAGGTGGAGAGAATGT 
GGACACCTGCACAGTCTATGCAGATGGGCAAGCATTTATGGTTGACAAACCCCCAGTA 
CCTCCTAAGCCAAAAATGAAGCCCATCATTCACAAAAGCAATGCACTTTATCAAGACG 
CGCTCGTGGAAGAAGATGTAGATAGCTTTGTTATCCCCCCGCCCGCTCCCCCGCCCCC 
GCCGGGCAGTGCCCAGCCTGGGATGGCCAAGGTTCTCCAGCCAAGGACCTCCAAGTTG 
TGGGGCGACGTCACAGAGATCAAAAGCCCGATTCTCTCAGGCCCAAAGGCAAACGTTA 
TTAGTGAATTGAACTCTATCCTACAGCAAATGAACCGAGAGAAATTGGCAAAGCCGGG 
GGAAGGACTGGATTCACCAATGGGAGCCAAGTCCGCCAGCCTCGCTCCAAGAAGCCCG 

CCGGCACCTCCCAGCCCATCACCCTGCAGAGCCGGCCCCCCGACTATGAAAGCAGGAC 
CTCAGGAACAAGACGTGCCCCAAGCCCTGTGGTCTCGCCAACAGAGATGAACAAAGAG 
ACCCTGCCCGCCCCCCTGTCTGCTGCCACCGCCTCTCCTTCTCCCGCTCTCTCAGATG 
TCTTTAGCCTTCCAAGCCAGCCCCCTTCTGGGGATCTATTTGGCTTGAACCCAGCGGG 
ACGCAGTAGGTCGCCATCCCCCTCGATACTGCAACAGCCAATCTCAAATAAGCCTTTT 
ACAACTAAACCTGTCCACCTGTGGACTAAACCAGATGTGGCCGATTGGCTGGAAAGTC 
TAAACTTGGGTGAACATAAAGAGGCCTTCATGGACAATGAGATCGATGGCAGTCACTT 
ACCAAACCTGCAGAAGGAGGACCTCATCGATCTTGGGGTAACTCGAGTCGGGCACAGA 
ATGAACATAGAAAGGGCTTTGAAACAGCTGCTGGACAGATAAGGACGGCTGCTCTCCA 
CCTCGCAGACTGCTCTTGTTATAAGTAGAGATGGGCTCGTGCTGAAACATCTGAATGC 


CAAGCGAAGTC 




ORF Start: ATG at 
67 


ORF Stop: TAA at 3868 




SEQ ID NO: 128 


1267aa |MW at 136108.7kD 


NOV40a, 

CG59346-01 Protein 
Sequence 


MMMNVPGGGAAAVMMTGYNNGRCPRNSLYSDCIIEEKTVVLQKKDNEGFGFVLRGAKA 
DTPIEEFTPTPAFPALQYLESVDEGGVAWQAGLRTGDFLIEVNNENWKVGHRQWNM 
IRQGGNHLVLKWTVTRNLDPDDTARKKAPPPPKRAPTTALTLRSKSMTSELEELDKP 
EEIVPASKPSRAAENMAVEPRVATIKQRPSSRCFPAGSDMNVSGRTLGPRGRGPTVPP 
RLSGLQSVYERQGIAVMTPTVPGSPKAPFLGIPRGTMRRQKSIGITEEERQFLAPPML 
KFTRSLSMPDTSEDIPPPPQSVPPSPPPPSPTTYNCPKSPTPRVYGTIKPAFNQNSAA 
KVSPATRSDTVATMMREKGMYFRRELDRYSLDSEDLYSRNAGPQANFRNKRGQMPENP 
YSEVGKIASKAVYVPAKPARRKGMLVKQSNVEDSPEKTCSIPIPTIIVXEPSTSSSGK 
SSQGSSMEIDPQAPEPPSQLRPDESLTVSSPFAAAIAGAVEDREKRLEARRNSPAFLS 
TDLGDEDVGLGPPAPRTRPSMFPEEGDFADEDSAEQLSSPMPSATPREPENHFVGGAE 
ASAPGEAGRPLNSTSKAQGPESSPAVPSASSGTAGPGNYVHPLTGRLLDPSSPLALAL 
SARDRAMKESQQGPKGEAPKADLNKPLYIDTKMRPSLDAGFPTVTRQNTRGPLRRQET 
ENKYETDLGRDRKGDDKKKMLIDIMDTSQQKSAGLLMVHTVDATKLDNALQEEDEKAE 
VEMKPDSSPSEVPEGVSETEGALQISAAPEPTTVPGRTIVAVGSMEEAVILPFRIPPP 
PLASVDLDEDFI FTEPL P PPLE FANS FD I PDDRAASVPALSDLVKQKKSDT PQS PSLN 
SSQPTNSADSKKPASLSNCLPASFLPPPESFDAVADSGIEEVDSRSSSDHHLETTSTI 
STVSSISTLSSEGGENVDTCTVYADGQAFMVDKPPVPPKPKMKPIIHKSNALYQDALV 
EEDVDSFVIPPPAPPPPPGSAQPGMAKVLQPRTSKLWGDVTEIKSPILSGPKANVISE 
LNS I LQQMNREKLAKPGEGLDSPMGAKSASLAPRSPE IMSTI SGTRSTTVTFTVRPGT 
SQPITLQSRPPDYESRTSGTRRAPSPWSPTEMNKETLPAPLSAATASPSPALSDVFS 
LPSQPPSGDLFGIJJPAGRSRSPSPSILQQPISNKPFTTKPVHLWTKPDVADWLESLNL 
GEHKEAFMDNEIDGSHLPNLQKEDLIDLGVTRVGHRMNIERALKQLLDR 



Further analysis of the NOV40a protein yielded the following properties shown in 
Table 40B. 
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Table 40B. Protein Sequence Properties NOV40a 


PSort 
analysis: 


0.4500 probability located in cytoplasm; 0.3000 probability located in 
microbody (peroxisome); 0.1000 probability located in mitochondrial matrix 
space; 0.1000 probability located in lysosome (lumen) 


SignalP 
analysis: 


No Known Signal Sequence Predicted 



A search of the NOV40a protein against the Geneseq database, a proprietary 
database that contains sequences published in patents and patent publication, yielded 
several homologous proteins shown in Table 40C. 



Table 40C. Geneseq Results for NOV40a 


Geneseq 
Identifier 


Protein/Organism/Length 
[Patent #, Date] 


NOV40a 
Residues/ 

Match 
Residues 


Similarities for 
the Matched 
Region 


Expect 
Value 


AAM79240 


Human protein SEQ ID NO 1 902 
- Homo sapiens, 1248 aa. 
[WO200157190-A2, 09-AUG- 
2001] 


14.. 1267 
1..1248 


1231/1271 (96%) 
1231/1271 (96%) 


0.0 


AAB31518 


Amino acid sequence of the rat 
Shank2 polypeptide - Rattus sp, 
1470 aa. [WO200078921-A2, 28- 
DEC-2000] 


30..1267 
240..1470 


1078/1255 (85%) 
1132/1255 (89%) 


0.0 


AAM80224 


Human protein SEQ ID NO 3870 
- Homo sapiens, 1 161 aa. 
[WO200157190-A2, 09-AUG- 
2001] 


172..1267 
82..1161 


1071/1103 (97%) 
1071/1103 (97%) 


0.0 


AAB31517 


Amino acid sequence of the rat 
Shank3a polypeptide - Rattus sp, 
1740 aa. [WO200078921-A2, 28- 
DEC-2000] 


18..1264 
550..1737 


496/1349 (36%) 
673/1349 (49%) 


0.0 


AAY83017 


Rat shank 3a - Rattus rattus, 1740 
aa. [WO20001 1204-A2, 02- 
MAR-2000] 


18.. 1264 
550..1737 


496/1349 (36%) 
673/1349 (49%) 


0.0 



In a BLAST search of public sequence databases, the NOV40a protein was found to 
have homology to the proteins shown in the BLASTP data in Table 40D. 
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Table 40D. Public BLASTP Results for NOV40a 


Protein 
Accession 
Number 


Protein/Organism/Length 


NOV40a 
Residues/ 
Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Portion 


Expect 
Value 


Q9UPX8 


KIAA1022 PROTEIN - Homo 
sapiens (Human), 1131 aa 
(fragment). 


124.. 1267 
1..1131 


1121/1154 (97%) 
1121/1154 (97%) 


0.0 


Q9QX93 


PROLINE RICH SYNAPSE 
ASSOCIATED PROTEIN 1 - 
Rattus norvegicus (Rat), 1252 aa. 


2.. 1267 
1..1252 


1103/1276 (86%) 
1158/1276 (90%) 


0.0 


070470 


CORTACTIN-BINDING 
PROTEIN 1 - Rattus norvegicus 
(Rat), 1252 aa. 


2..1267 
1..1252 


1102/1276 (86%) 
1158/1276 (90%) 


0.0 


Q9WUV9 


PROLINE RICH SYNAPSE 
ASSOCIATED PROTEIN 1 - 
Rattus norvegicus (Rat), 1259 aa. 


2..1267 
1..1259 


1103/1283 (85%) 
1158/1283 (89%) 


0.0 


Q9WUW0 


PROLINE RICH SYNAPSE 
ASSOCIATED PROTEIN 1 - 
Rattus norvegicus (Rat), 1250 aa. 


2..1267 
L.1250 


1095/1276 (85%) 
1151/1276 (89%) 


0.0 



PFam analysis predicts that the NOV40a protein contains the domains shown in the 
Table 40E. 



Table 40E. Domain Analysis of NOV40a 



Pfam Domain 


NOV40a Match Region 


Identities/ 
Similarities 
for the Matched Region 


Expect Value 


PDZ: domain 1 of 1 


38-131 


23/97 (24%) 
70/97 (72%) 


le-07 


SAM: domain 1 of 1 


1202..1265 


27/68 (40%) 
53/68 (78%) 


9.8e-22 



Example 41. 



The NOV41 clone was analyzed, and the nucleotide and predicted polypeptide 
sequences are shown in Table 41 A. 



221 



Table 41A. NOV41 Sequence Analysis 




SEQIDNO:129 |2069 bp 


NOV41a, 

CG57814-01 DNA Sequence 


GGACACTGACATGGACTGAAGGAGTAGAAAGCACTATAAATGTCTTTCCTTATCTGTG 


TGTACTCTTATCTCACTGTTCTATTTTTTCTCCTCATTTATATTAACTCTTTCTTACC 


TTTTTTTCTGAACTTCTAGGCCTTCTCTTTCCAGAACTGGTGGAAGACAAATGAAACG 


GCCAAGATGGTAAGAAACAAGCCGCATTTCTCCTTGGGGAGACTGATAATTTAAAAGG 


TTTGTTGTGTCAGAAACATTCCCAGCTTCATCACCAACCCTTTCCTTCCACCTCTGCC 


CACTGGAGACCACTTACATCCCGAAGCGGACGCGGCAGCTGAAGTCAGGAAACCATGC 


ATCACATTAGCAGGAGCCAACTGCAGACTTTAAACTCCGTTCAACATGTGGATGCGGC 


AGAGAAATGACCTGTCCAGACAAGCCGGGGCAGCTCATAAACTGGTTCATCTGCTCCC 
TGTGCGTCCCGCGGGTGCGTAAGCTCTGGAGCAGCCGGCGTCCAAGGACCCGGAGAAA 
CCTTCTGCTGGGCACTGCGTGTGCCATCTACTTGGGCTTCCTGGTGAGCCAGGTGGGG 
AGGGCCTCTCTCCAGCATGGACAGGCGGCTGAGAAGGGGCCACATCGCAGCCGCGACA 
CCGCCGAGCCATCCTTCCCTGAGATACCCCTGGATGGTACCCTGGCCCCTCCAGAGTC 
CCAGGGCAATGGGTCCACTCTGCAGCCCAATGTGGTGTACATTACCCTACGCTCCGAG 
CGCAGCAAGCCGGCCAATATCCGTGGCACCGTGAAGCCCAAGCGCAGGAAAAAGCATG 
CAGTGGCATCGGCTGCCCCAGGGCAGGAGGCTTTGGTCGGACCATCCCTTCAGCCGCA 
GGAAGCGGCAAGGGAAGCTGATGCTGTAGCACCTGGGTACGCTCAGGGAGCAAACCTG 
GTTAAGATTGGAGAGCGACCCTGGAGGTTGGTGCGGGGTCCGGGAGTGCGAGCCGGGG 
GCCCAGACTTCCTGCAGCCCAGCTCCAGGGAGAGCAACATTAGGATCTACAGCGAGAG 
CGCCCCCTCCTGGCTGAGCAAAGATGACATCCGAAGAATGCGACTCTTGGCGGACAGC 
GCAGTGGCAGGGCTCCGGCCTGTGTCCTCTAGGAGCGGAGCCCGTTTGCTGGTGCTGG 
AGGGGGGCGCACCTGGCGCTGTGCTCCGCTGTGGCCCTAGCCCCTGTGGGCTTCTCAA 
GCAGCCCTTGGACATGAGTGAGGTGTTTGCCTTCCACCTAGACAGGATCCTGGGGCTC 
AACAGGACCCTGCCGTCTGTGAGCAGGAAAGCAGAGTTCATCCAAGATGGCCGCCCAT 
GCCCCATCATTCTTTGGGATGCATCTTTATCTTCAGCAAGTAATGACACCCATTCTTC 
TGTTAAGCTCACCTGGGGAACTTATCAGCAGTTGCTGAAACAGAAATGCTGGCAGAAT 
GGCCGAGTACCCAAGCCTGAATCAGGTTGTACTGAAATACATCATCATGAGTGGTCCA 
AGATGGCACTCTTTGATTTTTTGTTACAGATTTATAATCGCTTAGATACAAATTGCTG 
TGGATTCAGACCTCGCAAGGAAGATGCCTGTGTACAGAATGGATTGAGGCCAAAATGT 
GATGACCAAGGTTCTGCGGCTCTAGCACACATTATCCAGCGAAAGCATGACCCAAGGC 
ATTTGGTTTTTATAGACAACAAGGGTTTCTTTGACAGGAGTGAAGATAACTTAAACTT 
CAAATTGTTAGAAGGCATCAAAGAGTTTCCAGCTTCTGCAGTTTCTGTTTTGAAGAGC 
CAGCACTTACGGCAGAAACTTCTTCAGTCTCTGTTTCTTGATAAAGTGTATTGGGAAA 
GTCAAGGAGGTAGACAAGGAATTGAAAAGCTTATCGATGTAATAGAACACAGAGCCAA 
AATTCTrATCACCTATATCAATGCACACGGGGTCAAAGTATTACCTATGAATGAATGA 


AAATCAAGCACATCAACCTCAAGCCCGTTTAGCAATGAG 




ORF Start: ATG at 413 


ORF Stop: TGA at 1970 




SEQ ID NO: 130 


519 aa |MW at 57552.4kD 


NOV41a, 

CG578 14-01 Protein Sequence 


MTCPDKPGQLINWFICSLCVPRVRKLWSSREPRTRRNLLLGTACAIYLGFLVSQVGRA 
SLQHGQAAEKGPHRSRDTAEPSFPEIPLDGTLAPPESQGNGSTLQPNWYITLRSERS 
KPANIRGTVKPKRRKKHAVASAAPGQEALVGPSLQPQEAAREADAVAPGYAQGANLVK 
IGERPWRLVRGPGVRAGGPDFLQPSSRESNIRIYSESAPSWLSKDDIRRMRLLADSAV 
AGLRPVSSRSGARLLVLEGGAPGAVLRCGPSPCGLLKQPLDMSEVFAFHLDRILGLNR 
TLPSVSRKAEFIQDGRPCPIILWDASLSSASNDTHSSVKLTWGTYQQLLKQKCWQNGR 
VPKPESGCTEIHHHEWSKMALFDFLLQIYNRLDTNCCGFRPRKEDACVQNGLRPKCDD 
QGSAALAHI I QRKHDPRHLVFIDNKGFFDRSEDNLNFKLLEGI KEFPASAVSVLKSQH 
LRQKLLQSLFLDKVYWESQGGRQGIEKLIDVIEHRAKILITYINAHGVKVLPMNE 




SEQ ID NO: 131 1 1740 bp 


NOV41b, 

CG578 14-02 DNA Sequence 


GGCAGCTGAAGTCAGGAAACCATGCATCACATTAGCAGGAGCCAACTGCAGACTTTAA 


ACTCCGTTCAACATGTGGATGCGGCAGAGAAATGACCTGTCCAGACAAGCCGGGGCAG 
CTCATAAACTGGTTCATCTGCTCCCTGTGCGTCCCGCGGGTGCGTAAGCTCTGGAGCA 
GCCGGCGTCCAAGGACCCGGAGAAACCTTCTGCTGGGCACTGCGTGTGCCATCTACTT 
GGGCTTCCTGGTGAGCCAGGTGGGGAGGGCCTCTCTCCAGCATGGACAGGCGGCTGAG 
AAGGGGCCACATCGCAGCCGCGACACCGCCGAGCCATCCTTCCCTGAGATACCCCTGG 
ATGGTACCCTGGCCCCTCCAGAGTCCCAGGGCAATGGGTCCACTCTGCAGCCCAATGT 
GGTGTACATTACCCTACGCTCCAAGCGCAGCAAGCCGGCCAATATCCGTGGCACCGTG 
AAGCCCAAGCGCAGGAAAAAGCATGCAGTGGCATCGGCTGCCCAAGGGCAGGAGGCTT 
TGGTCGGACCATCCCTTCAGCCGCAAGAAGCGGCAAGGGAAGCTGATGCTGTAGCACT 
GGGTACGCTCAGGAGCAAACTGGTTAAGATGGAGAGCGACCCTGAAGGTGGTGCGGGG 
TCGGGAGTGCGAGCCGGGGGCCCAGACTTCCTGCAGCCCAGCTCCAGGGAGAGCAACA 
TTAGGATCTACAGCGAGAGCGCCCCCTCCTGGCTGAGCAAAGATGACATCCGAAGAAT 
GCGACTCTTGGCGGACAGCGCAGTGGCAGGGCTCCGGCCTGTGTCCTCTAGGAGCGGA 




GCCCCTGTGGGCTTCTCAAGCAGCCCTTGGACATGAGTGAGGTGTTTGCCTTCCACCT 
AGACAGGATCCTGGGGCTCAACAGGACCCTGCCGTCTGTGAGCAGGAAAGCAGAGTTC 
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GTAATGACACCCATTCTTCTGTTAAGCTCACCTGGGGAACTTATCAGCAGTTGCTGAA 
ACAGAAATGCTGGCAGAATGGCCGAGTACCCAAGCCTGAATCAGGTTGTACTGAAATA 
CATCATCATGAGTGGTCCAAGATGGCACTCTTTGATTTTTTGTTACAGATTTATAATC 
GCTTAGATACAAATTGCTGTGGATTCAGACCTCGCAAGGAAGATGCCTGTGTACAGAA 
TGGATTGAGGCCAAAATGTGATGACCAAGGTTCTGCGGCTCTAGCACACATTATCCAG 
CGAAAGCATGACCCAAGGCATTTGGTTTTTATAGACAACAAGGGTTTCTTTGACAGGA 
GTGAAGATAACTTAAACTTCAAATTGTTAGAAGGCATCAAAGAGTTTCCAGCTTCTGC 
AGTTTCTGTTTTGAAGAGCCAGCACTTACGGCAGAAACTTCTTCAGTCTCTGTTTCTT 
GATAAAGTGTATTGGGAAAGTCAAGGAGGTAGACAAGGAATTGAAAAGCTTATCGATG 
TAATAGAACACAGAGCCAAAATTCTTATCACCTATATCAATGCACACGGGGTCAAAGT 
ATTACCTATGAATGAATGACAAAAGAATCTTCTGGCTAGGGTGTTAGATATATTTATG 


CATTTTTGGTTTTGTTTTTAAATCAAGCACATCAACCTCAAGCCCGTTTAGCAATGAG 




ORF Start: ATG at 90 


ORF Stop: TGA at 1641 




SEQ ID NO: 132 


517 aa |MW at 57179.9kD 


NOV41b, 

CG578 14-02 Protein Sequence 


MTCPDKPGQLINWFICSLCVPRVRKLWSSREPRTRRNLLLGTACAIYLGFLVSQVGRA 
SLQHGQAAEKGPHRSRDTAEPSFPEIPLDGTLAPPESQGNGSTLQPNWYITLRSKRS 
KPANIRGTVKPKRRKKHAVASAAQGQEALVGPSLQPQEAAREADAVALGTLRSKLVKM 
ESDPEGGAGSGVRAGGPDFLQPSSRESNIRIYSESAPSWLSKDDIRRMRLLADSAVAG 
LRPVSSRSGARLLVI^EGGAPGAVLRCGPSPCGLLKQPLDMSEVFAFHLDRILGLNRTL 
PSVSRKAEFIQDGRPCPIILWDASLSSASNDrHSSVKLTWGTYQQLLKQKCWQNGRVP 
KPESGCTEIHHHEWSKMALFDFLLQIYNRLDTNCCGFRPRKEDACVQNGLRPKCDDQG 
SAALAHIIQRKHDPRHLVFIDNKGFFDRSEDNLNFKLLEGIKEFPASAVSVLKSQHLR 
QKLLQSLFLDKVYWESQGGRQGIEKLIDVIEHRAKILITYINAHGVKVLPMNE 



Sequence comparison of the above protein sequences yields the following sequence 
relationships shown in Table 4 IB. 



Table 41B. Comparison of NOV41a against NOV41b. 


Protein Sequence 


NOV41a Residues/ 
Match Residues 


Identities/ 
Similarities for the Matched Region 


NOV41b 


1..519 
1..517 


493/519(94%) 
497/519(94%) 



Further analysis of the NOV41a protein yielded the following properties shown in 
Table 41 C. 



Table 41 C. Protein Sequence Properties NOV41a 


PSort 
analysis: 


0.5500 probability located in endoplasmic reticulum (membrane); 0.2404 
probability located in lysosome (lumen); 0.1000 probability located in 
endoplasmic reticulum (lumen); 0.1000 probability located in outside 


SignalP 
analysis: 


Likely cleavage site between residues 59 and 60 



A search of the NOV4 la protein against the Geneseq database, a proprietary 
database that contains sequences published in patents and patent publication, yielded 
several homologous proteins shown in Table 41D. 
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Table 41D. Geneseq Results for NOV41a 


Geneseq 
Identifier 


Protein/Organism/Length 
[Patent #, Date] 


NOV41a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Region 


Expect 
Value 


AAU 12276 


Human PRO6001 polypeptide 
sequence - Homo sapiens, 519 aa. 
[WO200140466-A2, 07-JUN- 
2001] 


1..519 
1-519 


518/519(99%) 
519/519(99%) 


0.0 


AAM39125 


Human polypeptide SEQ ID NO 
2270 - Homo sapiens, 519 aa. 
[WO200153312-A1, 26-JUL- 
2001] 


1..519 
1..519 


518/519(99%) 
519/519(99%) 


0.0 


AAM4091 1 


Human polypeptide SEQ ID NO 
5842 - Homo sapiens, 537 aa. 
[WQ200153312-A1, 26-JUL- 
2001] 


1..519 
19..537 


491/527 (93%) 
495/527 (93%) 


0.0 


AAM41373 


Human polypeptide SEQ ID NO 
6304 - Homo sapiens, 479 aa. 
[WO200153312-A1, 26-JUL- 
2001] 


212..512 
161. .471 


130/316(41%) 
180/316(56%) 


le-64 


AAM39587 


Human polypeptide SEQ ID NO 
2732 - Homo sapiens, 397 aa. 
[WO200153312-A1, 26-JUL- 
2001] 


212..512 
79..389 


130/316(41%) 
180/316(56%) 


le-64 



In a BLAST search of public sequence databases, the NOV41a protein was found to 
have homology to the proteins shown in the BLASTP data in Table 4 IE. 



Table 41E. Public BLASTP Results for NOV41a 


Protein 

Accession 
Number 


Protein/ Organism/Length 


NOV41a 
Residues/ 
Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Portion 


Expect 
Value 


Q9ET25 


HYPOTHETICAL BASIC 
PROTEIN 1-19 - Mus musculus 
(Mouse), 517 aa. 


1..519 
1..517 


431/519(83%) 
462/519(88%) 


0.0 


Q9NYZ0 


AD021 PROTEIN -Homo 
sapiens (Human), 246 aa. 


274..519 
1..246 


246/246(100%) 
246/246(100%) 


e-145 


Q9UFP1 


HYPOTHETICAL 49.5 KDA 
PROTEIN - Homo sapiens 
(Human), 448 aa (fragment). 


212..512 
130..440 


129/316(40%) 
179/316 (55%) 


2e-63 
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PFam analysis predicts that the NOV41 a protein contains the domains shown in the 
Table 41F. 



Table 41F. Domain Analysis of NOV41a 



Pfam Domain 


NOV41a Match 
Region 


Identities/ 
Similarities 
for the Matched 
Region 


Expect 
Value 


SQS PSY: domain 1 of 
1 


109..145 


8/37 (22%) 
29/37 (78%) 


9.9 



Example 42. 

The NOV42 clone was analyzed, and the nucleotide and predicted polypeptide 
sequences are shown in Table 42 A. 



Table 42A. NOV42 Sequence Analysis 




SEQIDNO:133 1 1294 bp 


NOV42a, 

CG59327-01 DNA Sequence 


GATGGCCACCTTGAACGTTGTACTGATGTTGATGCCCCTTGCCCAGTACATTTTCCAT 
TGTTTTATAACTGTGCTACTGAAGTACTTGTGTGCAGAGTATGGCTGGAGGAATGCCA 
TGTTGATCCAAGGCGCCGTTTCCTTAAACCTGTTTGTTTTTGGGACCCTCATGAGGCC 
CCTCCCTCCTGGGAAAAACCCAAATGACCCAGAAGAGAAAGATCTGCGCGTCCTGCCC 
GCGCACTCCACAGAGTCTGTAATGTCAAATGGACAGCAGGGAAGAATAGAAGAGAAGG 
ATGGCGGGTCTGGGAACGAGGAGACCCTCTGTGACCTGCAAGCCCAGGAGTGCAAGCC 
CAGGAGTGCCCCGATCAGGCCAGATCATGTGCGCTTTCCGGTTCTGAAGACGGTCAGC 
TGGCTCATTATGAGAGTCAAGAAGGGCTTCGAGGATTGGTACTCAGGCTATTTTGGGA 
CAGCCAGCCTATTTACAAArCGAATGTTTGTAGCCTTTGTTTTCTGGGCTTCATTTGC 
ATACAGCAGCTTTGTCATCTCCTTTATTCATCTCCCAGAAATCGTCAATTTGTATAAC 
TTATTGGAGCAAACGAAGGTTTTCCCTCTGACTTCAATTATAGCAATAGTTCACATTG 
TTGGAAAAGTGATCCTGGGCGTCATAGCTGACTTACCTTGCATCAGTGTTTGGAATGT 
CTTCCTGTTGGCCAGCTTCGTTCTTGTCCTCAGTATTTTTGTTTTGCTGCCTTTGATG 
CATATGTACGCTGGCCTGGTGGTCATCTGCACACTGACAGGGTTTTCCAGCGGTTATT 
TCTCCCTAATGCCCATAGTGACTGAAGACTTGGTTGGCATTGAACATTTGGCCAATGC 
CTACGGCATCATCATCTGTGCTAATGGCATCTCTGCGTTGTTGGGACCACCTTTTGCA 
GGTAAACTGTCTGAGGTTTTAAGAGTTCATAGTGCATATAGATACGGTGTGTTAGCTC 
TGCGAGGAGACGGATGCAGAGCACTCACATCTTCTCTTATACATAGAAGTGAAATGGC 
TTTCTAAAGTTAGATCACTGGCCAGAGTTTTTGAGTCACAAGAGCTATTCCACAGATT 


TCCTTTAGAAAAACAATCACCACTGGCAGTCCACTTCAGTGACACAGAATGGGTTGCA 


GAACTTGCTTACTTATGTGACACATTCAACCTGCTCAATGAACTCAArCTGTCACTTC 


AGGGGAGAAGGACAACTGTGTTCAAGTCAGCAAATAAAGTGGCTACArTCAAAACCAA 


ACTGGAATTACGGGGGTG 




ORF Start: ATG at 2 


ORF Stop: TAA at 1049 




SEQIDNO: 134 


349 aa |MW at 38694.2kD 


NOV42a, 

CG5 9327-01 Protein Sequence 


MATLNWLMLMPLAQY I FHCF ITVLLKYLCAEYGWRNAMLI QGAVSLNIjFVFGTLMRP 
LPPGKNPNDPEEKDLRVLPAHSTESVMSNGQQGRIEEKDGGSGNEETLCDLQAQECKP 
RSAPIRPDHVRFPVLKTVSWLIMRVKKGFEDVfYSGYFGTASLFTNRMFVAFVFWASFA 
YSSFVISFIHLPEIVNLYNLLEQTKVFPLTSIIAIVHIVGKVILGVIADLPCISVWNV 
FLIASFVLVLSIFVLLPLMHMYAGLWICTLTGFSSGYFSLMPIVTEDLVGIEHLANA 
YGI 1 1 CANGISALLGPPFAGKLSEVLRVHSAYRYGVLALRGDGCRALTSSLIHRSEMA 
F 



Further analysis of the NOV42a protein yielded the following properties shown in 
Table 42B. 
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Table 42B. Protein Sequence Properties NOV42a 


PSort 
analysis: 


0.6850 probability located in endoplasmic reticulum (membrane); 0.6400 
probability located in plasma membrane; 0.4600 probability located in Golgi 
body; 0.1000 probability located in endoplasmic reticulum (lumen) 


SignalP 
analysis: 


Likely cleavage site between residues 32 and 33 



A search of the NOV42a protein against the Geneseq database, a proprietary 
database that contains sequences published in patents and patent publication, yielded 
several homologous proteins shown in Table 42C. 



Table 42C. Geneseq Results for NOV42a 


Geneseq 
Identifier 


Protein/Organism/Length [Patent 
#, Date] 


NOV42a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Region 


Expect 
Value 


AAO07132 


Human polypeptide SEQ ID NO 
21024 - Homo sapiens, 107 aa. 
[WO200164835-A2, 07-SEP-2001] 


257.33 1 
5..81 


49/77 (63%) 
58/77 (74%) 


6e-20 


AAY31642 


Human transport-associated protein- 
4 (TRANP-4) - Homo sapiens, 465 
aa. [W09941373-A2, 19-AUG- 
1999] 


157..342 
221. .401 


54/197 (27%) 
86/197 (43%) 


le-07 


AAY02737 


Human secreted protein encoded by 
gene 88 clone HKAFB88 - Homo 
sapiens, 229 aa. [WO9902546-A1, 
21-JAN-1999] 


198..342 
24.. 164 


41/147 (27%) 
65/147(43%) 


9e-06 



In a BLAST search of public sequence databases, the NOV42a protein was found to 
have homology to the proteins shown in the BLASTP data in Table 42D. 



Table 42D. Public BLASTP Results for NOV42a 


Protein 
Accession 
Number 


Protein/Organism/Length 


NOV42a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Portion 


Expect 
Value 


Q96NI7 


CDNA FLJ30794 FIS, CLONE 
FEBRA2001093, WEAKLY 
SIMILAR TO 
MONOCARBOXYLATE 


22..331 
1.310 


250/312(80%) 
266/312(85%) 


e-138 
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{^Huinsn), 336 bsl. 








Q9D1K0 


1 1 10004H10RIK PROTEIN - Mus 
musculus (Mouse), 336 aa. 


22..331 
1..310 


220/312(70%) 
250/312 (79%) 


e-119 


AAL39716 


LD30953P - Drosophila 
melanogaster (Fruit fly), 894 aa. 


142..314 
665..843 


50/180(27%) 
89/180(48%) 


2e-15 


Q9V9B3 


CG3409 PROTEIN - Drosophila 
melanogaster (Fruit fly), 800 aa. 


142.314 
571. .749 


50/180(27%) 
89/180(48%) 


2e-15 


Q9W0L6 


CGI 3907 PROTEIN - Drosophila 
melanogaster (Fruit fly), 816 aa. 


157.331 
565.J38 


55/178 (30%) 
91/178 (50%) 


le-14 



PFam analysis predicts that the NOV42a protein contains the domains shown in the 
Table 42E. 



Table 42E. Domain Analysis of NOV42a 


Pfam Domain 


NOV42a Match 
Region 


Identities/ 
Similarities 
for the Matched 
Region 


Expect 
Value 


oxidored q3: domain 1 of 
1 


197.314 


25/177(14%) 
73/177(41%) 


9.1 



Example 43. 

The NOV43 clone was analyzed, and the nucleotide and predicted polypeptide 
sequences are shown in Table 43 A. 



Table 43A. NOV43 Sequence Analysis 




SEQIDNO:135 |455 bp 


NOV43a, 

CG59494-01 DNA Sequence 


TAGAA.CTGTGTTGAGCTCTCACCCATCACGATGAGCAACAAATTCTTGGGAACCTGGA 
AGCTGGTCTCCAGTGAAAACTTTGAGGATTACATGAAAGAACTGGGAGTGAATTTCGC 
AGCCCGGAACATGGCAGGGTTAGTGAAACCGACAGTAACTATTAGTGTTGATGGGAAA 
ATGATGACCATAAGAACAGAAAGTTCTTTCCAGGACACTAAGATCTCCTTCAAGCTGG 
GGGAAGAATTTGATGAAACTACAGCAGACAACCGGAAAGTAAAGAGCACCATAACATT 
AGAGAATGGCTCAATGATTCACGTCCAAAAArGGCTTGGCAAAGAGACAACAATCAAA 
AGAAAAATTGTGGATGAAAAAATGGTAGTGGAATGTAAAATGAATAATATTGTCAGCA 
CCAGAATCTACGAAAAGGTCTGAAAAATCATTTCTTCATTGAAGTGGCT 




ORF Start: ATG at 31 


ORF Stop: TGA at 427 




SEQ ID NO: 136 


132 aa ImW at 15096.4kD 


NOV43a, 

CG59494-01 Protein Sequence 


MSNKFLGTWKLVSSENFEDYMKELGVNFAARNMAGLVKPTVTISVDGKMMTIRTESSF 
QDTKI SFKLGEEFDETTADNRKVKST I TLENGSMI HVQKWLGKETTI KRKIVDEKMW 
ECKMNNIVSTRIYEKV 



Further analysis of the NOV43a protein yielded the following properties shown in 
Table 43B. 
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Table 43B. Protein Sequence Properties NOV43a 


PSort 
analysis: 


0.6500 probability located in cytoplasm; 0.1000 probability located in 
mitochondrial matrix space; 0.1000 probability located in lysosome (lumen); 
0.0053 probability located in microbody (peroxisome) 


SignalP 
analysis: 


No Known Signal Sequence Predicted 



A search of the NOV43a protein against the Geneseq database, a proprietary 
database that contains sequences published in patents and patent publication, yielded 
several homologous proteins shown in Table 43C. 



Table 43C. Geneseq Results for NOV43a 


Geneseq 
Identifier 


Protein/Organism/Length 
[Patent #, Date] 


NOV43a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Region 


Expect 
Value 


AAW40227 


Human myelin P2 protein - Homo 
sapiens, 136 aa. [WO9803647-A2, 
29-JAN-1998] 


1..130 
1..130 


89/130(68%) 
107/130(81%) 


2e-47 


AAW40228 


Bovine myelin P2 protein - Bos 
taurus, 136 aa. [WO9803647-A2, 
29-JAN-1998] 


1..130 
1..130 


89/130 (68%) 
106/130 (81%) 


9e-47 


AAY90320 


Human AFABP protein sequence - 
Homo sapiens, 132 aa. 
[WO200047734-A1, 17-AUG- 
2000] 


1..131 
1 ..131 


84/131 (64%) 
110/131 (83%) 


3e-46 


AAY90319 


Mouse AFABP protein sequence - 
Mus sp, 132 aa. [WO200047734- 
Al, 17-AUG-2000] 


1..131 
1..131 


83/131 (63%) 
108/131 (82%) 


7e-45 


AAG66576 


Mouse MDGI polypeptide - Mus 
sp, 133 aa. [US6232291-B1, 15- 
MAY-2001] 


1..131 
1.-131 


73/131 (55%) 
103/131 (77%) 


6e-40 



In a BLAST search of public sequence databases, the NOV43a protein was found to 
have homology to the proteins shown in the BLASTP data in Table 43D. 
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Table 43D. Public BLASTP Results for NOV43a 


Protein 
Accession 
Number 


Protein/Organism/Length 


NOV43a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Portion 


Expect 
Value 


MPRB2 


myelin P2 protein - rabbit, 132 aa. 


1..132 
1-132 


95/132 (71%) 
109/132 (81%) 


3e-49 


P02691 


Myelin P2 protein - Oryctolagus 
cuniculus (Rabbit), 131 aa. 


2..132 
1.-131 


94/131 (71%) 
108/131 (81%) 


le-48 


MPHU2 


myelin P2 protein [validated] - 
human, 1 32 aa. 


1..132 
1-132 


92/132(69%) 
109/132 (81%) 


3e-48 


Q90X56 


ADIPOCYTE FATTY ACID 
BINDING PROTEIN - Gallus 
gallus (Chicken), 132 aa. 


1.-131 
1-131 


86/131 (65%) 
113/131 (85%) 


le-47 


P02689 


Myelin P2 protein - Homo sapiens 
(Human), 131 aa. 


2..132 
1-131 


91/131 (69%) 
108/131 (81%) 


le-47 



PFam analysis predicts that the NOV43a protein contains the domains shown in the 
Table 43E. 



Table 43E. Domain Analysis of NOV43a 


Pfam Domain 


NOV43a Match 
Region 


Identities/ 
Similarities 
for the Matched 
Region 


Expect 
Value 


lipocalin: domain 1 of 
1 


4..132 


45/157(29%) 
113/157(72%) 


3.2e-36 



Example 44. 

The NOV44 clone was analyzed, and the nucleotide and predicted polypeptide 
sequences are shown in Table 44 A. 



Table 44A. NOV44 Sequence Analysis 




SEQIDNO:137 1 1561 bp 


NOV44a, 

CG59432-01 DNA Sequence 


AAGAATTGTAGCTCTCCACTGAATTGCAGGGGTTCTTGAATGTTGTCAACATTTGGAG 


GCAGTTGGAGGAGGGAGCTCTATTGATGAAAAATGGCTACATATTCAAAATTTCAGTG 


TATACCAGGAAGATAATTCAATTCAATCTCTGGCTTACCCAAAGAATCTTGGAGTTAC 


TGCCAATGAGGAAATCCCCAGGGTCTAATAAAAATATCTTTAGGAGTGAAGGAGTTAA 


CTGAGTGTGTAAGCTTTATCTTCTGTCCAATGGACTTGTGGTTTGCTTATAAAACTCT 


CCAGTAAATAATTGTTAGAGACCTGTCATTGATAGCAGTTGCTAGTTGCTGCCTTTTA 


AGAGCTCGTTGATTCCTCTGCAAGGTGGTGCAGCATCCTCTGTCCCTTCATTCATTTC 


AGATCTACTCAGGTCTCCCTGTAAACAGATCTCTCGGATCAATAAGCATGAATGACGA 


AGACTACAGCACCATCTATGACACAATCCAAAATGAGAGGACGTATGAGGTTCCAGAC 
CAGCCAGAAGAAAATGAAAGTCCCCATTATGATGATGTCCATGAGTACTTAAGGCCAG 
AAAATGATTTATATGCCACTCAGCTGAATACCCATGAGTATGATTTTGTGTCAGTCTA 
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TACCATTAAGGGTGAAGAGACCAGCTTGGCCTCTGTCCAGTCAGAAGACAGAGGCTAC 
CTCCTGCCTGATGAGATATACTCTGAACTCCAGGAGGCTCATCCAGGTGAGCCCCAGG 
AGGACAGGGGCATCTCAATGGAAGGGTTATATTCATCAACCCAGGACCAGCAACTCTG 
CGCAGCAGAACTCCAGGAGAATGGGAGTGTGATGAAGGAAGATCTGCCTTCTCCTTCA 
AGCTTCACCATTCAGCACAGTAAGGCCTTCTCTACCACCAAGTATTCCTGCTATTCTG 
ATGCTGAAGGTTTGGAAGAAAAGGAGGGAGCTCACATGAACCCTGAGATTTACCTCTT 
TGTGAAGGTAAGGTCTGCCTCTGACAGGCATACCCTGTTCATGCAGATATTATGGCTG 
GTGTTTTATTTTGCTCTGAATGACCAGGGAAAGATTCATAATGCCATGGTCCTTGGAT 
CTCAATACATATTCAGGAGTCGGAGGGACTAAATCAGTCATTAGAGTGTACTCAGCTC 
TTCACAAAATTAGAGGAATTGGAAGGTGCATTTAAAGCACGTATTTAATCACrGACTT 


TTACATACCATGGGCAAAGTATTTTTCAAAACGGTTCACATAAGTGAGCCATAACTGC 


TGCCCAAATCCTTGCCATTGTGGCTGACATTAAGTACATTTTTCTGTCTGGTTAAATT 


TCCTTTGTCGACATGTTTAAAAGTGAAACCAAAGCTTGTGAAAGAAAGACCTTCTTGT 


GCTTCTAAGGTCACAGATTTGTCAGATAGGTGGTCAATAAAGGCTATCTCTGTCACTA 


GCTTGCCCCTTTGGCACCAATATAACTAAAAATTTGATGAAGTCAAATGATTrCAGTA 


TV [ IAATACT1 TATCTAj t, \ 




ORF Start: ATG at 454 


ORF Stop: TAA at 1132 




SEQ ID NO: 138 


226 aa |MW at 26132.2kD 


NOV44a, 

CG59432-01 Protein Sequence 


MNDEDYSTIYDTIQNERTYEVPDQPEENESPHYDDVHEYLRPENDLYATQLNTHEYDF 
VSVYTIKGEETSLASVQSEDRGYLLPDEIYSELQEAHPGEPQEDRGISMEGLYSSTQD 
QQLCAAELQENGSVMKEDLPSPSSFTIQHSKAFSTTKYSCYSDAEGLEEKEGAHMNPE 
IYLFVKVRSASDRHTLFMQILWLVFYFALNDQGKIHNAMVLGSQYIFRSRRD 




SEQ ID NO: 139 |809bp 


NOV44b 

CG59432-02 DNA Sequence 


ATCCTCTGTCCCTTCATTCATTTCAGATCTACTCAGGTCTCCCTGTAAACAGATCTCT 


CGGATCAATAAGCATGAATGACGAAGACTACGGCACCATCTATGACACAATCCAAAAT 
GAGAGGACGTATGAGGTTCCAGACCAGCCAGAAGAAAATGAAAGTCCCCATTATGATG 
ATGTCCATGAGTACTTAAGGCCAGAAAATGATTTATATGCCACTCAGCTGAATACCCA 
TGAGTATGATTTTGTGTCAGTCTATACCATTAAGGGTGAAGAGACCAGCTTGGCCTCT 
GTCCAGTCAGAAGACAGAGGCTACCTCCTGCCTGATGAGATATACTCTGAACTCCAGG 

ATCAACCCAGGACCAGCAACTCTGCGCAGCAGAACTCCAGGAGAATGGGAGTGTGATG 
AAGGAAGATCTGCCTTCTCCTTCAAGCTTCACCATTCAGCACAGTAAGGCCTTCTCTA 
CCACCAAGTATTCCTGCTATTCTGATGCTGAAGGTTTGGAAGAAAAGGAGGGAGCTCA 
CATGAACCCTGAGATTTACCTCTTTGTGAAGGTAAGGTCTGCCTCTGACAGGCATACC 
CTGTTCATGCAGATATTATGGCTGGTGTTTTATTTTGCTCTGAATGACCAGGGAAAGA 
TTCATAATGCCATGGTCCTTGGATCTCAATACATATTCAGGAGTCGGAGGGACTAAAT 
CAGTCATTAGAGTGTACTCAGCTCTTCACAAAATTAGAGGAATTGGAAGGTGCAT 




ORF Start: ATG at 72 


ORF Stop: TAA at 750 




SEQ ID NO: 140 


226 aa |MW at 26102.2kD 


NOV44b, 

CG59432-02 Protein Sequence 


MNDEDYGTIYDTIQNERTYEVPDQPEENESPHYDDVHEYLRPENDLYATQLNTHEYDF 
VSVYT IKGEETSLASVQSEDRGYLLPDE I YSELQEAHPGEPQEDRG I SMEGLYSSTQD 
QQLCAAELQENGSVMKEDLPSPSSFTIQHSKAFSTTKYSCYSDAEGLEEKEGAHMNPE 
I YLFVKVRSASDRHTLFMQILWLVFYFALHDQGKI HNAMVLGSQYI FRSRRD 



Sequence comparison of the above protein sequences yields the following sequence 
relationships shown in Table 44B. 



Table 44B. Comparison of NOV44a against NOV44b. 


Protein Sequence 


NOV44a Residues/ 
Match Residues 


Identities/ 
Similarities for the Matched Region 


NOV44b 


1..226 
1..226 


225/226 (99%) 
225/226 (99%) 



Further analysis of the NOV44a protein yielded the following properties shown in 
Table 44C. 
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Table 44C. Protein Sequence Properties NOV44a 


PSort 
analysis: 


0.6500 probability located in cytoplasm; 0.1000 probability located in 
mitochondrial matrix space; 0.1000 probability located in lysosome (lumen); 
0.0000 probability located in endoplasmic reticulum (membrane) 


SignalP 
analysis: 


No Known Signal Sequence Predicted 



A search of the NOV44a protein against the Geneseq database, a proprietary 
database that contains sequences published in patents and patent publication, yielded 
several homologous proteins shown in Table 44D. 



Table 44D. Geneseq Results for NOV44a 






NOV44a 


Identities/ 




Geneseq 


Protein/Organism/Length 


Residues/ 


Similarities for 


Expect 


Identifier 


[Patent #, Date] 


Match 


the Matched 


Value 






Residues 


Region 




No Significant Matches Found 



In a BLAST search of public sequence databases, the NOV44a protein was found to 
have homology to the proteins shown in the BLASTP data in Table 44E. 



Table 44E. Public BLASTP Results for NOV44a 


Protein 
Accession 
Number 


Protein/Organism/Length 


NOV44a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Portion 


Expect 
Value 


Q96JT5 


CLIC5B - Homo sapiens (Human), 
410 aa. 


1..200 
1..202 


185/202 (91%) 
191/202 (93%) 


e-104 


Q9NPY9 


DJ447E2 1 .4 (SIMILAR TO 
BOVINE CHLORIDE CHANNEL 
PROTEIN (P64)) - Homo sapiens 
(Human), 1 80 aa (fragment). 


1..180 
1..180 


180/180(100%) 
180/180(100%) 


e-103 


A47104 


chloride channel 64K chain - 

bovine, 437 aa. 


1 ..197 

1 ..229 


104/231 (45%) 
133/231 (57%) 


Je-39 


P35526 


Chlorine channel protein p64 - Bos 
taurus (Bovine), 437 aa. 


1..197 
1..229 


103/231 (44%) 
131/231 (56%) 


le-38 



PFam analysis predicts that the NOV44a protein contains the domains shown in the 
Table 44F. 
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Table 44F. Domain Analysis of NOV44a 



Pfam Domain 



NOV44a Match Region 



Identities/ 
Similarities 
for the Matched Region 



Expect Value 



No Significant Matches Found 



Example 45. 

The NOV45 clone was analyzed, and the nucleotide and predicted polypeptide 
sequences are shown in Table 45A. 



Table 45A. NOV45 Sequence Analysis 




SEQIDNO:141 j 877 bp 


NOV45a, 

CG59394-01 DNA Sequence 


ACTTTGTCCTCTTGGGCTTCACACAGAATCCAAAGGaGCAGAAAGTACTTTTTGTTAT 
GTTCTTGCTCTTCTACATTTTGACCATGGTGGGCAACCTGCTCATTGTAGTGACCGTA 


TAGATATCATTTATTCTTCATCCATTTCCCCCAGATTGATTTCAGGCTTGTTCTTTGG 
GAATAATTCCATATCCTTCCAATCTTGCATGGCCCAGCTCTTTATCGAGCACATTTTC 
GGTGGGTCAGAGGTCTTTCTCCTGTTGGTGATGGCCTATGACTGCTATGTGGCCATCT 
GTAAGCCCTTGCATTATTTGGTTATCATGAGACAATGGGTGTGTGTTGTGCTGCTGGT 
AGTGTCCTGGGTTGGAGGATTTCTGCACTCAGTATTTCAACTTAGCATTATTTATGGG 
CTCCCATTCTGTGGCCCCAATGTCATTGATCATTTTTTCTGTGACATGTATCCCTTAT 
TGAAACTGGTCTGCACTGACACCCATGCTATTGGCCTCTTAGTGGTGGCCAATGGAGG 
ACTGGCTTGCACTATTGTGTTTCTGCTCTTACTCATCTCTTATGGTGTCATCTTGCAC 
TCTTTAAAGAACCTTAGTCAGAAAGGGAGGCAAAAAGCCCTCTCAACCTGCAGTTCCC 




TAGGACCTTCCCCATTGACAAATCAGTGAGTGTGTTTTATACAGTCATAACCCCAATG 
CTGAACCCCTTAATCTACACTCTGAGAAATTCTGAGATGACAAGTGCTATGAAGAAGC 
TTTAGAG 




ORF Start: TTT at 3 


ORP Stop: TAG at 873 




SEQIDNO: 142 


290 aa |MW at 32485. 7kD 


NOV45a, 

CG593 94-01 Protein Sequence 


FVLLGFTQNPKEQKVLFVMFIJ^YILTMVGNLJjIVVTVTVSETLGSPMYFFIjAGLSFI 
DIIYSSSISPRLISGLFFGNNSISFQSCMAQLFIEHIFGGSEVFLLLVMAYDCYVAIC 
KPLHYLVIMRQWVCWLLWSWVGGFLHSVFQLSIIYGLPFCGPNVIDHFFCDMYPLL 
KLVCTDTHAIGIiWANGGLACTIVFLLLLISYGVILHSLKNLSQKGRQJCALSTCSSH 
MTVWFFFVPCIFMYARPARTFPIDKSVSVFYTVITPMLNPLI YTLRNSEMTSAMKKL 



Further analysis of the NOV45a protein yielded the following properties shown in 
Table 45B. 



Table 45B. Protein Sequence Properties NOV45a 


PSort 
analysis: 


0.6400 probability located in plasma membrane; 0.4600 probability located in 
Golgi body; 0.3700 probability located in endoplasmic reticulum (membrane); 
0.1000 probability located in endoplasmic reticulum (lumen) 


SignalP 
analysis: 


Likely cleavage site between residues 42 and 43 



A search of the NOV45a protein against the Geneseq database, a proprietary 
database that contains sequences published in patents and patent publication, yielded 
several homologous proteins shown in Table 45C. 
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Table 45C. Geneseq Results for NOV45a 


Geneseq 
Identifier 


Protein/Organism/Length [Patent 
#, Date] 


NOV45a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Region 


Expect 
Value 


AAU24536 


Human olfactory receptor 
AOLFR21 - Homo sapiens, 299 aa. 
[WO200168805-A2, 20-SEP-2001] 


1..290 
10..299 


273/290 (94%) 
278/290 (95%) 


e-155 


AAG71950 


Human olfactory receptor 
polypeptide, SEQIDNO: 1631 - 
Homo sapiens, 299 aa. 
[WO200127158-A2, 19-APR-2001] 


1..290 
10..299 


273/290 (94%) 
278/290 (95%) 


e-155 


AAG72258 


Human olfactory receptor 
polypeptide, SEQ ID NO: 1939 - 
Homo sapiens, 262 aa. 
[WO200127158-A2, 19-APR-2001] 


33..290 
1..250 


234/258 (90%) 
240/258 (92%) 


e-131 


AAG72553 


Human OR-like polypeptide query 
sequence, SEQ ID NO: 2234 - 
Homo sapiens, 327 aa. 
[WO200127158-A2, 19-APR-2001] 


1..290 
10..299 


198/290 (68%) 
242/290 (83%) 


e-121 


AAG71909 


Human olfactory receptor 
polypeptide, SEQ ID NO: 1590 - 
Homo sapiens, 327 aa. 
[WO200127158-A2, 19-APR-2001] 


1..290 
10..299 


198/290 (68%) 
242/290 (83%) 


e-121 



In a BLAST search of public sequence databases, the NOV45a protein was found to 
have homology to the proteins shown in the BLASTP data in Table 45D. 



Table 45D. Public BLASTP Results for NOV45a 


Protein 
Accession 
Number 


Protein/Organism/Length 


NOV45a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Portion 


Expect 
Value 


Q9QW37 


OR1 8=ODORANT RECEPTOR 
- Rattus sp, 307 aa. 


1 ..290 
10..299 


192/290 (66%) 
237/290 (81%) 


e-115 


Q96R66 


OLFACTORY RECEPTOR - 
Homo sapiens (Human), 213 aa 
(fragment). 


57..269 
1..213 


198/213 (92%) 
202/213 (93%) 


e-111 


Q9R0K2 


ODORANT RECEPTOR 
MORI 8 - Mus musculus 
(Mouse), 308 aa. 


1 ..290 
10..299 


177/290 (61%) 
229/290 (78%) 


e-105 
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Q9R0K1 


ODORANT RECEPTOR A16 - 
Mus musculus (Mouse), 302 aa. 


1..290 
10..299 


171/290 (58%) 
226/290 (76%) 


e-102 


CAC 88333 


SEQUENCE 34 FROM PATENT 
WOO 164879 - Homo sapiens 
(Human), 309 aa. 


1..290 
10..299 


167/290 (57%) 
221/290 (75%) 


5e-99 



PFam analysis predicts that the NOV45a protein contains the domains shown in the 
Table 45E. 



Table 45E. Domain Analysis of NOV45a 


Pfam Domain 


NOV45a Match Region 


Identities/ 
Similarities 
for the Matched Region 


Expect Value 


7tm_l : domain 1 of 1 


30..276 


50/268 (19%) 
174/268 (65%) 


4.4e-23 



Example 46. 

The NOV46 clone was analyzed, and the nucleotide and predicted polypeptide 
sequences are shown in Table 46A. 



Table 46A. NOV46 Sequence Analysis 




SEQIDNO:143 1 1746 bp 


NOV46a, 

CG59383-01 DNA Sequence 


ATAATTCAGTTTGAAAACCAGTGGTTTCTCTTTCCTTCCCTATAGGTGTAAAGAATAT 


CCAGCTGGTGGCTACAGTTCCCCCTCTGGTTTTGCTGCCATGCATCCTGGGCGAACTA 
CTGGTAAAGGGCCCTCTACTCACACTCAGATTGACCAGCAACCTCCACGGCTTCTCAT 
TGTGCACATTGCTCTACCGTCCTGGGCTGACATCTGCACCAACCTCTGTGAGGCTCTG 
CAGAACTTCTTCTCTCTAGCCTGCAGCTTGATGGGCCCCAGCCGCATGTCCCTGTTCA 




GAACTTTGCTAGGTTGCAGACCTGCATCTCAGAACTCCGCATGTTACAGAGAGAAGGG 
TGTTTCAGATCACAAGGTGCTTCTCTGCGGCTGGCAGTAGAGGATGGGCTCCAGCAAT 
TCAAACAATACAGCAGACATGTGACCACAAGGGCAGCTCTGACCTATACCTCCCTGGA 
GATTACTATTCTGACTTCTCAGCCTGGAAAAGAGGTGGTCAAACAGTTGGAGGAAGGG 
TTGAAAGATACAGACCTAGCCAGAGTCAGGAGGTTTCAGGTCGTTGAGGTCACAAAGG 




TTCTATTCTGGGAACTGACATTGACCTTCAGACTATAGACAATGATATCGTCAGCATG 
GAGATTTTCTTCAAAGCCTGGCTACATAACAGTGGAACAGACCAAGAACAAATCCATC 




TCTGAAATGTGATCTCCAAGAGCGACTGCTCrGCCCATCCCTACTCGCTGGCACAGCT 
GACGGCTCCTTGAGAATGGATGACCCTAAAGGAGACTTCATCACACTCTACCAGATGG 
CTTCCCAGTCATCGGCCTCTCATTACAAGCTCCAAGTGATCAAGGCTTTAAAATCTAG 
CGGGCTCTGCGAGTCATTGACATATGGACTCCCGTTCATCCTCAGACCTACAAGCTGT 
TGGCAGCTGGACTGGGATGAGCTGGAGACAAATCAGCAACATTTCCATGCTTTGTGTC 
ACAGCCTGCTGAAAAGGGAATGGCTGCTGTTAGCCAAGGGGGAACCACCGGGCCCAGG 
ACACAGCCAGAGAATTCCTGCCAGCACCTTCTATGTGATCATGCCGTCACACTCCCTC 
ACACTGCTGGTAAAGGCGGTGGCCACGCGGGAACTGATGCTGCCCAGCACCTTCCCCC 
TGCTACCTGAGGACCCACATGATGATAGCCTTAAGAATAGCATGCTGGACAGCCTGGA 
GCTGGAGCCCACCTACAACCCCTTGCATGTTCAAAGCCACCTGTACTCACACCTGAGC 
AGCATCTATGCCAAGCCTCAGGGGCGGCTCCACCCACACTGGGAGAGCCGAGCTCCGA 
GAAAGACTGGGCAGTTGCAGACCAACCGAGCTCGAGCTACTGTGGCCCCCCTGCCTAT 
GACTCCTGTCCCAGGCAGAGCCTCCAAGATGCCAGCAGCCAGCAAATCTTCCTCAGAT 
GCCTTCTTCCTGCCTTCAGAGTGGGAGAAGGATCCCTCAAGGCCCTAAGTCACCAGCA 
CCAGAGCCCAGCTGCCCAGCTTAACCATATCCATGCTCAGGTTCACATAATGGCTATC 


TGTGGT 




ORF Start: ATG at 98 


ORF Stop: TAA at 1670 




SEQ ID NO: 144 


524 aa |MW at 58691. 3kD 
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NOV46a, 

CG5 93 83-01 Protein Sequence 


SRNSLFSLYMVQDQHECILPFVQVKGNFARLQTCISELRMLQREGCFRSQGASLRLAV 
EDGLQQFKQYSRHVTTRAALTYTSLEITILTSQPGKEWKQLEEGLKDTDLARVRRFQ 
WEVTKG I LEHVDSAS PVEDTSNDESSI LGTD I DLQTI DND IVSME I FFKAWLHNSGT 
DQEQIHLLLSSQCFSNISRPRDNPMCLKCDLQERLLCPSLLAGTADGSLRMDDPKGDF 
ITLYQMASQSSASHYKLQVIKALKSSGLCESLTYGLPFILRPTSCWQLDWDELETNQQ 
HFHALCHSLLKREWLLLAKGEPPGPGHSQRIPASTFYVIMPSHSLTLLVKAVATRELM 
LPSTFPLLPEDPHDDSLKNSMLDSLELEPTYNPLHVQSHLYSHLSSIYAKPQGRLHPH 
WESRAPRKTGQLQTNRARATVAPLPMTPVPGRASKMPAASKSSSDAFFLPSEWEKDPS 
RP 




SEQIDNO:145 1 1647 bp 


NOV46b, 

CG59383-02 DNA Sequence 


AAAGAATATCCAGCTGGTGGCTACAGTTCCCCCTCTGGTTTTGCrGCCATGCATCCTG 


GGCGAACTACTGGTAAAGGGCCCTCTACTCACACTCAGATTGACCAGCAACCTCCACG 
GCTTCTCATTGTGCACATTGCTCTACCGTCCTGGGCTGACATCTGCACCAACCTCTGT 
GAGGCTCTGCAGAACTTCTTCTCTCTAGCCTGCAGCTTGATGGGCCCCAGCCGCATGT 
CCCTGTTCAGTTTATACATGGTACAAGATCAGCATGAGTGCATCCTCCCTTTTGTGCA 
AGTGAAAGGGAACTTTGCTAGGTTGCAGACCTGCATCTCAGAACrCCGCATGTTACAG 
AGAGAAGGGTGTTTCAGATCACAAGGTGCTTCTCTGCGGCTGGCAGTAGAGGATGGGC 
TCCAGCAATTCAAACAATACAGCAGACATGTGACCACAAGGGCAGCTCTGACCTATAC 
CTCCCTGGAGATTACTATTCTGACTTCTCAGCCTGGAAAAGAGGTGGTCAAACAGTTG 
GAGGAAGGGTTGAAAGATACAGACCTAGCCAGAGTCAGGAGGTTTCAGGTCGTTGAGG 
TCACAAAGGGAATCCTAGAGCACGTGGACTCAGCGTCTCCTGTTGAGGATACCAGCAA 
TGATGAGAGTTCTATTCTGGGAACTGACATTGACCTTCAGACTATAGACAATGATATC 
GTCAGCATGGAGATTTTCTTCAAAGCCTGGCTACATAACAGTGGAACAGACCAGGAAC 
AAATCCATCTTCTTCTTTCTTCACAGTGTTTCAGCAACATTTCCAGACCCAGAGATAA 
TCCAATGTGTCTGAAATGTGATCTCCAAGAGCGACTGCTCTGCCCATCCCTACTCGCT 
GGCACAGCTGACGGCTCCTTGAGAATGGATGACCCTAAAGGAGACTTCATCACACTCC 
ACCAGATGGCTTCCCAGTCATCGGCCTCTCATTACAAGCTCCAAGTGATCAAGGCTTT 
AAAATCTAGCGGGCTCTGCGAGTCATTGACATATGGACTCCCGTTCATCCTCAGACCT 
ACAAGCTGTTGGCAGCTGGACTGGGATGAGCTGGAGACAAATCAGCAACATTTCCATG 
CTTTGTGTCACAGCCTGCTGAAAAGGGAATGGCTGCTGTTAGCCAAGGGGGAACCACC 
GGGCCCAGGACACAGCCAGAGAATTCCTGCCAGCACCTTCTATGTGATCATGCCGTCA 
CACTCCCTCACACTGCTGGTAAAGGCGGTGGCCACGCGGGAACTGATGCTGCCCAGCA 
CCTTCCCCCTGCTGCCTGAGGACCCACATGATGATAGCCTTAAGAATGTGGAGAGCAT 
GCTGGACAGCCTGGAGCTGGAGCCCACCTACAACCCCTTGCATGTTCAAAGCCACCTG 
TACTCACACCTGAGCAGCATCTATGCCAAGCCTCAGGGGCGGCTCCACCCACACTGGG 
AGAGCCGAGCTCCGAGAAAGCATCCCTGCAAGACTGGGCAGTTGCAGACCAACCGAGC 




ATCCCTCAAGGCCCTAAGTCACC 




ORF Start: ATG at 49 


ORF Stop: TAA at 1639 




SEQ ID NO: 146 


530 aa |MW at 59359. lkD 


NOV46b, 

CG59383-02 Protein Sequence 


MHPGRTTGKGPSTHTQIDQQPPRLLIVHIALPSWADICTHLCEALQNFFSLACSLMGP 
SRMSLFSLYMVQDQHECILPFVQVKGNFARLQTCISELRMLQREGCFRSQGASLRLAV 
EDGLQQFKQYSRHVTTRAALTYTSLEITILTSQPGKEWKQLEEGLKDTDLARVRRFQ 
WEVTKGILEHVDSASPVEDTSNDESSIL.GTDIDLQTIDNDIVSMEIFFKAWLHNSGT 
DQEQIHLLLSSQCFSNISRPRDNPMCLKCDLQERLLCPSLLAGTADGSLRMDDPKGDF 
I TLHQMASQSSASHYKLQVI KALKSSGLCESLTYGLPFI LRPTSCWQLDWDELETNQQ 
HFHALCHSLLKREWLLLAKGEPPGPGHSQRIPASTFYVIMPSHSLTLLVKAVATRELM 
LPSTFPLLPEDPHDDSLKNVESMLDSLELEPTYNPLHVQSHLYSHLSSIYAKPQGRLH 
PHWESRAPRKHPCKTGQLQTNRARATVAPLPMTPVPGRASKMPAASKSSSDAFFLPSE 
WEKDPSRP 



Sequence comparison of the above protein sequences yields the following sequence 
relationships shown in Table 46B. 



Table 46B. Comparison of NOV46a against NOV46b. 


Protein Sequence 


NOV46a Residues/ 
Match Residues 


Identities/ 
Similarities for the Matched Region 


NOV46b 


1..524 
1..530 


509/530 (96%) 
510/530 (96%) 
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Further analysis of the NOV46a protein yielded the following properties shown in 
Table 46C. 



Table 46C. Protein Sequence Properties NOV46a 


PSort 
analysis: 


0.4500 probability located in cytoplasm; 0.3000 probability located in 
microbody (peroxisome); 0.1000 probability located in mitochondrial matrix 
space; 0.1000 probability located in lysosome (lumen) 


SignalP 
analysis: 


No Known Signal Sequence Predicted 



A search of the NOV46a protein against the Geneseq database, a proprietary 
database that contains sequences published in patents and patent publication, yielded 
several homologous proteins shown in Table 46D. 



Table 46D. Geneseq Results for NOV46a 


Geneseq 
Identifier 


Protein/Organism/Length [Patent 
#, Date] 


NOV46a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities 
for the 
Matched 
Region 


Expect 
Value 


AAM34317 


Peptide #8354 encoded by probe for 
measuring placental gene expression 
- Homo sapiens, 52 aa. 
[WO200157272-A2, 09-AUG-2001] 


259..310 
1..52 


52/52(100%) 
52/52(100%) 


7e-23 


ABB 18624 


Protein #623 encoded by probe for 
measuring heart cell gene expression 
- Homo sapiens, 42 aa. 
[WO2001 57274-A2, 09-AUG-200 1 ] 


101..142 
1..42 


42/42(100%) 
42/42(100%) 


2e-16 


AAM66343 


Human bone marrow expressed 
probe encoded protein SEQ ID NO: 
26649 - Homo sapiens, 42 aa. 
[WO200157276-A2, 09-AUG-2001] 


101..142 
1..42 


42/42(100%) 
42/42(100%) 


2e-16 


AAM53955 


Human brain expressed single exon 
probe encoded protein SEQ ID NO: 
26060 - Homo sapiens, 42 aa. 
[WO200157275-A2, 09-AUG-2001] 


101..142 
1..42 


42/42(100%) 
42/42(100%) 


2e-16 


AAM26622 


Peptide #659 encoded by probe for 
measuring placental gene expression 
- Homo sapiens, 42 aa. 
[ WO200 1 57272-A2, 09-AUG-200 1 ] 


101..142 
1..42 


42/42(100%) 
42/42(100%) 


2e-16 
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In a BLAST search of public sequence databases, the NOV46a protein was found to 
have homology to the proteins shown in the BLASTP data in Table 46E. 



Table 46E. Public BLASTP Results for NOV46a 


Protein 
Accession 
Number 


Protein/Organism/Length 


NOV46a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Portion 


Expect 
Value 


Q9Z0E1 


D6MM5E PROTEIN - Mus 
musculus (Mouse), 529 aa. 


1..524 
1..526 


380/526 (72%) 
423/526 (80%) 


0.0 


Q96L07 


SIMILAR TO DNA SEGMENT, 
CHR 6, MIRIAM MEISLER 5, 
EXPRESSED - Homo sapiens 
(Human), 365 aa. 


1..358 
1..358 


358/358 (100%) 
358/358 (100%) 


0.0 



PFam analysis predicts that the NOV46a protein contains the domains shown in the 
Table 46F. 



Table 46F. Domain Analysis of NOV46a 


Pfam Domain 


NOV46a Match Region 


Identities/ 
Similarities 
for the Matched Region 


Expect Value 


RA: domain 1 of 1 


124..214 


18/115(16%) 
65/115 (57%) 


8.4 



Example 47. 

The NOV47 clone was analyzed, and the nucleotide and predicted polypeptide 
sequences are shown in Table 47A. 



Table 47 A. NOV47 Sequence Analysis 




SEQIDNO:147 1 960 bp 


NOV47a, 

CG58526-01 DNA Sequence 


AGGACTAAATAAAATGGCCTAAATTTAAATATGGATTGGGATTTCCATTCTCTTGCAG 
ATGCCCAGAACCAAAGAAGAGGTCTGCCTGGTTTTCTTCCTGGAGCTCCAGACCCAGA 
CCAAAGCCTTCCTGCCTCTTCCAATCCAGGGAACCAAGCATGGCAGCTGAGrCTCCCT 
CTGCCAAGCAGTTTCCTGCCAACAGTCAGTCTCCCTCCTGGTCTAGAATATTTAAGCC 
AGTTAGACCTGATAATTATACACCAGCAGGTGGAGCTGCTTGTGATACTTGGTACTGA 
GACCTCCAACAAATATGAGATTAAAAACAGCTTGGGACAAAGAATTTACTTTGCAGTG 
GAGGAAAGCATCTGCTTCAATCGTACTTTCTGTTCCACTCTGCGATCTTGCACCCTGA 
GGATCACAGATAACTCAGGTCGAGAGGTCATTACAGTGAACAGGCCCTTGAGATGTAA 




ATAGTTGGTTACGTTACGCAGAAGTGGGACCCCTTTCTGCCTAAATTCACAATCCAAA 
ATGCAAACAAAGAAGATATTTTGAAAATTGTTGGTCCTTGTGTGACATGTGGCTGTTT 
TGGCGATGTGGATTTTGAGAAGGTGAAAACCATTAATGAAAAGCTTACAATTGGGAAG 
ATTTCAAAGTACTGGTCAGGATTTGTAAATGATGTCTTCACAAATGCTGACAATTTCG 
GAATTCATGTTCCTGCAGATCTAGATGTAACAGTCAAAGCAGCAATGATCGGTGCCTG 
TTTTCTCTTTGTAAGTATGGGCTTTGAGAGCCCAGCCCTCCAAGATGAGAAAGAGTCA 
GTGTGGCAATTCAAAAAATCAGAGTGCCCTCTCACCTCCAAACAAGCCCACTTGTTCC 
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CCAGCGATGGTTCTTAGCCAGACTGAAATGAC 




ORF Start: ATG at 31 


ORF Stop: TAG at 943 




SEQ ID NO: 148 


304 aa |MW at 33794.2kD 


NOV47a, 

CG5 8526-01 Protein Sequence 


MDWDFHSLADAQNQRRGLPGFLPGAPDPDQSLPASSNPGNQAWQLSLPLPSSFLPTVS 
LPPGLEYLSQLDLI 1 1 HQQVELIiVI LGTETSNKYE I KNSLGQR I YFAVEES I CFNRTF 
CSTLRSCTLRITDNSGREVITVNRPLRCNSCWCPCYLQELEIQAPPGTIVGYVTQKWD 
PFLPKFT I QNANKEDI LKIVGPCVTCGCFGDVDFEKVKTI NEKLTIGKI SKYWSGFVN 
DVFTNADNFGIHVPADLDVTVKAAMIGACFLFVSMGFESPALQDEKESVWQFKKSECP 
LTSKQAHLFPSDGS 



Further analysis of the NOV47a protein yielded the following properties shown in 
Table 47B. 



Table 47B. Protein Sequence Properties NOV47a 


PSort 
analysis: 


0.8500 probability located in endoplasmic reticulum (membrane); 0.4400 
probability located in plasma membrane; 0.4244 probability located in 
microbody (peroxisome); 0.1000 probability located in mitochondrial inner 
membrane 


SignalP 
analysis: 


No Known Signal Sequence Predicted 



A search of the NOV47a protein against the Geneseq database, a proprietary 
database that contains sequences published in patents and patent publication, yielded 
several homologous proteins shown in Table 47C. 



Table 47C. Geneseq Results for NOV47a 


Geneseq 
Identifier 


Protein/Organism/Length [Patent 
#,Date] 


NOV47a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Region 


Expect 
Value 


AAG78341 


Human Mm-1 cell line derived 
transplantability-associated gene lb 
- Homo sapiens, 3 1 8 aa. 
[WO200164894-A2, 07-SEP-2001] 


24..282 
60..318 


1 52/263 (57%) 
187/263 (70%) 


5e-84 


AAB24113 


Human phospholipid scramblase 
HPLS protein sequence - Homo 
sapiens, 318 aa. [CN1259574-A, 12- 
JUL-2000] 


24..282 
60..318 


152/263 (57%) 
187/263 (70%) 


5e-84 


AAB24112 


Mouse phospholipid scramblase 
MPLS protein sequence - Mus sp, 
318 aa. [CN1259574-A, 12-JUL- 
2000] 


24..2S2 
60..318 


152/263 (57%) 
187/263 (70%) 


5e-84 
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AAY09309 


Human phospholipid scramblase - 
Homo sapiens, 318 aa. 
[W09919352-A2, 22-APR-1999] 


24..282 
60..318 


152/263 (57%) 
187/263 (70%) 


5e-84 


AAY29323 


Human PL scramblase - Homo 
sapiens, 318 aa. [W09936536-A2, 
22-JUL-1999] 


24..282 
60..318 


152/263 (57%) 
187/263 (70%) 


5e-84 


In a BLAST search of public sequence databases, the NOV47a protein was found to 
have homology to the proteins shown in the BLASTP data in Table 47D. 


Table 47D. Public BLASTP Results for NOV47a 


Protein 
Accession 
Number 


Protein/Organism/Length 


NOV47a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Portion 


Expect 
Value 


Q9JJ00 


Phospholipid scramblase 1 (PL 
scramblase 1) (Transplantability 
associated protein 1) (TRA1) (NOR1) 
- Mus musculus (Mouse), 328 aa. 


20..283 
66..328 


150/267 (56%) 
191/267 (71%) 


4e-84 


Q99M50 


PHOSPHOLIPID SCRAMBLASE 1 
- Mus musculus (Mouse), 327 aa. 


20..282 
66..327 


150/266 (56%) 
191/266 (71%) 


6e-84 


015162 


Phospholipid scramblase 1 (PL 
scramblase 1) (Erythrocyte 
phospholipid scramblase) (Ca2+ 
dependent phospholipid scramblase 1) 
(MmTRAlb) - Homo sapiens 
(Human), 318 aa. 


24..2S2 
60..318 


152/263 (57%) 
187/263(70%) 


2e-83 


P58195 


Phospholipid scramblase 1 (PL 
scramblase 1) (Ca2+ dependent 
phospholipid scramblase 1) - Rattus 
norvegicus (Rat), 335 aa. 


28..282 
84..335 


145/256 (56%) 
1 83/256 (70%) 


3e-81 


Q9NRY7 


Phospholipid scramblase 2 (PL 
scramblase 2) (Ca2+ dependent 
phospholipid scramblase 2) - Homo 
sapiens (Human), 224 aa. 


55..270 
6..22 1 


135/217(62%) 
164/217(75%) 


le-75 



PFam analysis predicts that the NOV47a protein contains the domains shown in the 
Table 47E. 
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Table 47E. Domain Analysis of NOV47a 



Pfam Domain 



NOV47a Match Region 



Identities/ 
Similarities 
for the Matched Region 



Expect Value 



No Significant Matches Found 



Example 48. 

The NOV48 clone was analyzed, and the nucleotide and predicted polypeptide 
sequences are shown in Table 48A. 



Table 48A. NOV48 Sequence Analysis 



SEQ ID NO: 149 1 957 bp 



NOV48a, 

CG57851-01 DNA Sequence 



CCCCTGCTGGTGCCCAAGACCACCGTGGAAGGAA TGGCTAAAGAGGAGACAAGTGAGT 
TAGAATGGGGCTTGTTACCCCCAGAAGAATTTTCCCAAGTGAATGGAATCATTCTTCA 
AAAGAAAATGTGCGATTTCTGGGATAAGATCTGGAACTTCCAAGCCAAGCCTGATGAC 
SCTTCTTACCCCAAAGCAGGTACCACTTGGACACAGGAAATTGTAGATC 
TGATACAAAATGATGGCGATATTGAGAAAAGCAGGCGCGCTTCCATTCAACTTCAACA 
ICTTTCCTGGAGTGGATAAGAATGACACACGCCAGGAAAATTTTTGCAGGGATTGAC 
CAGGCTAACACAATGCCTTCCCCAAGGACCCTGAAAACTCATCTTCCTGTACAACTAC 
TGCCTCCATCCTTCTGGGAGGAAAACTGTAAGATAATCTATGTGGCAAGAAATGCCAA 
GGATAACCTGGTGTCCTACTACCATTTTCAAAGGATGAGCAAAGCACTCCCTGACGTT 
TTGACAGTGGGAGAATACATTATGTGTGGGGAAGTGTTGTGGGGAATATGGGAAGAGA 
TTCGGACTTGGCAACTGCATAGGTTGTTCTGCTGGTTCTTTGATCATGCTTCTGAGAA 
TCCTAGAAAGTTCAAAAGGATAATGGAATTTATGGGGAATAAACTAGATGAAGATCCT 
GTCAAAAGAATTGTTCAGCACACATCTTTTGAAAGTAAGAAGAAAAACCAGATGACCA 
ACTATGTAATGATAACCTGTGACATCATGGACCACTCCATCTCCCCATTTATGAGGAA 
AGGGACCGTTGGAGAGTGGAAGGATTACTTCTCAGCAGCACAGAATAAGAGATTTGAT 
GAAGACAGGAAAATGGCTGACTCTTCTCTGACCTTCCACACGGAGCTCTA AAGAGAGA 
GAGACAAAGTCTATACTACACAGGGGCAC 



ORF Start: ATG at 34 ORF Stop: TAA at 919 



SEQ ID NO: 150 



295 aa MW at 34853.7kD 



NOV48a, 
CG57851-01 Protein Sequence 



MAKEETSELEWGLLPPEEFSQVNGIILQKKMCDFWDKIWNFQAKPDDLLIASYPKAGT 
TWTQEIVDLIQHDGDIEKSRRASIQLQHPFLEWIRMTHARKIFAGIDQANTMPSPRTL 
KTHLPVQLLPPSFWEENCKIIYVARNAKDNLVSYYHFQRMSKALPDVLTVGEYIMCGE 
VLWGIWEEIRTWQLHRLFCWFFDHASENPRKFKRIMEFMGNKLDEDPVKRIVQHTSFE 
SKKKNQMTNYVMITCDIMDHSISPFMRKGTVGEWKDYFSAAQNKRFDEDRKMADSSLT 
FHTEL 



Further analysis of the NOV48a protein yielded the following properties shown in 
Table 48B. 



Table 48B. Protein Sequence Properties NOV48a 


PSort 
analysis: 


0.6400 probability located in microbody (peroxisome); 0.4500 probability 
located in cytoplasm; 0.1000 probability located in mitochondrial matrix 
space; 0.1000 probability located in lysosome (lumen) 


SignalP 
analysis: 


No Known Signal Sequence Predicted 
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A search of the NOV48a protein against the Geneseq database, a proprietary 
database that contains sequences published in patents and patent publication, yielded 
several homologous proteins shown in Table 48C. 



Table 48C. Geneseq Results for NOV48a 


Geneseq 
Identifier 


Protein/Organism/Length [Patent 
#, Date] 


NOV48a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Region 


Expect 
Value 


AAE 12209 


Human ST drug-metabolising 
protein 2 encoded by DNA 
transcript 2 - Homo sapiens, 304 aa. 
[WO200172977-A2, 04-OCT-2001] 


16..295 
15. .304 


137/293 (46%) 
200/293 (67%) 


9e-74 


AAE12210 


Human ST drug-metabolising 
protein 3 encoded by cDNA - Homo 
sapiens, 304 aa. [WO2001 72977- 
A2, 04-OCT-2001] 


16..295 
15. .304 


129/293 (44%) 
190/293 (64%) 


le-67 


AAE 12208 


Human ST drug-metabolising 
protein 1 encoded by DNA 
transcript 1 - Homo sapiens, 304 aa. 
[WO200172977-A2, 04-OCT-2001] 


16..295 
15..304 


128/293 (43%) 
190/293 (64%) 


6e-67 


AAE05178 


Human drug metabolising enzyme 
(DME-9) protein - Homo sapiens, 
304 aa. [WO200151638-A2, 19- 
JUL-2001] 


16..295 
15..304 


128/293 (43%) 
189/293 (63%) 


le-66 


AAY67294 


Human STP2 (phenol 
sulphotransferase 2) amino acid 
sequence - Homo sapiens, 295 aa. 
[WO9964630-A1, 16-DEC-1999] 


15. .295 
10..295 


133/292 (45%) 
186/292(63%) 


5e-66 



In a BLAST search of public sequence databases, the NOV48a protein was found to 
have homology to the proteins shown in the BLASTP data in Table 48D. 



Table 48D. Public BLASTP Results for NOV48a 


Protein 
Accession 
Number 


Protein/Organism/Length 


NOV48a 
Residues/ 
Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Portion 


Expect 
Value 


Q90WR6 


SULFOTRANSFERASE 1C - 
Gallus gallus (Chicken), 307 aa. 


3..295 
5. .3 07 


170/304(55%) 
218/304(70%) 


3e-94 
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P50237 


N-hydroxyarylamine 
sulfotransferase (EC 2.8.2.-) 
(HAST-I) - Rattus norvegicus 
(Rat), 304 aa. 


1..295 
1..304 


172/308 (55%) 
222/308 (71%) 


3e-92 


070262 


PHENOL SULFOTRANSFERASE 
- Mus musculus (Mouse), 304 aa. 


1 8..295 
19..304 


164/289(56%) 
215/289 (73%) 


le-91 


075897 


Sulfotransferase 1C2 (EC 2.8.2.-) 
(SULT1C) (SULT1C#2) - Homo 
sapiens (Human), 302 aa. 


22..292 
22..299 


160/282 (56%) 
203/282 (71%) 


le-87 


000338 


Sulfotransferase 1C1 (EC 2.8.2.-) 
(SULT1C#1)(ST1C2) 
(humSULTC2) - Homo sapiens 
(Human), 296 aa. 


18..295 
12..296 


149/289 (51%) 
201/289 (68%) 


le-80 



PFam analysis predicts that the NOV48a protein contains the domains shown in the 
Table 48E. 



Table 48E. Domain Analysis of NOV48a 


Pfam Domain 


NOV48a Match 
Region 


Identities/ 
Similarities 
for the Matched 
Region 


Expect 
Value 


Sulfotransfer: domain 1 of 
1 


23.-285 


116/298 (39%) 
207/298 (69%) 


6.2e-82 



Example 49. 

The NOV49 clone was analyzed, and the nucleotide and predicted polypeptide 
sequences are shown in Table 49A. 



Table 49A. NOV49 Sequence Analysis 




SEQIDNO:151 1 1934 bp 


NOV49a, 

CG59377-01 DNA Sequence 


CTTGATTACGGAGACTGAACCTTCATAGGGTGCGCACTTACCAAGGACAGGAAGGTTT 


CTCTGTTTGAAGGGCTTTAAACTTATAACAAAGAAAATAAAAATGACGACTTCGTCTA 


TCAGACGGCAGATGAAAAACATCGTGAACAArTACTCAGAGGCAGAAATCAAAGTCCG 
GACCTGACCTACAACGTGGTGGCCTTCTCGGAGATCATGAGCATGGTGTGGAAGCGGC 




CCTCATCAAGACAGGCTCCGAACGTGTGGCCCAGCAGTGCCGGGAGAACATCTTCGCC 
ATCCAGACCCTGAAGGACTTCCAGTACATTGACCGAGATGGCAAGGACCAGGGCATCA 
ATGTGCGTGAGAAGTCAAAGCAACTGGTGGCTCTCCTCAAGGACGAGGAACGGTTGAA 
GGCTGAGAGGGCCCAGGCTCTCAAAACCAAAGAGCGCATGGCCCAGGTTGCCACTGGC 
ATGGGCAGCAACCAGATCACCTTTGGGCGAGGCTCCAGCCAGCCCAACCTCTCCACCA 
GCCACTCGGAGCAGGAGTATGGCAAGGCCGGGGGCTCCCCGGCCTCCTACCATGGCTC 
CACCTCCCCGCGAGTGTCCTCCGAGCTGGAGCAAGCCCGGCCCCAGACTAGTGGAGAA 




AACGCCTCAGGCGGGGTGATGACCTCAGATTACAGATGGCCCTGGAAGAAAGCCGAAG 
GGACACAGTTAAAATTCCAAAAAAGAAAGAGCAGACTACGCTGTTGGATTTAATGGAT 
GCTCTCCCCAGCTCGGGCCCCGCGGCCCAGAAAGCAGAGCCCTGGGGCCCGTCAGCCT 
CCACTAACCAGACCAACCCCTGGGGCGGGCCAGCGGCTCCTGCGAGTACTTCAGACCC 
CTGGCCATCGTTT A I J 1 v T 
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GGAGCCACCGCACAATCTGTCCCCAAGAACTCGGACCCCTGGGCAGCTTCACAGCAGC 
CTGCCTCCAGTGCTGGGAAAAGAGCTTCTGACGCGTGGGGCGCAGTCTCCACCACCAA 
GCCCGTGTCTGTCTCTGGGTCCTTTGAGCTCTTCAGTAATCTGAATGGTACAATTAAA 
GATGACTTTTCTGAATTTGACAACCTTCGGACTTCAAAAAAAACAGCCGAATCTGTGA 
CCTCTCTGCCATCCCAAAACAATGGAACTACCAGCCCTGACCCCTTTGAGTCTCAACC 
CCTGACTGTCGCCTCAAGCAAGCCCAGCAGTGCCCGGAAAACACCTGAGTCCTTCCTG 
GGCCCCAACGCGGCCCTGGTGAACCTGGACTCACTGGTGACCAGGCCTGCCCCACCAG 
CCCAGTCCCTCAACCCTTTCCTGGCACCAGGTGCTCCCGCCACCTCGGCCCCTGTTAA 
CCCTTTCCAGGTGAACCAGCCCCAGCCGCTGACACTGAACCAGCTTCGGGGGAGCCCA 
GTCCTGGGGACCAGCACATCCTTTGGGCCTGGCCCAGGAGTGGAGTCCATGGCTGTGG 
CCTCGATGACCTCCGCGGCCCCACAGCCAGCTCTGGGGGCCACTGGTTCCTCTCTGAC 
ACCACTGGGCCCTGCAATGATGAACATGGTGGGCAGTGTGGGTATACCCCCATCAGCA 
GCCCAGGCCACTGGCACAACCAACCCTTTCCTTCTCTAGTGCCTGGGCCTGGGACCCA 
CCCAGAGCACCTGTGCTGGAGGATGCCGAGCAGGGACTCTCGTCTGTGGGACGGGATC 


CAAGAGTTTGGGGATTAGGG 




ORF Start: ATG at 101 


ORF Stop: TAG at 1835 




SEQ ID NO: 152 


578 aa |MW at 61651. 2kD 


NOV49a, 

CG59377-01 Protein Sequence 


MTTSSIRRQMKNIVNNYSEAEIKVREATSNDPWGPSSSIJlTEIADLT'raVVAFSEIMS 
MVWKR1NDHGKNWRHVY KALTLLDYL I KTGSERVAQQCRENI FAI QTLKDFQY I DRDG 
KDQGINVREKSKQLVALLKDEERLKAERAQALKTKERMAQVATGMGSNQITFGRGSSQ 
PNLSTSHSEQEYGKAGGSPASYHGSTSPRVSSELEQARPQTSGEEELQLQLALAMSRE 
VAEQEERLRRGDDLRLQMALEESRRDTVKIPKKKEQTTLLDLMDALPSSGPAAQKAEP 
WGPSASTNQTNPWGGPAAPASTSDPWPSFGTKPAASIDPWGVPTGATAQSVPKNSDPW 
AASQQPASSAGKRASDAWGAVSTTKPVSVSGSFELFSNUJGTIKDDFSEFDNLRTSKK 
TAESVTSLPSQNNGTTSPDPFESQPLTVASSKPSSARKTPESFLGPNAALVNLDSLVT 
RPAPPAQSIjNPFLAPGAPATSAPVNPFQVNQPQPLTLNQLRGSPVLGTSTSFGPGPGV 
ESMAVASMTSAAPQPALGATGSSLTPLGPAMMNMVGSVGIPPSAAQATGTTNPFLL 



Further analysis of the NOV49a protein yielded the following properties shown in 
Table 49B. 



Table 49B. Protein Sequence Properties NOV49a 


PSort 
analysis: 


0.4936 probability located in mitochondrial matrix space; 0.3000 probability 
located in nucleus; 0.2087 probability located in mitochondrial inner 
membrane; 0.2087 probability located in mitochondrial intermembrane space 


SignalP 
analysis: 


No Known Signal Sequence Predicted 



A search of the NOV49a protein against the Geneseq database, a proprietary 
database that contains sequences published in patents and patent publication, yielded 
several homologous proteins shown in Table 49C. 



Table 49C. Geneseq Results for NOV49a 


Geneseq 
Identifier 


Protein/Organism/Length [Patent 
#, Date] 


NOV49a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Region 


Expect 
Value 


AAB93525 


Human protein sequence SEQ ID 
NO: 12872 - Homo sapiens, 584 aa. 
[EP1074617-A2, 07-FEB-2001] 


1..578 
1..584 


578/584 (98%) 
578/584 (98%) 


0.0 



243 



AAB95663 


Human protein sequence SEQ ID 
NO: 1 8438 - Homo sapiens, 370 aa. 
[EP1074617-A2, 07-FEB-2001] 


40..403 
1..370 


364/370 (98%) 
364/370 (98%) 


0.0 


AAB93011 


Human protein sequence SEQ ID 
NO: 1 1762 - Homo sapiens, 484 aa. 
[EP1074617-A2, 07-FEB-2001] 


1..407 
1..470 


385/470 (81%) 
390/470 (82%) 


0.0 


AAB42049 


Human ORFXORF1 813 
polypeptide sequence SEQ ID 
NO:3626 - Homo sapiens, 551 aa. 
[WO200058473-A2, 05-OCT-2000] 


1..578 
1..551 


306/636 (48%) 
370/636(58%) 


e-141 


AAB95100 


Human protein sequence SEQ ID 
NO: 1 7064 - Homo sapiens, 576 aa. 
[EP1074617-A2, 07-FEB-2001] 


1..578 
1..576 


298/636 (46%) 
371/636 (57%) 


e-137 


In a BLAST search of public sequence databases, the NOV49a protein was found to 
have homology to the proteins shown in the BLASTP data in Table 49D. 


Table 49D. Public BLASTP Results for NOV49a 


Protein 
Accession 
Number 


Protein/Organism/Length 


NOV49a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Portion 


Expect 
Value 


095207 


EPSIN 2A - Homo sapiens 
(Human), 584 aa. 


L.578 
1..584 


576/584 (98%) 
576/584 (98%) 


0.0 


Q9UPT7 


KIAA1065 PROTEIN - Homo 
sapiens (Human), 641 aa. 


1..578 
L.641 


557/641 (86%) 
562/641 (86%) 


0.0 


095208 


EPSIN 2B - Homo sapiens 
(Human), 642 aa. 


1 ..578 
1 ..642 


556/642 (86%) 
560/642 (86%) 


0.0 


Q9Z1Z3 


EH DOMAIN BINDING 
PROTEIN EPSIN 2 - Rattus 
norvegicus (Rat), 583 aa. 


1..578 
1..583 


512/590 (86%) 
526/590 (88%) 


0.0 


070447 


INTERSECTIN-EH BINDING 
PROTEIN IBP2 - Mus musculus 
(Mouse), 509 aa (fragment). 


76..578 
2..509 


438/515 (85%) 
459/515 (89%) 


0.0 



PFam analysis predicts that the NOV49a protein contains the domains shown in the 
Table 49E. 
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Table 49E. Domain Analysis of NOV49a 


Pfam Domain 


NOV49a Match Region 


Identities/ 
Similarities 
for the Matched Region 


Expect Value 


ENTH: domain 1 of 1 


17..140 


70/131 (53%) 
117/131 (89%) 


7.9e-68 


VHS: domain 1 of 1 


14..158 


33/160 (21%) 
90/160 (56%) 


3.3 


UIM: domain 1 of 2 


217..234 


11/18(61%) 
16/18(89%) 


0.043 


UIM: domain 2 of 2 


242..259 


5/18(28%) 
12/18(67%) 


80 



Example 50. 

The NOV50 clone was analyzed, and the nucleotide and predicted polypeptide 
sequences are shown in Table 50A. 



Table 50A. NOV50 Sequence Analysis 



SEQIDNO:153 2580 bp 



NOV50a, 

CG59258-01 DNA Sequence 



ATGCTGCTGGCCCCCTTTTATTGCTGGGTGTGTGCCCATGCTGCTGGCCCCCTTTTAT 
TGCTGGGCAGTGACAAACTGTACCATCAGTGGCTCTCCACTGTCCGGAAAGGAAGTGG 
AGCAATTCTGAATACTGTAAAGACCAAAGCAAATCCGGCCATGAAGACTGTCTACAAG 
TTCGACATTGCCGAGAATGGCTGCGCCCCCACCCCAGAAGAGCAGCTGCCAAAGACTG 
CACCGTCCCCACTGGTGGAGGCCAAGGACCCCAAGCTCCGAGAAGACCGGCGGCCAAT 
CACAGTCCACTTTGGACAGGACCAGTCTGAGATGTCTTTCAGCTCAGCACTCACTCAC 
GGCAAAGAGAGTGCCCGGACCCAGCCGGAGAGAGTCGTTGACAGGACTGGCGAGCCCC 
TGAATCCTGAGCGCGCTCTCTCCGGAGATCATCTCTGGCCTGTTACGCACTTGCTCTG 
GGCAACCCTGGGCAAGTCCTTGCTTGCCCTCATCTGTGAAATGGGTAGCAGCCCTCGT 
TCCCTGCAGAGGAGCCTTGCGCTGCTGGGGACACCCCAGCTTATTTGGGAAACTGCAA 
CCACCATGGCCGATGGCCCCACCACGCCCTGTCTAGGAAGCAGAGGCCTCCCCAGCAG 
CGTGTCCACTGTGCCCCTGGCCCTGCGTGAAGTGCCATCAGATGCCCCGCATCCCTGC 
AGCAGGGCCCTCGTGACTGGCCTCACAGATGAGGACACAGAGGCCCAGGGAAGTCACT 
TGCTTGCCAAAGTCACTCAGCAAACCATGTCTGTCTGGCTCTCAGAAAATGGGAAAGA 
AGCCTGGGCATTCAGCCATGAGGGAGCCACGGCTGTAGCCAGTGGAATGACGTACCCT 
CAGTCCAGGATGTGCACCCGGGCAGCCAGGTCCCACAGCCACTACTTTCTTGCCCCCA 
CCACTGCTCCCACAGTTCCCAGAACTCAGTCTCCAGATCTGGGCTCCAGGATGCAGAG 
GCTGTCCTCAGGCCTGGTAAAGCCCTTGCGACACTATGCGGTCTTCCTCTCCGAAGAC 
TCCTCTGATGATGAATGCCAGCGGGAAGAGGGCCCGAGCTCTGGCTTCACCGAGAGCT 
TTTTCTTCTCCGCTCCCTTTGAATGGCCGCAGCCGTATCGGACACTCAGGGAGTCAGA 
CAGCGCGGAAGGCGACGAGGCAGAGAGTCCAGAGCAGCAAGTGCGGAAGTCCACAGGC 
CCTGTCCCAGCTCCCCCTGACCGGGCTGCCAGCATCGACCTTCTGGAAGACGTCTTCA 
GCAACCTGGACATGGAGGCCGCACTGCAGCCACTGGGCCAGGCCAAGAGCTTAGAGGA 
CCTTCGTGCCCCCAAAGACCTGAGGGAGCAGCCAGGGACCTTTGACTATCAGAGGCTG 
GATCTGGGCGGGAGTGAGAGGAGCCGCGGGGTGACAGTGGCCTTGAAGCTTACCCACC 
CGTACAACAAGCTCTGGAGCCTGGGCCAGGACGACATGGCCATCCCCAGCAAGCCCCC 
AGCTGCCTCCCCTGAGAAGCCCTCAGCCCTGCTCGGAAACTCCCTGGCCCTGCCTCGA 
AGGCCCCAGAACCGGGACAGCATCCTGAACCCCAGTGACAAGGAGGAGGTGCCCACCC 
CTACTCTGGGCAGCATCACCATCCCCCGGCCCCAAGGCAGGAAGACCCCAGAGCTGGG 
CATCGTGCCTCCACCGCCCATTCCCCGCCCGGCCAAGCTCCAGGCTGCCGGCGCCGCA 
'CTTGGTGACGTCTCAGAGCGGCTGCAGACGGATCGGGACAGGCGAGCTGCCCTGAGTC 

CCCTGGCCCCGGGGCTGCAGGCACGAGCAGTGACGCCCTGCTCGCCCTCCTGGACCCG 
CTCAGCACAGCCTGGTCAGGCAGCACCCTCCCGTCACGCCCCGCCACCCCGAATGTAG 
CCACCCCATTCACCCCCCAATTCAGCTTCCCCCCTGCAGGGACACCCACCCCATTCCC 
ACAGCCACCACTCAACCCCTTTGTCCCATCCATGCCAGCAGCCCCACCCACCCTGCCC 
CTGGTCTCCACACCAGCCGGGCCTTTCGGGGCCCCTCCAGCTTCCCTGGGGCCGGCTT 
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TGCCCCTGGCCCGCTCAAGTGCCAGGGCTGCTGAGACCAAGCAGGGGCTGGCCCTGAG 
GCCTGGAGACCCCCCGCTTCTGCCTCCCAGGCCCCCTCAAGGCCTGGAGCCAACACTG 
CAGCCCTCTGCTCCTCAACAGGCCAGAGACCCCTTTGAGGATTTGTTACAGAAAACCA 
AGCAAGACGTGAGCCCGAGTCCGGCCCTGGCCCCGGCCCCAGACTCGGTGGAGCAGCT 
CAGGAAGCAGTGGGAGACCTTCGAGTGA 




ORF Start: ATG at 1 


ORF Stop: TGA at 2578 




SEQIDNO: 154 


859 aa [MW at 91746.7kD 


NOV50a, 

CG59258-01 Protein Sequence 


MIJAPFYCWCAHAAGPLLLIX3SDKLYHQWLSTVRKGSGAILNTVKTKANPAMKTVYK 
FDIAENGCAPTPEEQLPKTAPSPLVEAKDPKLREDRRPITVHFGQDQSEMSFSSALTH 
GKESARTQPERWDRTGEPLNPERALSGDHLWPVTHLLWATLGKSLLALICEMGSSPR 
SLQRSLALLGTPQLIWETATTMADGPTTPCLGSRGLPSSVSTVPLALREVPSDAPHPC 
SRALVTGLTDEDTEAQGSHLLAKVTQQTMSWJLSENGKEAWAFSHEGATAVASGMTYP 
QSRMCTRAARSHSHYFLAPTTAPTVPRTQSPDLGSRMQRLSSGLVKPLRHYAVFLSED 
SSDDECQREEGPSSGFTESFFFSAPFEWPQPYRTLRESDSAEGDEAESPEQQVRKSTG 
PVPAPPDRAASIDLLEDVFSNLDMEAALQPLGQAKSLEDLRAPKDLREQPGTFDYQRL 
DLGGSERSRGVTVALKLTHPYNKLWSLGQDDMAIPSKPPAASPEKPSALLGNSLALPR 
RPQNRDSILNPSDKEEVPTPTLGSITIPRPQGRKTPELGIVPPPPIPRPAKLQAAGAA 
LGDVSERLQTDRDRRAALSPGLLPGWPQGPTELLQPLSPGPGAAGTSSDALLALLDP 
LSTAWSGSTLPSRPATPNVATPFTPQFSFPPAGTPTPFPQPPLNPFVPSMPAAPPTLP 
LVSTPAGPFGAPPASLGPAFASGLLLSSAGFCAPHRSQPNLSALSMPNLFGQMPMGTH 
TSPLQPLGPPAVAPSRIRTLPLARSSARAAETKQGLALRPGDPPLLPPRPPQGLEPTL 
QPSAPQQARDPFEDLLQKTKQDVSPSPALAPAPDSVEQLRKQWETFE 



Further analysis of the NOV50a protein yielded the following properties shown in 
Q Table 50B. 



Table 50B. Protein Sequence Properties NOV50a 


PSort 
analysis: 


0.4500 probability located in cytoplasm; 0.3000 probability located in 
microbody (peroxisome); 0.1940 probability located in Iysosome (lumen); 
0.1000 probability located in mitochondrial matrix space 


SignalP 
analysis: 


Likely cleavage site between residues 15 and 16 



A search of the NOV50a protein against the Geneseq database, a proprietary 
database that contains sequences published in patents and patent publication, yielded 
5 several homologous proteins shown in Table 50C. 



Table 50C. Geneseq Results for NOV50a 


Geneseq 
Identifier 


Protein/Organism/Length 
[Patent #, Date] 


NOV50a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Region 


Expect 
Value 


AAM41501 


Human polypeptide SEQ ID NO 
6432 - Homo sapiens, 545 aa. 
[WO200153312-A1, 26-JUL- 
2001] 


22.. 103 
401.. 482 


82/82(100%) 
82/82 (100%) 


2e-42 


AAM39715 


Human polypeptide SEQ ID NO 
2860 - Homo sapiens, 559 aa. 
[WO200153312-A1, 26-JUL- 
2001] 


22..103 
396..496 


82/101 (81%) 
82/101 (81%) 


6e-39 
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AAWiloDj 


Mycobacterium tuberculosis 55 
kDa protein - Mycobacterium 
tuberculosis, 572 aa. 
[W09741252-A2, 06-NOV-1997] 


4y<S..O'r J 

71. .389 


yo/ijo (ZO/o) 
125/358 (34%) 




AAW31852 


Mycobacterium tuberculosis 74 
kDa protein - Mycobacterium 
tuberculosis, 763 aa. 
[W09741252-A2, 06-NOV-1997] 


498..845 
262..580 


96/358 (26%) 
125/358 (34%) 


8e-12 


AAB50363 


Human SRCAP - Homo sapiens, 
2972 aa. [WO200073467-A1, 07- 
DEC-2000] 


501. .845 
1235..1575 


112/369 (30%) 
141/369 (37%) 


le-11 


In a BLAST search of public sequence databases, the NOV50a protein was found to 
have homology to the proteins shown in the BLASTP data in Table 50D. 


Table 50D. Public BLASTP Results for NOVSOa 


Protein 
Accession 
Number 


Protein/Organism/Length 


NOV50a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Portion 


Expect 
Value 


Q9HCG4 


KIAA1608 PROTEIN - Homo 
sapiens (Human), 603 aa 
(fragment). 


309..859 
62..603 


501/555 (90%) 
510/555 (91%) 


0.0 


Q9H796 


CDNA: FLJ21 129 FIS, CLONE 
CAS06266 - Homo sapiens 
(Human), 559 aa. 


22.. 103 
396..496 


81/101 (80%) 
81/101 (80%) 


2e-37 


AAK44515 


HYPOTHETICAL 58.5 KDA 
PROTEIN - Mycobacterium 
tuberculosis CDC 1551, 598 aa. 


499..845 
299..562 


104/354(29%) 
121/354(33%) 


8e-14 


Q9SN46 


EXTENSIN-LIKE PROTEIN - 
Arabidopsis thaliana (Mouse-ear 
cress), 839 aa. 


604..848 
407..626 


73/249 (29%) 
100/249 (39%) 


3e-12 


Q41805 


EXTENSIN-LIKE PROTEIN 
PRECURSOR - Zea mays 
(Maize), 1188 aa. 


492..848 
415..749 


88/361 (24%) 
124/361 (33%) 


5e-12 



PFam analysis predicts that the NOV50a protein contains the domains shown in the 
Table 50E. 
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Table 50E. Domain Analysis of NOV50a 




Identities/ 




Pfam Domain 


NOV50a Match Region I Similarities 


Expect Value 




for the Matched Region 




No Significant Matches Found 



Example 51. 

The NOV51 clone was analyzed, and the nucleotide and predicted polypeptide 
sequences are shown in Table 51 A. 



Table 51A. NOV51 Sequence Analysis 




SEQIDNO:155 1 1394 bp 


NOV51a, 

HjDy4yz-ui ujNA sequence 


GTGGCCT6CTCCTGCAGCAATCCCAGGACCCCCTGCTCATGGGGCTGTTTCCTACTAA 


CCCCAAAGAGAAGACCCAGGAGGAACCCCCTGGCCAGAGCAGGGCCCCTGTGTTGACC 
GTGGTGTCCAAGTTCAAGGCCTCACTGGAGCAGCTTCTGCAGGTCCTACACAGCACCA 
CGCCCCACTACATTCGCTGCATCAAGCCCAACAGCCAGGGCCAGGCGCAGACCTTTCT 
CCAAGAGGAGGTCCTGAGCCAGCTGGAGGCCTGTGGCCTCGTGGAGACCATCCATATC 
AGTGCTGCTGGCTTCCCCATCCGGGTCTCTCACCGAAACTTTGTAGAACGATACAAGT 
TACTAAGAAGGCTTCATCCTTGCACATCCTCTGGCCCCGACAGCCCATATCCTGCCAA 
AGGGCTCCCTGAATGGTGTCCACACAGCGAGGAAGCCACGCTTGAACCTCTCATCCAG 
GACATTCTCCACACTCTGCCGGTCCTAACTCAGGCAGCAGCCATAACTGGTGACTCGG 
CTGAGGCCATGCCAGCCCCCATGCACTGTGGCAGGACCAAGGTGTTCATGACTGACTC 
TATGCTGGAGCTTCTGGAATGTGGGCGTGCCCGGGTGCTGGAGCAGTGTGCCCGCTGC 
ATCCAGGGTGGCTGGAGGCGACACCGGCACCGAGAGCAGGAGCGGCAGTGGCGGGCCG 
TCATGCTCATCCAGGCAGCCATTCGTTCCTGGTTAACTCGGAAACACATCCAGAGGCT 
GCATGCAGCTGCCACAGTCATCAAGCGTGCATGGCAGAAGTGGAGAATCAGAATGGCC 
TGCCTTGCTGCTAAAGAGCTGGATGGTGTGGAAGAAAAACACTTCTCTCAAGCTCCCT 
GTTCCCTGAGCACCTCGCCGCTGCAGACCAGGCTCCTGGAGGCAATAATCCGCTTCTG 
GCCCCTGGGACTGGTCCTGGCCAATACGGCTATGGGTGTAGGCAGCTTTCAGAGGAAA 
TTAGTGGTCTGGGCTTGCCTCCAGCTCCCCAGGGGCAGCCCCAGTAGCTACACTGTCC 
AGACAGCACAAGACCAGGCTGGTGTCACGTCCATCCGAGCGCTGCCTCAGGGATCGAT 
AAAGTTTCACTGCAGAAAGTCTCCACTGCGGTATGCTGACATCTGCCCTGAACCTTCA 
CCCTACAGCATTGCAGGCTTTAATCAGATTCTGCTGGAAAGACACAGGCTGATCCACG 
TGACCTCTTCTGCCTTCACTGGGCTGGGGTGATCCTTGGTGCCTTTGTTTCCACAAGG 
CCTTTTCCTGCCCCCTGCCTTGCCAAAGACATTTAATCAGCACACAGCTGCCAGACTA 


TTCCCACAGTGCTCCAAATGCACATGAACAACAGTGACGGCTCCAGCCTTCGACCCAG 


AG 




ORF Start: ATG at 39 


ORF Stop: TGA at 1248 




SEQIDNO: 156 


403 aa )mW at 45142.8kD 


NOV51a, 

CG59492-01 Protein Sequence 


MGLFPTNPKEKTQEEPPGQSRAPVLTWSKFKASLEQLLQVLHSTTPHYI RC I KPNSQ 
GQAQTFLQEEVLSQLEACGLVET I H I SAAGFP I RVSHRNFVERYKLLRRIjHP CTSSGP 
DSPYPAKGLPEWCPHSEEATLEPLIQDILHTLPVLTQAAAITGDSAEAMPAPMHCGRT 
KVFMTDSMLELLECGRARVLEQCARCIQGGWRRHRHREQERQWRAVMLIQAAIRSWLT 
RKH I QRLHAAATV I KRAWQKWR I RMACLAAKELDGVEEKHFSQAPCSLSTS P LQTRLL 
EAIIRFWPLGLVLANTAMGVGSFQRKLVVWACLQLPRGSPSSYTVQTAQDQAGVTSIR 
ALPQGSIKFHCRKSPLRYADICPEPSPYSIAGFNQILLERHRLIHVTSSAFTGLG 



Further analysis of the NOV5 1 a protein yielded the following properties shown in 
Table 5 IB. 



^ — - — — — — — ' ' — • — — - — — -1 

Table 51B. Protein Sequence Properties NOV51a 


PSort 
analysis: 


0.3000 probability located in nucleus; 0.2029 probability located in lysosome 
(lumen); 0.1000 probability located in mitochondrial matrix space; 0.0320 
probability located in microbody (peroxisome) 
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SignalP 
analysis: 



No Known Signal Sequence Predicted 



A search of the NOY5 1 a protein against the Geneseq database, a proprietary 
database that contains sequences published in patents and patent publication, yielded 
several homologous proteins shown in Table 51C. 



Table 51 C. Geneseq Results for NOV51a 


Geneseq 
Identifier 


Protein/Organism/Length 
[Patent #, Date] 


NOV51a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Region 


Expect 
Value 


AAY94290 


Human myosin heavy chain 
homologue - Homo sapiens, 612 
aa. [WO200026372-A1, 11-MAY- 
2000] 


1..403 
210..612 


401/403 (99%) 
401/403 (99%) 


0.0 


AAU23676 


Novel human enzyme polypeptide 
#762 - Homo sapiens, 387 aa. 
[WO200155301-A2, 02-AUG- 
2001] 


17..403 
1..387 


384/387 (99%) 
384/387 (99%) 


0.0 


ABB 10243 


Human cDNA SEQ ID NO: 551 - 
Homo sapiens, 570 aa. 
[WO200154474-A2, 02-AUG- 
2001] 


1..365 
206..570 


365/365 (100%) 
365/365 (100%) 


0.0 


AAU23123 


Novel human enzyme polypeptide 
#209 - Homo sapiens, 567 aa. 
[WO2001 55301 -A2, 02-AUG- 
2001] 


1..365 
203..567 


364/365 (99%) 
364/365 (99%) 


0.0 


AAM23563 


Human EST encoded protein SEQ 
ID NO: 1088 - Homo sapiens, 477 
aa. [WO200154477-A2, 02-AUG- 
2001] 


1.-189 
288..476 


188/189 (99%) 
188/189(99%) 


e-108 



In a BLAST search of public sequence databases, the NOV51a protein was found to 
have homology to the proteins shown in the BLASTP data in Table 5 ID. 
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Table 51D. Public BLASTP Results for NOV51a 


Protein 
Accession 
Number 


Protein/Organism/Length 


NOV51a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Portion 


Expect 
Value 


Q96H55 


HYPOTHETICAL 86.7 KDA 
PROTEIN - Homo sapiens (Human), 
770 aa. 


72..403 
439..770 


330/332 (99%) 
330/332 (99%) 


0.0 


Q9D2Z3 


1 1 10055A02RIK PROTEIN 
(RIKEN CDNA 1 1 10055A02 
GENE) - Mus musculus (Mouse), 
395 aa. 


3..394 
2..395 


288/394 (73%) 
320/394 (81%) 


e-162 


Q948A2 


PUTATIVE MYOSIN HEAVY 
CHAIN - Oryza sativa (Rice), 1601 
aa. 


2. .255 
663.-876 


84/258 (32%) 
125/258 (47%) 


le-23 


074805 


HYPOTHETICAL MYOSIN-LIKE 
PROTEPN C2D10.14C IN 
CHROMOSOME II - 
Schizosaccharomyces pombe 
(Fission yeast), 1471 aa. 


20..347 
615-903 


96/340 (28%) 
152/340 (44%) 


le-21 


T30148 


hypothetical protein E02C12.1 - 
Caenorhabditis elegans, 1019 aa. 


5.-249 
619..830 


74/248 (29%) 
119/248(47%) 


6e-21 



PFam analysis predicts that the NOV5 la protein contains the domains shown in the 
Table 5 IE. 



Table 51 E. Domain Analysis of NOV51a 


Pfam Domain 


NOV51a Match 
Region 


Identities/ 
Similarities 
for the Matched 
Region 


Expect 
Value 


myosin head: domain 1 of 
1 


26..105 


37/81 (46%) 
60/81 (74%) 


5.1e-25 



Example 52. 

The NOV52 clone was analyzed, and the nucleotide and predicted polypeptide 
sequences are shown in Table 52A. 
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Table 52A. NOV52 Sequence Analysis 




M O ID "NO: l>7 


1380 bp 


NOV52a, 

CG59564-01 DNA Sequence 


TAGAATTCCAGCGGCCGCTGAAATCCTCACTCGGTCAGTTCCTCGGGCGAGTTACGGG 


GACGACCTGCGGGAGCACGCGGGCAGTGGCCGGACGCTGAAGCCCAGGAGAGCGATGG 


AGACGTATGCGGAGGTTGGGAAGGAGGGCAAGCCTTCCTGTGCATCGGTGGATCTGCA 
GGGAGACAGCTCCTTACAGGTGGAGATTTCTGACGCAGTGAGTGAGCGGGACAAGGTG 
AAATTCACTGTTCAAACAAAGAGCTGCCTCCCTCACTTCGCCCAGACCGAGTTCTCAG 
TCGTGCGGCAGCACGAGGAGTTCATCTGGCTGCATGATGCCTACGTGGAGAATGAGGA 
GTACGCCGGCCTCATCATCCCCCCAGCCCCTCCGAGGCCAGACTTTGAGGCTTCGAGG 
GAAAAGCTACAGAAATTGGGCGAGGGGGACAGCTCTGTCACTCGGGAAGAGTTTGCCA 
AGATGAAGCAGGAGCTGGAAGCGGAGTACCTGGCCATCTTTAAGAAGACAGTTGCGAT 
GCACGAAGTCTTTCTGCAGCGCCTGGCGGCCCACCCCACCCTGCGTCGAGACCACAAC 
TTCTTTGTGTTTTTGGAATATGGACAGGATCTGAGTGTCCGGGGGAAGAACAGGAAGG 
AGCTCCTCGGAGGGTTTCTGAGGAATATTGTGAAGTCCGCGGATGAAGCCCTCATCAC 
GGGCATGTCAGGGCTCAAGGAGGTGGATGACTTCTTTGAGCATGAGAGGACCTTCCTG 
TTGGAGTATCACACCCGTATCCGAGATGCCTGCCTGCGGGCCGACCGCGTCATGCGCG 
CCCACAAGTGCCTGGCAGACGATTATATCCCTATCTCAGCTGCGCTGAGCAGTCTGGG 
AACACAGGAAGTCAACCAGCTAAGGACGAGCTTCCTCAAATTGGCAGAGCTCTTTGAC 
CGGCTGAGGAAGCTGGAGGGCCGGGTGGCTTCCGATGAGGACCTGAAGCTGTCAGACA 
TGCTGAGGTACTACATGCGTGACTCACAGGCAGCCAAGGACCTGCTGTACCGGCGGCT 
GCGGGCACTGGCCGACTACGAGAATGCCAACAAGGCGCTGGACAAGGCGCGCACCAGG 
AACCGGGAGGTGCGGCCCGCCGAGAGCCACCAGCAGCTGTGCTGCCAACGCTTCGAGC 
GCCTCTCCGACTCCGCCAAGCAAGAGCTCATGGACTTCAAGTCCCGCCGGGTCTCCTC 
TTTTCGAAAGAATCTCATTGAGCTGGCAGAGCTGGAGCTCAAACACGCCAAGGCCAGC 
ACCCTGATTCTCCGGAACACCCTTGTTGCCCTAAAGGGGGAGCCTTAGAGTAGCCAGA 
GCTCAGCCAGACCCTAATCTGGGATCTCCAGTGACCAGGGTATCCC 




ORF Start: ATG at 1 13 


ORF Stop: TAG at 1322 




SEQ ID NO: 158 


403 aa |MW at 46384.2kD 


NOV52a, 

CG59564-01 Protein Sequence 


METYAEVGKEGKPSCASVDLQGDSSLQVE I SDAVSERDKVKFTVQTKSCLPHFAQTEF 
SVWQHEEFIWLHDAYVENEEYAGLIIPPAPPRPDFEASREKLQKLGEGDSSVTREEF 
AKMKQELEAEYLAIFKKTVAMHEVFLQRLAAHPTLRRDHNFFVFLEYGQDLSVRGKNR 
KELLGGFLRNIVKSADEALITGMSGLKEVDDFFEHERTFLLEYHTRIRDACLRADRVM 
RAHKCIADDYIPISAALSSLGTQEVNQLRTSFLKLAELFDRLRKLEGRVASDEDLKLS 
DMLRYYMRDSQAAKDLLYRRLRALADYENANKALDKARTRNREVRPAESHQQLCCQRF 
HfL D ->I il . F E F 3_i ^ Z - XL' NIL L, ,E 



Further analysis of the NOV52a protein yielded the following properties shown in 
Table 52B. 



Table 52B. Protein Sequence Properties NOV52a 


PSort 
analysis: 


0.6500 probability located in cytoplasm; 0.1000 probability located in 
mitochondrial matrix space; 0.1000 probability located in lysosome (lumen); 
0.0000 probability located in endoplasmic reticulum (membrane) 


SignalP 

analysis: 


No Known Signal Sequence Predicted 



A search of the NOV52a protein against the Geneseq database, a proprietary 
database that contains sequences published in patents and patent publication, yielded 
several homologous proteins shown in Table 52C. 
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Table 52C. Geneseq Results for NOV52a 


Geneseq 
Identifier 


Protein/Organism/Length [Patent 
#, Date] 


NOV52a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Region 


Expect 
Value 


AAY94209 


Human TRAF four associated factor 
TFAF2 - Homo sapiens, 406 aa. 
[CA2245340-A1, 19-FEB-2000] 


17..402 
23..405 


273/386 (70%) 
333/386 (85%) 


e-160 


AAB07856 


Amino acid sequence of Smadl 
interactor protein clone SI +12-2 - 
Homo sapiens, 414 aa. 
[WO200047102-A2, 17-AUG-2000] 


17..402 
31..413 


273/386 (70%) 
333/386 (85%) 


e-160 


AAB43157 


Human ORFXORF2921 
polypeptide sequence SEQ ID 
NO:5842 - Homo sapiens, 460 aa. 
[WO200058473-A2, 05-OCT-2000] 


17..402 
77..459 


273/386 (70%) 
333/386 (85%) 


e-160 


AAB58368 


Lung cancer associated polypeptide 
sequence SEQ ID 706 - Homo 
sapiens, 414 aa. [WO200055180- 
A2, 21-SEP-2000] 


17..402 
31..413 


273/386 (70%) 
333/386 (85%) 


e-160 


AAO13507 


Human polypeptide SEQ ID NO 
27399 - Homo sapiens, 443 aa. 
[WO200164835-A2, 07-SEP-2001] 


17.. 400 
61. .441 


242/384 (63%) 
317/384 (82%) 


e-144 



In a BLAST search of public sequence databases, the NOV52a protein was found to 
have homology to the proteins shown in the BLASTP data in Table 52D. 



Table 52D. Public BLASTP Results for NOV52a 



Protein 
Accession 
Number 


Protein/Organism/Length 


NOV52a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Portion 


Expect 
Value 


Q9UNH7 


Sorting nexin 6 (TRAF4-associated 
factor 2) - Homo sapiens (Human), 
406 aa. 


17..402 
23..405 


273/386 (70%) 
333/386 (85%) 


e-159 


Q9CZ03 


2810425K19RIK PROTEIN - Mus 
musculus (Mouse), 406 aa. 


17.. 402 
23.. 405 


271/386 (70%) 
333/386 (86%) 


e-159 


Q9Y5X3 


Sorting nexin 5 - Homo sapiens 
(Human), 404 aa. 


17..400 
22.. 402 


242/384 (63%) 
317/384 (82%) 


e-143 
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Q9D8U8 


Sorting nexin 5 - Mus musculus 
(Mouse), 404 aa. 


17. .400 
22..402 


241/384 (62%) 
314/384 (81%) 


e-142 


Q96NG4 


CDNA FLJ30934 FIS, CLONE 
FEBRA2007017, MODERATELY 
SIMILAR TO HOMO SAPIENS 
TRAF4-ASSOCIATED FACTOR 2 
MRNA - Homo sapiens (Human), 
277 aa. 


1..237 
1..237 


236/237 (99%) 
236/237 (99%) 


e-134 



PFam analysis predicts that the NOV52a protein contains the domains shown in the 
Table 52E. 



Table 52E. Domain Analysis of NOV52a 


Pfam Domain 


NOV52a Match Region 


Identities/ 
Similarities 
for the Matched Region 


Expect Value 


PX: domain 1 of 1 


23..164 


39/160(24%) 
103/160 (64%) 


1.6e-15 





Example 53. 



The NOV53 clone was analyzed, and the nucleotide and predicted polypeptide 
5 sequences are shown in Table 53 A. 



Table 53A. NOV53 Sequence Analysis 


ISEQ IDNO: 159 |3056 bp 


NOV53a, 

CG59553-01 DNA Sequence 


CTCCTGCGGGGTCAAATACAGAATTTACGCACCCTTGGCTTCCTTGGAGCCTAGCGGC 


TCTCCCCGCGTCCAAGATGGCGGCAGAAGCAGCTGGTGGGAAATACAGAAGCACAGTC 

GTGACGATGTCGAAGACAGGGAAAATGAAAAGGGTCGCCTTGAAGAAGCCTACGAGAA 
ATGTGACCGTGACCTGGATGAATTGATTGTACAGCACTACACAGAATTGACGACAGCC 
ATTCGCACATACCAGAGCATCACAGAGCGCATCACTAACTCCCGAAATAAAATAAAGC 
AGGTAAAAGAGAACCTGCTTTCATGCAAGATGCTGCTGCACTGCAAACGGGATGAGCT 
TCGGAAACTGTGGATTGAAGGAATTGAGCATAAGCATGTCCTGAACTTGTTGGATGAA 
ATTGAGAATATCAAGCAAGTGCCTCAAAAGCTGGAACAGTGCATGGCCAGCAAGCACT 
ATCTCAGTGCCACTGACATGTTGGTGTCAGCAGTTGAGTCTTTGGAGGGCCCCCTGCT 
CCAGGTGGAAGGACTGAGTGACCTTCGACTAGAGCTTCACAGCAAGAAGATGAACCTT 
CACTTGGTTCTCATAGATGAACTACACCGGCACCTGTACATCAAATCGACTAGCCGAG 
TTGTGCAGCGTAACAAGGAAAAAGGGAAAATCAGCTCCCTCGTGAAAGATGCTTCTGT 
TCCTCTGATTGATGTTACAAACCTCCCTACTCCTCGAAAATTCCTTGATACCTCTCAC 
TATTCTACTGCTGGAAGCTCAAGTGTGAGGGAGATAAATCTGCAGGACATCAAGGAAG 
ATTTAGAATTGGATCCAGAGGAAAACAGCACCCTGTTTATGGGTATCCTCATTAAGGG 
CTTGGCGAAACTGAAGAAGATCCCAGAAACAGTTAAGGCAATCATAGAGCGCTTGGAG 
CAGGAGTTGAAGCAAATTGTGAAGAGGTCTACAACCCAGGTGGCAGACAGTGGCTATC 
AGCGGGGGGAGAACGTTACTGTGGAGAACCAACCAAGGTTGCTTCTAGAACTGCTGGA 
GTTACTGTTTGACAAGTTTAATGCTGTAGCCGCTGCACACTCTGTGGTCCTGGGATAC 
CTGCAGGACACTGTAGTGACTCCACTGACTCAGCAGGAAGATATCAAACTGTATGATA 
TGGCAGATGTATGGGTGAAGATCCAAGATGTTCTACAGATGCTATTAACTGAGTACTT 
GGATATGAAAAATACTCGTACGGCCTCTGAACCATCAGCTCAACTAAGCTATGCCAGC 
ACTGGACGAGAGTTTGCAGCCTTTTTTGCCAAGAAGAAACCTCAAAGGCCAAAAAATT 
CTCTTTTCAAGTTCGAATCGTCCTCCCATGCCATCAGTATGAGCGCCTATCTGCGAGA 
ACAGAGAAGGGAGCTCTATAGTCGGAGTGGAGAACTGCAAGGGGGTCCTGATGACAAC 
TTAATTGAAGGTGGAGGAACAAAATTTGTCTGCAAACCTGGAGCCAGAAACATTACCG 
TCATATTCCACCCATTACTAAGATTTATTCAGGAGATTGAGCATGCTCTGGGTCTTGG 
CCCAGCCAAACAGTGTCCTCTTCGAGAGTTTCTCACCGTGTACATCAAAAACATCTTT 
■I ^ , ~ T I 
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ACCCTTTGAAGATTCTGGCCAA::GCAGACACCATGAAGGTGCTGGGAGTGCAGCGGCC 
TCTCCTACAGAGCACAATCATTGTGGAGAAGACAGTTCAAGACCTCCTGAACCTGATG 
CATGACTTGAGTGCATATTCAGATCAATTCCTCAACATGGTGTGCGTGAAGCTCCAGG 
AGTACAAGGACACCTGCACTGCAGCTTACAGGGGTATTGTCCAGTCAGAAGAAAAACT 
TGTCATCAGTGCATCCTGGGCAAAAGATGATGATATCAGCAGACTCTTGAAATCTCTA 
CCAAACTGGATGAATATGGCTCAACCCAAACAGCTGAGGCCAAAAAGAGAGGAGGAAG 
AAGATTTCATAAGGGCAGCTTTTGGCAAGGAGTCTGAAGTTCTTATTGGGAACCTGGG 
TGATAAATTAATCCCTCCACAAGACATCCTTCGTGACGTCAGTGACCTCAAAGCCTTG 
GCCAACATGCATGAAAGCCTGGAATGGTTGGCAAGTCGAACAAAGTCAGCTTTCTCCA 
ATCTTTCTACATCCCAGATGCTTTCTCCTGCTCAAGACAGCCACACGAACACGGATCT 
CCCCCCAGTGTCAGAGCAGATCATGCAGACTCTCAGTGAACTTGCCAAATCGTTCCAG 
GATATGGCTGACCGCTGCTTGCTTGTCTTACATCTGGAAGTGAGGGTTCACTGTTTCC 
ACTATCTTATCCCTCTTGCAAAGGAGGGGAACTATGCCATTGTGGCTAATGTGGAAAG 
TATGGATTATGACCCCCTGGTGGTCAAGCTCAACAAAGATATCAGCGCCATTGAAGAG 
GCCATGAGCGCCAGCCTTCAGCAGCACAAGTTCCAGTATATCTTCGAAGGCCTGGGCC 
ACCTGATCTCCTGCATCCTCATTAATGGTGCCCAGTACTTCAGGCGCATCAGTGAGTC 
TGGCATCAAGAAAATGTGTAGGAACATTTTTGTTCTTCAGCAGAATTTGACCAACATC 
ACCATGTCGCGGGAGGCAGACCTGGACTTTGCAAGGCAGTACTACGAGATGCTTTACA 
ACACAGCTGACGAGCTCCTGAACCTGGTGGTGGACCAGGGTGTGAAGTACACGGAGCT 
GGAGTACATCCACGCTCTGACCCTGCTGCACCGCAGCCAGACTGGGGTGGGGGAACTG 
ACCACCCAGAACACGAGCTGCAGAGGAGGCTCAAAGAGATCATCTGCGAGCAGGCTGC 
CATCAAGCAAGCCACCAAGGACAAGAAGATAACTACCGTTTAGCAGGGCGTACTGCGG 
TTGGTGACGGGGGTCCCCTCAGTCACACTCACTTTTTTCC 




ORF Start: ATG al 75 


ORF Stop: TAA at 2988 




SEQ ID NO: 160 


971 aa |MW at 109984.9kD 


NOV53a, 

CG59553-01 Protein Sequence 


MAAEAAGGKYRSTVSKSKDPSGLLISVIRTLSTSDDVEDRENEKGRLEEAYEKCDRDL 
DELIVQHYTELTTAIRTYQSITERITHSRNKIKQVKENLLSCKMLLHCKRDELRKLWI 
EGIEHKHVLNLLDEIENIKQVPQKLEQCMASKHYLSATDMLVSAVESLEGPLLQVEGL 
SDLRLELHSKKMNLHLVLIDELHRHLYIKSTSRWQRNKEKGKISSLVKDASVPLIDV 
TNLPTPRKFLDTSHYSTAGSSSVREINLQDIKEDLELDPEENSTLFMGILIKGLAKLK 
KIPETVKAIIERLEQELKQIVKRSTTQVADSGYQRGENVTVENQPRLLLELLELLFDK 
FNAVAAAHSWLGYLQDTWTPLTQQEDIKLYDMADVWVKIQDVLQMLLTEYLDMKNT 
RTASEPSAQLSYASTGREFAAFFAKKKPQRPKNSLFKFESSSHAISMSAYLREQRREL 
YSRSGELQGGPDDNLI EGGGTKFVCKPGARNI TVI FHPLLRFI QE I EHALGLG PAKQC 
PLREFLTVYIKNIFLNQVLAEINKEIEGVTKTSDPLKILANADTMKVLGVQRPLLQST 
IIVEKTVQDLLNLMHDLSAYSDQFLNMVCVKLQEYKDTCTAAYRGIVQSEEKLVISAS 
WAKDDDISRLDKSLPNWMNMAQPKQLRPKREEEEDFIRAAFGKESEVLIGNLGDKLIP 
PQDILRDVSDLKALANMHESLEWLASRTKSAFSNLSTSQMLSPAQDSHTNTDLPPVSE 
QIMQTLSELAKSFQDMADRCLLVLHLEVRVHCFHYLIPIAKEGNYAIVANVESMDYDP 
LWKLNKDISAIEEAMSASLQQHKFQYIFEGLGHLISCILINGAQYFRRISESGIKKM 
CRNIFVLQQNLTNITMSREADLDFARQYYEMLYNTADELLNLWDQGVKYTELEYIHA 



Further analysis of the NOV53a protein yielded the following properties shown in 
Table 53B. 



Table 53B. Protein Sequence Properties NOV53a 


PSort 
analysis: 


0.5500 probability located in endoplasmic reticulum (membrane); 0.1900 
probability located in lysosome (lumen); 0.1000 probability located in 
endoplasmic reticulum (lumen); 0.1000 probability located in outside 


SignalP 
analysis: 


No Known Signal Sequence Predicted . 



A search of the NOV53a protein against the Geneseq database, a proprietary 
database that contains sequences published in patents and patent publication, yielded 
several homologous proteins shown in Table 53C. 
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Table 53C. Geneseq Results for NOV53a 


Geneseq 
Identifier 


Protein/Organ is m/Length 
[Patent #, Date] 


NOV53a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Region 


Expect 
Value 


AAB93175 


Human protein sequence SEQ ID 
NO: 121 14 - Homo sapiens, 974 aa. 
[EP1 07461 7-A2, 07-FEB-2001] 


1..947 
1..947 


947/947(100%) 
947/947(100%) 


0.0 


AAW69801 


Amino acid sequence of rsecS, a 
protein present in SA-17S complex 
- Rattus sp, 975 aa. [W09828419- 
A2, 02-JUL-1998] 


1 ..947 
1..948 


902/948 (95%) 
925/948 (97%) 


0.0 


AAB95143 


Human protein sequence SEQ ID 
NO: 17163 - Homo sapiens, 572 aa. 
[EP1074617-A2, 07-FEB-2001] 


403. .947 
1..545 


545/545 (100%) 
545/545 (100%) 


0.0 


AAB58175 


Lung cancer associated 
polypeptide sequence SEQ ID 513 
- Homo sapiens, 41 8 aa. 
[WO200055180-A2, 21-SEP- 
2000] 


571. .947 
15..391 


369/377 (97%) 
369/377 (97%) 


0.0 


AAG00950 


Human secreted protein, SEQ ID 
NO: 5031 - Homo sapiens, 100 aa. 
[ LP 1 03340 1-A2. 06-SEP-2000] 


451. .544 
7.. 100 


76/94(80%) 1 3e-36 
79/94 (83%) 

i 



m 

In a BLAST search of public sequence databases, the NOV53a protein was found to 
have homology to the proteins shown in the BLASTP data in Table 53D. 



Table 53D. Public BLASTP Results for NOV53a 


Protein 
Accession 
Number 


Protein/Organism/Length 


NOV53a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Portion 


Expect 
Value j 


Q96A65 


CDNA FLJ 14782 FIS, CLONE 
NT2RP4000524, HIGHLY 
SIMILAR TO MUS MUSCULUS 
SEC8 MRNA (SECRETORY 
PROTEIN SEC 8) - Homo sapiens 
(Human), 974 aa. 


1..947 
1..947 


947/947(100%) 
947/947(100%) 


0.0 | 


Q9C0G4 


KIAA1699 PROTEIN -Homo 
sapiens (Human), 966 aa (fragment). 


9. 947 
1..939 


939.9*9(100%) 
939/939(100%) 


0.0 


035382 


SEC8 - Mus musculus (Mouse), 971 
aa. 


1..971 
1..971 


923/972 (94%) 
946/972 (96%) 


0.0 
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Q62824 


RSEC8 - Rattus norvegicus (Rat), 


1..947 


902/948 (95%) 


o.(T~j 




975 aa (fragment). 


1..948 


925/948 (97%) 


] 

___ 


Q9P102 


REC8 - Homo sapiens (Human), 637 


339..947 


609/609 (100%) 






aa (fragment). 


2..610 


609/609(100%) 


| 



PFam analysis predicts that the NOV53a protein contains the domains shown in the 
Table 53E. 



Table 53E. Domain Analysis of NOV53a 



! Pfam Domain 



NOV53a Match Region 



Identities/ 
Similarities 
for the Matched Region 



Expect Value 



No Significant Matches Found 



Example 54. 

The NOV54 clone was analyzed, and the nucleotide and predicted polypeptide 
sequences are shown in Table 54A. 



Table 54A. NOV54 Sequence Analysis 




SEQ ID NO: 161 J501 bp 


NOV54a, 

CG59545-01 DNA Sequence 


CAACACGAGGAACAATGTOT r TTT' * ' - "PTGTCTTTGTC 
TGTTGGTTCCTGCGTGATAATCAAAGGGACACTGATCGACTCTTCTATCAACGAACCA 
CAGCTGCAGGTGGATTTCTACACTGAGATGAATGAGGACTCAGAAATTGCCTTCCATT 
TGCGAC TG< irT- i TGGGATATGGAT 

ATCTACGTGTGTCTCAATGAGTATGAGGTAAAGGTAAATGGTGAATACATTTATGCCT 
TTGTCCATCGAATCCCGCCATCATATGTGAAGATGATTCAAGTGTGGAGAGATGTCTC 
CCTGGACTCAGTGCTTGTCAACAATGGACGGAGATGATCACACTCCTCATTGTTGAGG 
AAACCCTCTTTCTACCTGACCATGGGATTCCTAGAGC 




ORF Man: ATG at 15 


ORF Stop: TGA at 441 




SEQ ID NO: 162 


142 aa MW al 1651 l.9kD 


NOV54a, 

CG59545-01 Protein Sequence 


MSSLPVPYKLPVSLSVGSCVIIKGTLIDSSINEPQLQVDFYTEMNEDSEIAFHLRVHL 
GRRWMNSREFGIWMLEENLHYVPFEDGKPFDLRIYVCLNEYEVKVNGEYIYAFVHRI 
PPSYVKMIQVWRDVSLDSVLVNNGRR 



Further analysis of the NOV54a protein yielded the following properties shown in 
Table 54B. 



Table 54B. Protein Sequence Properties NOV54a 


PSort 
analysis: 


0.5500 probability located in endoplasmic reticulum (membrane); 0.1900 
probability located in lysosome (lumen); 0.1000 probability located in 
endoplasmic reticulum (lumen); 0.1000 probability located in outside 


SignalP 
analysis: 


No Known Signal Sequence Predicted 
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A search of the NOV54a protein against the Geneseq database, a proprietary 
database that contains sequences published in patents and patent publication, yielded 
several homologous proteins shown in Table 54C. 



Table 54C. Geneseq Results for NOV54a 


Geneseq 
Identifier 


Protein/Organism/Length 
[Patent #, Date] 


NOV54a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Region 


Expect 
Value 


AAG66741 


Human Charcot-Leyden crystal 
protein 5A (CLC5A) - Homo 
sapiens, 142 aa. [CN1302875-A, 
ll-JUL-2001] 


1..142 
1..142 


139/142 (97%) 
139/142 (97%) 


2e-77 


AAG66742 


Human Charcot-Leyden crystal 
protein 5B (CLC5B) - Homo 
sapiens, 170 aa. [CN1302875-A, 
ll-JUL-2001] 


6..142 
34..170 


136/137(99%) 
136/137 (99%) 


3e-76 


AAM79041 


Human protein SEQ ID NO 1703 - 
Homo sapiens, 139 aa. 
[WO200157190-A2, 09-AUG- 
2001] 


1..139 
1..139 


107/139 (76%) 
116/139 (82%) 


2e-56 


AAY28350 


Full Placental Protein 13 amino 
acid sequence - Homo sapiens, 139 
aa. [WO9938970-A1, 05-AUG- 
1999] 


1..139 
1..139 


107/139(76%) 
116/139(82%) 


2e-56 


AAG78627 


Human Charcot-Leyden crystal 4 
CLC4 protein #2 - Homo sapiens, 
167 aa. [CN1302876-A, ll-JUL- 
2001] 


6..139 
34..167 


102/134 (76%) 
111/134 (82%) 


2e-53 



In a BLAST search of public sequence databases, the NOV54a protein was found to 
have homology to the proteins shown in the BLASTP data in Table 54D. 



Table 541). Public BLASTP Results for NOV54a 


Protein 
Accession 
Number 


Protein/Organism/Length 


NOV54a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Portion 


Expect 
Value 


Q9UHV8 


PLACENTAL PROTEIN 13 
(PLACENTA PROTEIN 13) - Homo 
sapiens (Human), 139 aa. 


1..139 
1..139 

- - 


107/139 (76%) 
116/139 (82%) 


9e-56 
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Q9NR03 


PI AC) NT Al PR( ITEIN 13-1 1KJ 
PROTEIN - Homo sapiens (Human), 
139 aa. 


1..139 
1..139 


86 139(61%) 
107/139(76%) 


9e-45 


A46523 


Charcot-Leyden crystal protein - 
human, 142 aa. 


1..142 
1. 142 


76 142 (53%) 
96/142 (67%) 


7e-36 


Q05315 


Eosinophil lysophospholipase (EC 
3.1 .1 .5) (Charcot-Leyden crystal 
protein) (Lysolecithin acylhydrolase) 
(CLC) (Galactin-10) - Homo sapiens 
(Human), 141 aa. 


2.. 142 
1..141 


75/141 (53%) 
95/141 (67%) | 


3e-35 


Q96KD6 


PLACENTAL PROTEIN 13-LIKE - 
Homo sapiens (Human), 1 04 aa 
(fragment). 


1..104 
1..104 


66/104 (63%) 
79/104 (75%) 


le-31 



PFam analysis predicts that the NOV54a protein contains the domains shown in the 
Table 54E. 



Table 54E. Domain Analysis of NOV54a 


Pfam Domain 


NOV54a Match 
Region 


Identities/ 
Similarities 
for the Matched 
Region 


Expect 
Value 


Gal-bind lectin: domain 1 

on 


5..137 


37/142 (26%) 
106/142 (75%) 


3.1e-28 



Example 55. 



The NOV55 clone was analyzed, and the nucleotide and predicted polypeptide 
sequences are shown in Table 55A. 



Table 55A. NOV55 Sequence Analysis 




SEQIDNO:163 (2071 bp 


NOV55a, 

CG59435-01 DNA Sequence 


AAACTATTTGTAGGCGCAGTCATGCAGGAAAACCTCAGATTTGCTTCATCAGGAGATG 
ATATTAAAATATGGGATGCTTCATCTATGACATTGGTGGATAAATTCAACCCACACAC 
ATCACCACATGGAATCAGCTCAATATGTTGGAGCAGCAATAGTAACTTTTTAGTAACA 
GCATCTTCCAGTGGCGACAAAATAGTTGTCTCAAGTTGCAAATGTAAACCTGTTCCAC 
TTTTAGAGCTTGCTGAAGGGCAAAAGCAGACATGTGTCAATTTAAATTCTACATCTAT 
GTATTTGGTAAGCGGAGGCCTAAATAACACTGTTAATATTTGGGATTTAAAATCAAAA 
AGAGTTCATCGATCTCTTAAGGATCATAAAGATCAAGTAACTTGTGTAACATACAATT 
GGAATGATTGCTACATTGCTTCTGGATCTCTTAGTGGTGAAATTATTTTACACAGTGT 
AACCACTAATTTATCTAGTACTCCTTTTGGCCATGGTAGTAACCAGGTTCGGCACTTG 
AAGTACTCCTTGTTTAAGAAATCACTACTGGGCAGTGTTTCGGATAATGGAATAGTAA 
CTCTCTGGGATGTAAATAGTCAGAGTCCATACCATAACTTTGACAGTGTACACAAAGC 
TCCAGCGTCAGGCATCTGTTTTTCTCCTGTCAATGAATTGCTCTTTGTAACCATAGGC 
TTGGATAAAAGAATCATCCTCTATGACACTTCAAGTAAGAAGCTAGTGAAAACTTTAG 
TGGCTGACACTCCTCTAACTGCGGTAGATTTCATGCCTGATGGAGCCACTTTGGCTAT 
TGGATCTTCCCGGGGGAAAATATATCAATATGATTTAAGAATGTTGAAATCACCAGTT 
AAGACCATCAGTGCTCACAAGACATCTGTGCAGTGTATAGCATTTCAGTACTCCACTG 
TTCTTACTAAGTCAAGTTTAAATAAAGGCTGTTCAAATAAGCCCACAACAGTGAACAA 
ACGAAGTGTTAATGTGAATGCTGCTAGTGGAGGAGTTCAGAATTCCGGAATTGTCAGA 
GAAGCAGCTGCCACGTCGATGGCCAGAGGTCTACCACAACCTATGACATCAGCTATGG 
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GGAAAGGAACAGTTGCTGTTCAAGAAAAAGCAGGTTTGCCTCGAAGCATAAACACAGA 
CACTTTATCTAAGGAAACAGACAGTGGAAAAAATCAGGATTTCTCCAGCTTTGATGAT 
ACTGGGAAAAGTAGTTTAGGTGACATGTTCTCACCTATCAGAGATGATGCTGTAGTTA 
ACAAGGGAAGTGATGAGTCCATAGGCAAAGGAGATGGCTTTGACTTTCTACCGCAGTT 
GAACTCAGTGTTTCCTCCAAGAAAAAATCCAGTAACTTCAAGTACTTCAGTATTGCAT 
TCTAGTCCTCTTAATGTTTTTATGGGATCTCCAGGGAAAGAGGAAAATGAAAACCGTG 
ATCTAACAGCTGAGTCTAAGAAAATATATATGGGAAAACAGGAATCTAAAGACTCCTT 
CAAACAGTTAGCAAAGTTGGTCACATCTGGTGCTGAAAGTGGAAATCTAAATACCTCT 
CCATCATCTAACCAAACAAGAAATTCTGAGAAATTTGAAAAGCCAGAGAATGAAATTG 
AAGCCCAGTTGATATGTGAACCCCCAATCAATGGATCCTCAACTCCAAATCCAAAGAT 
AGCATCTTCTGTCACTGCTGGAGTTGCCAGTTCACTCTCAGAAAAAATAGCCGACAGC 
ATTGGAAATAACCGGCAAAATGCACCATTGACTTCCATTCAAATTCGTTTTATTCAGA 
ACATGATACAGGAAACGTTGGATGACTTTAGAGAAGCATGCCATAGGGACATTGTGAA 
TTTGCAAGTGGAGATGATTAAACAGTTTCATATGCAACTGAATGAAATGCATTCTTTG 
CTGGAAAGATACTCAGTGAATGAAGGTTTAGTGGCTGAAATTGAAAGACTACGAGAAG 
AAAACAAAAGATTACGGGCCCACTTTTGAAATTTCAGTGAATACCTTAATGTTCTGTA 
ATTTGGGAAGTTTCTGGCAACACAGAACTACATAGAATCAT 




ORF Start: A 1 G at 22 


ORF Stop: TGA at 1 999 




SEQ ID NO: 164 


659 aa M\* at 718 l i I) 


NOV55a, 

CG59435-01 Protein Sequence 


MQENLRFASSGDI i ^KFNPHTSPHGISSICWSSNSNFLVTASSSGDK 
IWSSCKCKPVPLLELAEGQKQTCVNLNSTSMYLVSGGLKMTVNIWDLKSKRVHRSLK 
DHKDQVTCVTYNWNDCYIASGSLSGEIILHSVTTNLSSTPFGHGSNQVRHLKYSLFKK 
SLLGSVSDNGIVTLWDWSQSPYHNFDSVHKAPASGICFSPVNELLFVTIGLDKRIIL 
YDTSSKKLVKTLVADTPLTAVDFMPDGATLAIGSSRGKIYQYDLRMLKSPVKTISAHK 
TSVQCIAFQYSTVLTKSSLNKGCSNKPTTVNKRSVNVNAASGGVQNSGIVREAPATSI 
ATVLPQPMTSAMGKGTVAVQEKAGLPRSINTDTLSKETDSGKNQDFSSFDDTGKSSLG 
DMFSP I RDDAWNKGSDES I GKGDGFDFLPQLNSVFPPRKNPVTSSTSVLHSS PLNVF 
MGSPGKEENENRDLTAESKKIYMGKQESKDSFKQIjAKLVTSGAESGNLJNITSPSSNQ.tr 
NSEKFEKPENEIEAQLICEPPINGSSTPNPKIASSVTAGVASSLSEKIADSIGNNRQN 
APLTSIQIRFIQNMIQETLDDFREACHRDIVNLQVEMIKQFHMQLNEMHSLLERYSVN 
E L AE -j EEI U It 




SEQ ID NO: 165 1 2009 bp 


NOV55b, 

CG59435-02 DNA Sequence 


AAACTATTTGTAG W ^ ( r^T T T 
ATATTAAAATATGGGATGCTTCATCTATGACATTGGTGGATAAATTCAACCCACACAC 
ATCACCACATGGAATCAGCTCAATATGTTGGAGCAGCAATAATAACTTTTTAGTAACA 
GCATCTTCCAGTGGCGACAAAATAGTTGTCTCAAGTTGCAAATGTAAACCTGTTCCAC 

GTATTTGGTAAGCGGAGGCCTAAATAACACTGTTAATATTTGGGATTTAAAATCAAAA 
AGAGTTCATCGATCTCTTAAGGATCATAAAGATCAAGTAACTTGTGTAACATACAATT 
GGAATGATTGCTACATTGCTTCTGGATCTCTTAGTGGTGAAATTATTTTACACAGTGT 
AACCACTAATTTATCTAGTACTCCTTTTGGCCATGGTAGTAACCAGTCTGTTCGGCAC 
TTGAAGTACTCCTTGTTTAAGAAATCACTACTGGGCAGTGTTTCGGATAATGGAATAG 
TAACTCTCTGGGATGTAAATAGTCAGAGTCCATACCATAACTTTGACAGTGTACACAA 
AGCTCCAGCGTCAGGCATCTGTTTTTCTCCTGTCAATGAATTGCTCTTTGTAACCATA 
GGCTTGGATAAAAGAATCATCCTCTATGACACTTCAAGTAAGAAGCTAGTGAAAACTT 
TAGTGGCTGACACTCCTCTAACTGCGGTAGATTTCATGCCTGATGGAGCCACTTTGGC 
TATTGGATCTTCCCGGGGGAAAATATATCAATATGATTTAAGAATGTTGAAATCACCA 
GTTAAGACCATCAGTGCTCACAAGACATCTGTGCAGTGTATAGCATTTCAGTACTCCA 
CTGTTCTTACTAAGTCAAGTTTAAATAAAGGCTGTTCAAATAAGCCCACAACAGTGAA 
CAAACGAAGTGTTAATGTGAATGCTGCTAGTGGAGGAGTTCAGAATTCCGGAATTGTC 
AGAGAAGCACCTGCCACGTCCATTGCCACAGTTCTACCACAACCTATGACATCAGCTA 
TGGGGAAAGGAACAGTTGCTGTTCAAGAAAAAGCAGGTTTGCCTCGAAGCATAAACAC 
AGACACTTTATCTAAGGAAACAGACAGTGGAAAAAATCAGGATTTCTCCAGCTTTGAT 
GATACTGGGAAAAGTAGTTTAGGTGACATGTTCTCACCTATCAGAGATGATGCTGTAG 
TTAACAAGGGAAGTGATGAGTCCATAGGCAAAGGAGATGGCTTTGACTTTCTACCGCA 
GTTGAACTCAGTGTTTCCTCCAAGAAAAAATCCAGTAACTTCAAGTACTTCAGTATTG 
CATTCTAGTCCTCTTAATGTTTTTATGGGATCTCCAGGGAAAGAGGAAAATGAAAACC 
GTGATCTAACAGCTGAGTCTAAGAAAATATATATGGGAAAACAGGAATCTAAAGACTC 
CTTCAAACAGTTAGCAAAGTTGGTCACATCTGGTGCTGAAAGTGGAAATCTAAATACC 
TCTCCATCATCTAACCAAACAAGAAATTCTGAGAAATTTGAAAAGCCAGAGAATGAAA 
TTGAAGCCCAGTTGATATGTGAACCCCCAATCAATGGATCCTCAACTCCAAATCCAAA 
GATAGCATCTTCTGTCACTGCTGGAGTTGCCAGTTCACTCTCAGAAAAAATAGCCGAC 
AGCATTGGAAATAACCGGCAAAATGCACCATTGACTTCCATTCAAATTCGTTTTATTC 
AGAACATGATACAGGAAACGTTGGATGACTTTAGAGAAGCATGCCATAGGGACATTGT 
GAATTTGCAAGTGGAGATGATTAAACAGTTTCATATGCAACTGAATGAAATGCATTCT 
TTGCTGGAAAGATACTCAGTGAATGAAGGTTTAGTGGCTGAAATTGAAAGACTACGAG 
a - a f , ^ TTACGGGCCCACTTTTGAAATTT 




ORF Start: Alii at 22 


ORF Stop: TGA at 2002 




SEQ IDNI >. 166 


660 aa !MW at 71965.3kD 


NOV55b, 


MQENLRFASSGDDIKIWDASSMTLVDKFNPHTSPHGI SS I CWSSNNNFLVTASSSGDK 



259 



CG59435-02 Protein Sequence 



IWSSCKCKPVPLLELAEGQKQTCVNLNSTSMYLVSGGLNNTVNIWDLKSKRVHRSLK 
DHKDQVTCVTYNWNDCYIASGSLSGEIILHSVTTNLSSTPFGHGSNQSVRHLKYSLFK 
KSLLGSVSDNGIVTLWDVNSQSPYHNFDSVHKAPASGICFSPVNELLFVTIGLDKRII 
LYDTSSKKLVKTLVADTPLTAVDFMPDGATLAIGSSRGKIYQYDLRMLKSPVKTISAH 
KTSVQCIAFQYSTVLTKSSLNKGCSNKPTTVNKRSVNVNAASGGVQNSGIVREAPATS 
IATVLPQPMTSAMGKGTVAVQEKAGLPRSINTDTLSKETDSGKNQDFSSFDDTGKSSL 
GDMFSPIRDDAWNKGSDESIGKGDGFDFLPQLNSVFPPRKNPVTSSTSVLHSSPLNV 
FMGSPGKEENENRDLTAESKKIYMGKQESKDSFKQLAKLVTSGAESGNLNTSPSSNQT 
RNSEKFEKPENEIEAQLICEPPINGSSTPNPKIASSVTAGVASSLSEKIADSIGNNRQ 
NAPLTSIQIRFIQNMIQETLDDFREACHRDIVNLQVEMIKQFHMQLNEMHSLLERYSV 
NEGLVAE I ERLREENKRLRAHF 



Sequence comparison of the above protein sequences yields the following sequence 
relationships shown in Table 55B. 



Table 55B. Comparison of NOV55a against NOV55b. 


Protein Sequence 


NOV55a Residues/ 


Identities/ 


Match Residues 


Similarities for the Matched Region 


NOV55b 


1..659 


658/660 (99%) 




1. .660 


659/660 (99%) 



Further analysis of the NOV55a protein yielded the following properties shown in 
Table 55C. 



Table 55C. Protein Sequence Properties NOV55a 


PSort 
analysis: 


0.4500 probability located in cytoplasm; 0.3000 probability located in 
microbody (peroxisome); 0.1000 probability located in mitochondrial matrix 
space; 0.1000 probability located in lysosome (lumen) 


SignalP 
analysis: 


No Known Signal Sequence Predicted 



A search of the NOV55a protein against the Geneseq database, a proprietary 
database that contains sequences published in patents and patent publication, yielded 
several homologous proteins shown in Table 55D. 



Table 551). Geneseq Results for NOV55a 


Geneseq 
Identifier 


Protein/Organism/Length [Patent 
#, Date] 


NOV55a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Region 


Expect 
Value 




AAG74568 


. 

Human colon cancer antigen protein 
SEQ ID NO-.5332 - Homo sapiens, 
404 aa. [WO200122920-A2, 05- 
\PR-2001] 


256..659 
1..404 


399/404 (98%) 
399/404 (98%) 


0.0 
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AAE 10677 


Human NEDDl-related protein - 
Homo sapiens, 208 aa. 
[WO200172955-A2, 04-OCT-2001] 


453..611 
2..159 


145/159 (91%) 
149/159 (93%) 


4e-75 


AAM70774 


Human bone marrow expressed 
probe encoded protein SEQ ID NO: 
31080 - Homo sapiens, 67 aa. 
[WO200157276-A2, 09-AUG-2001] 


240..306 
1..67 


67/67(100%) 
67/67(100%) 


9e-31 


AAM06190 


Peptide #4872 encoded by probe for 
measuring breast gene expression - 
Homo sapiens, 67 aa. 
[WO200157270-A2, 09-AUG-2001] 


240..306 
1..67 


67/67 (100%) 
67/67(100%) 


9e-31 


ABB23122 


Protein #5121 encoded by probe for 
measuring heart cell gene 
expression - Homo sapiens, 65 aa. 
[WO200157274-A2, 09-AUG-2001] 


307..371 
1..65 


65/65 (100%) 
65/65 (100%) 


3e-29 


In a BLAST search of public sequence databases, the NOV55a protein was found to 
have homology to the proteins shown in the BLASTP data in Table 55E. 


Table 55E. Public BLASTP Results for NOV55a 


j Protein 
| Accession 
1 Number 


Protein/Organism/Length 


NOV55a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Portion 


Expect 
Value 


160167 


regulatory protein Neddl - mouse, 
660 aa. 


1..659 
1..660 


564/660 (85%) 
607/660 (91%) 


0.0 


P33215 


NEDD1 protein - Mus musculus 
(Mouse), 675 aa (fragment). 


1..659 
16..675 


564/660 (85%) 
607/660 (91%) 


0.0 

— 1 


Q9CWK2 


NEURAL PRECURSOR CELL 
EXPRESSED, 

DEVELOPMENTALLY DOWN- 
REGULATED GENE 1 - Mus 
musculus (Mouse), 660 aa. 


1..659 
1..660 


563/660 (85%) 
606/660 (91%) 


0.0 


Q9FI89 


SIMILARITY TO REGULATORY 
PROTEIN NEDD1 - Arabidopsis 
thaliana (Mouse-ear cress), 787 aa. 


8..533 
15..532 


145/550(26%) 
246/550 (44%) 


4e-40 


BAB75165 


WD-40 REPEAT PROTEIN - 
Anabaena sp. (strain PCC 7120), 
1526 aa. 


2..298 
916.. 1208 


92 307 (29%) 
147/307 (46%) 


2e-18 



PFam analysis predicts that the NOV55a protein contains the domains shown in the 
Table 55F. 
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Table 55F. Domain Analysis of NOV55a 


Pfam Domain 


NOV55a Match Region 


Identities/ 
Similarities 
for the Matched Region 


Expect Value 


WD40: domain 1 of 7 


28 61 


6/37(16%) 
27/37 (73%) 


57 j 


WD40: domain 2 of 7 


70.. 105 


10/37 (27%) 
27/37 (73%) 


0.062 1 


|WD40: domain 3 of 7 


111. .147 


9/37 (24%) 


20 | 


| WD40: domain 4 of 7 


153..190 


10/38 (26%) 
29/38 (76%) 


3.4 | 


(WD40: domain 5 of 7 


1 97..234 


7/38(18%) 
25/38 (66%) 


19 


J WD40: domain 6 of 7 


240..275 


14/37 (38%) 
28/37(76%) 


3.1 


| WD40: domain 7 of 7 


282..316 


8/37 (22%) 
26/37 (70%) 


1.3e+03 



Example 56. 



The NOV56 clone was analyzed, and the nucleotide and predicted polypeptide 
sequences are shown in Table 56A. 



Table 56A. NOV56 Sequence Analysis 




SEQIDNO:167 j 1771 bp 


NOV56a, 

CG59439-01 DNA Sequence 


- . VTG< 3 - i * >T~< -4 if T 
CCTTCCACAACATCCACCCTGCCCCTTCACAGCTGCGCTGCCGGTCTTTATCAGAATT 
TGGAGCCCCAAGATGGAATGACTATGAAGTACCGGAGGAATTTAACTTTGCAAGTTAT 
GTACTGGACTACTGGGCTCAAAAGGAGAAGGAGGGCAAGAGAGGTCCAAATCCAGCTT 
TTTGGTGGGTGAATGGCCAAGGGGATGAAGTAAAGTGGAGCTTCAGAGAGATGGGAGA 
CCTAACCCGCCGTGTAGCCAACGTCTTCACACAGACCTGTGGCCTACAACAGGGAGAC 
CATCTGGCCTTGATGCTGCCTCGAGTTCCTGAGTGGTGGCTGGTGGCTGTGGGCTGCA 
TGCGAACAGGGATCATCTTCATTCCTGCGACCATCCTGTTGAAGGCCAAAGACATTCT 
CTATCGACTACAGTTGTCTAAAGCCAAGGGCATTGTGACCATAGATGCCCTTGCCTCA 
GAGGTGGACTCCATAGCTTCTCAGTGCCCCTCTCTGAAAACCAAGCTCCTGGTGTCTG 
ATCACAGCCGTGAAGGGTGGCTGGACTTCCGATCGCTGGTTAAATCAGCATCCCCAGA 
ACACACCTGTGTTAAGTCAAAGACCTTGGACCCAATGGTCATCTTCTTCACCAGTGGG 
ACCACAGGCTTCCCCAAGATGGCAAAACACTCCCATGGGTTGGCCTTACAACCCTCCT 
TCCCAGGAAGTAGGAAATTACGGAGCCTGAAGACATCTGATGTCTCCTGGTGCCTGTC 
GGACTCAGGATGGATTGTGGCTACCATTTGGACCCTGGTAGAACCATGGACAGCGGGT 
TGTACAGTCTTTATCCACCATCTGCCACAGTTTGACACCAAGGTCATCATACAGACAT 
TGTTGAAATACCCCATTAACCACTTTTGGGGGGTATCATCTATATATCGAATGATTCT 
GCAGCAGGATTTCACCAGCATCAGGTTCCCTGCCCTGGAGCACTGCTATACTGGCGGG 
GAGGTCGTGTTGCCCAAGGATCAGGAGGAGTGGAAAAGACGGACGGGCCTTCTGCTCT 
ACGAGAACTATGGGCAGTCGGAAACGGGACTAATTTGTGCCACCTACTGGGGAATGAA 
GATCAAGCCGGGTTTCATGGGGAAGGCCACTCCACCCTACGACGTCCAGGTCATTGAT 
GACAAGGGCAGCATCCTGCCACCTAACACAGAAGGAAACATTGGCATCAGAATCAAAC 
CTGTCAGGCCTGTGAGCCTCTTCATGTGCTATGAGGGTGACCCAGAGAAGACAGCTAA 
AGTGGAATGTGGGGACTTCTACAACACTGGGGACAGAGGTAAGATGGATGAAGAGGGC 
TACATTTGTTTCCTGGGGAGGAGTGATGACATCATTAATGCCTCTGGGTATCGCATCG 
GGCCTGCAGAGGTTGAAAGCGCTTTGGTGGAGCACCCAGCGGTGGCGGAGTCAGCCGT 
i - If 1 ' " 1 r 1 i i 
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\ TTJ 1 iT - Al i S^ACTGCAGCAGCATGTCA 
AGTCAGTGACAGCCCCATACAAGTACCCAAGGAAGGTGGAGTTTGTCTCAGAGCTGCC 
AAAAACCATCACTGGCAAGATTGAACGGAAGGAACTTCGGAAAAAGGAGACTGGTCAG 
TA. 1 3 3AA 3Ai f\CTG 




( >RI Start: ATC at 13 < >RI Stop: l \ A at 1744 




SEQ ID NO: 168 |577 aa |mW at 65272.6kD 


NOV56a, 

CG59439-01 Protein Sequence 


MQWLMRFRTLWGIHKSFHNIHPAPSQLRCRSLSEFGAPRWNDYEVPEEFNFASYVLDY 
WAQKEKEGKRGPNPAFWWVNGQGDEVKWSFREMGDLTRRVANVFTQTCGLQQGDHLAL 
MLPRVP EWWLVAVG CMRTG 1 1 F I PAT ILLKAKDILYRLQLS KAKGIVT I DALAS EVDS 
IASQCPSLKTKLLVSDHSREGWLDFRSLVKSASPEHTCVKSKTLDPMVIFFTSGTTGF 
PKMAKHSHGLALQPSFPGSRKLRSLKTSDVSWCLSDSGWIVATIWTLVEPWTAGCTVF 
IHHLPQFDTKVIIQTLLKYPINHFWGVSSIYRMILQQDFTSIRFPALEHCYTGGEWL 
PKDQEEWKRRTGLLLYENYGQSETGLICATYWGMKIKPGFMGKATPPYDVQVIDDKGS 
I LPPNTEGNIGIRI KPVRPVSLFMCYEGDPEKTAKVECGDFYNTGDRGKMDEEGYI CF 
LGRSDDIINASGYRIGPAEVESALVEHPAVAESAWGSPDPIRGEWKAFIVLTPQFL 
It I r n I I 




M ( ' ID NO: k.<> l/o" hp 


NOV56b, 

CG59439-02 DNA Sequence 


GTTTCACCATGCAGTGGCTAATGAGGTTCCGGACCCTCTGGGGCATCCACAAATCCTT 
CCACAACATCCACCCTGCCCCTTCACAGCTGCGCTGCCGGTCTTTATCAGAATTTGGA 
GCCCCAAGATGGAATGACTATGAAGTACCGGAGGAATTTAACTTTGCAAGTTATGTAC 
TGGACTACTGGGCTCAAAAGGAGAAGGAGGGCAAGAGAGGTCCAAATCCAGCTTTTTG 
GTGGGTGAATGGCCAAGGGGATGAAGTAAAGTGGAGCTTCAGAGAGATGGGAGACCTA 
ACCCGCCGTGTAGCCAACGTCTTCACACAGACCTGTGGCCTACAACAGGGAGACCATC 
TGGCCTTGATGCTGCCTCGAGTTCCTGAGTGGTGGCTGGTGGCTGTGGGCTGCATGCG 
AACAGGGATCATCTTCATTCCTGCGACCATCCTGTTGAAGGCCAAAGACATTCTCTAT 
CGACTACAGTTGTCTAAAGCCAAGGGCATTGTGACCATAGATGCCCTTGCCTCAGAGG 
TGGACTCCATAGCTTCTCAGTGCCCCTCTCTGAAAACCAAGCTCCTGGTGTCTGATCA 
CAGCCGTGAAGGGTGGCTGGACTTCCGATCGCTGGTTAAATCAGCATCCCCAGAACAC 
ACCTGTGTTAAGTCAAAGACCTTGGACCCAATGGTCATCTTCTTCACCAGTGGGACCA 
CAGGCTTCCCCAAGATGGCAAAACACTCCCATGGGTTGGCCTTACAACCCTCCTTCCC 




TCAGGATGGATTGTGGCTACCATTTGGACCCTGGTAGAACCATGGACAGCGGGTTGTA 
CAGTCTTTATCCACCATCTGCCACAGTTTGACACCAAGGTCATCATACAGACATTGTT 




CAGGATTTCACCAGCATCAGGTTCCCTGCCCTGGAGCACTGCTATACTGGCGGGGAGG 
TCGTGTTGCCCAAGGATCAGGAGGAGTGGAAAAGACGGACGGGCCTTCTGCTCTACGA 

AAGCCGGGTTTCATGGGGAAGGCCACTCCACCCTACGACGTCCAGGGTGACCCAGAGA 
AGACAGCTAAAGTGGAATGTGGGGACTTCTACAACACTGGGGACAGAGGAAAGATGGA 
TGAAGAGGGCTACATTTGTTTCCTGGGGAGGAGTGATGACATCATTAATGCCTCTGGG 
TATCGCATCGGGCCTGCAGAGGTTGAAAGTGCTTTGGTGGAGCACCCAGCGGTGGCGG 
AGTCAGCCGTGGTGGGCAGCCCAGACCCGATTCGAGGGGAGGTGGTGAAGGCCTTTAT 
TGTCCTGACCCCACAGTTCCTGTCCCATGACAAGGATCAGCTGACCAAGGAACTGCAG 
CAGCATGTCAAGTCAGTGACAGCCCCATACAAGTACCCAAGGAAAGTGGAGTTTGTCT 
CAGAGCTGCCAAAAACCATCACTGGCAAGATTGAACGGAAGGAACTTCGGAAAAAGGA 
GACTGGTCAGATGTAATCGGCAGTGAACTCAGAAC 




ORF Start: ATG at 9 


OPvF Stop: TAA at 1638 




Sl-O II) \(): 170 


543 aa |mW at 61518.2kD 


jNOV56b, 

CG59439-02 Protein Sequence 


MQWLMRFRTLWGIHKSFHNIHPAPSQLRCRSLSEFGAPRWNDYEVPEEFNFASYVLDY 
WAQKEKEGKRGPNPAFWWVNGQGDEVKWSFREMGDLTRRVANVFTQTCGLQQGDHLAL 
MLPRVPEWWLVAVGCMRTGIIFIPATILLKAKDILYRLQLSKAKGIVTIDALASEVDS 
IASQCPSLKTKLLVSDHSREGWLDFRSLVKSASPEHTCVKSKTLDPMVIFFTSGTTGF 
PKMAKHSHGLALQPSFPGSRKLRSLKTSDVSWCLSDSGWIVATIWTLVEPWTAGCTVF 
IHHLPQFDTKVI IQTLLKYPIHHFWGVSS I YRMI LQQDFTS I RFPALEHCYTGGEWL 
PKDQEEWKRRTGLLLYENYGQSETGLICATYWGMKIKPGFMGKATPPYDVQGDPEKTA 
KVECGDFYNTGDRGKMDEEGYICFLGRSDDIINASGYRIGPAEVESALVEHPAVAESA 
WGSPDPIRGEWKAFIVLTPQFLSHDKDQLTKELQQHVKSVTAPYKYPRKVEFVSEL 
PKTITGKI^F K3LRKK3TGQM 



Sequence comparison of the above protein sequences yields the following sequence 
relationships shown in Table 56B. 
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Table 56B. Comparison of NOV56a against NOV56b. 


Protein Sequence 


NOV56a Residues/ 
Match Residues 


Identities/ 
Similarities for the Matched Region 


NOV56b 


1..577 
I. .543 


543/577 (94%) 
543/577 (94%) 



Further analysis of the NOV56a protein yielded the following properties shown in 
Table 56C. 



Table 56C. Protein Sequence Properties NOV56a 


PSort 
analysis: 


0.6400 probability located in microbody (peroxisome); 0.4712 probability 
located in mitochondrial matrix space; 0.1737 probability located in 
mitochondrial inner membrane; 0.1737 probability located in mitochondrial 
intermembrane space 


SignalP 
analysis: 


Likely cleavage site between residues 23 and 24 



A search of the NOV56a protein against the Geneseq database, a proprietary 
database that contains sequences published in patents and patent publication, yielded 
several homologous proteins shown in Table 56D. 



Table 561). Geneseq Results for NOV56a 


Geneseq 
Identifier 


Protein/Organism/Length 
[Patent #, Date] 


NOV56a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Region 


Expect 
Value 


AAB43245 


Human ORFX ORF3009 
polypeptide sequence SEQ ID 
NO:6018 - Homo sapiens, 537 aa. 
[WO200058473-A2, 05-OCT- 
2000] 


46..573 
1..527 


309/529 (58%) 
402/529 (75%) 


0.0 


AAM80008 


Human protein SEQ ID NO 3654 - 
Homo sapiens, 302 aa. 
[WO200157190-A2, 09-AUG- 
2001] 


331. .577 
24..302 


247/279 (88%) 
247/279 (88%) 


e-140 


AAM80007 

L - _ 


Human protein SEQ ID NO 3653 - 
Homo sapiens, 302 aa. 
[WO200157190-A2, 09-AUG- 
2001] 


331..577 
24..302 


247/279 (88%) 
247/279 (88%) 


e-140 
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AAM41894 


Human polypeptide SEQ ID NO 
6825 - Homo sapiens, 390 aa. 
[WO200153312-A1, 26-JUL-2001] 


257..573 i 
7..323 : 


193/317(60%) 
246/317(76%) 


e-116 


AAM79024 


Human protein SEQ ID NO 1686 - 
Homo sapiens, 196 aa. 
[WO200157190-A2, 09-AUG- 
2001] 


382..577 
1..196 | 


196/196(100%) 
196/196 (100%) 


e-112 


In a BLAST search of public sequence databases, the NOV56a protein was found to 
have homology to the proteins shown in the BLASTP data in Table 56E. 


Table 56E. Public BLASTP Results for NOV56a 


Protein 
Accession 
Number 


Protein/Organism/Length 


NOV56a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Portion 


Expect 
Value 


Q96A20 


MIDDLE-CHAIN ACYL-COA 
SYNTHETASE 1 (MEDIUM- 
CHAIN ACYL-COA 
SYNTHETASE) - Homo sapiens 
(Human), 577 aa. 


1..577 
1..577 


576/577 (99%) 
576/577 (99%) 


0.0 


Q9TVB5 


XENOBIOTIC/MEDIUM-CHAIN 
FATTY ACID:COA LIGASE 
FORM XL-III PRECURSOR - Bos 
taurus (Bovine), 577 aa. 


1..576 
1..576 


439/576 (76%) 
486/576 (84%) 


0.0 


Q9BEA2 


LIPOATE-ACTIX \T11Mj 
ENZYME PRECURSOR - Bos 
taurus (Bovine), 577 aa. 


1..576 
1..576 


438/576 (76%) 
485/576 (84%) 


0.0 


Q91VA0 


MEDIUM-CHAIN ACYL-COA 
SYNTHETASE (EC 6.2.1.2) 
(HYPOTHETICAL 64.8 KDA 
PROTEIN) - Mus musculus 
(Mouse), 573 aa. 


1..577 
1..573 


406/577 (70%) 
472/577 (81%) 


0.0 


070490 


KIDNEY-SPECIFIC PROT1 l\ ; - 
Rattus norvegicus (Rat), 572 aa. 


1..573 
! 1..567 


U5 ^80(54%) 
417/580 (71%) 


0.0 



PFam analysis predicts that the NOV56a protein contains the domains shown in the 
Table 56F. 
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Table 56F. Domain Analysis of NOV56a 


Pfam Domain 


NOV56a Match 
Region 


Identities/ 
Similarities 
for the Matched 
Region 


Expect 
Value 


AMP-binding: domain 1 
of] 


87..499 


106/425 (25%) 
299/425 (70%) 


2.5e-96 



Example 57. 



The NOV57 clone was analyzed, and the nucleotide and predicted polypeptide 
j==i sequences are shown in Table 57A. 

s 



Table 57 A. NOV57 Sequence Analysis 




SEQ ID NO: 171 12501 bp 


NOV57a, 

CG59354-01 DNA Sequence 


.... >vrG . .. r i.-; T t.-:." T .-; ■;>■■■:: - : — v-TVT^TAT 

CAGCAGTGAGGATGAGGACAGTGACCACGAGGACAAGGACCGAGGCAGATGTGCCCCA 

CAATGACTCTGAAGGAGTTTGCCATAATGAATGAGGACCAAGATGATGAAGAGTTTCT 
GCAGCAGTACCGGAAGCAGCGAATGGAAGAGATGCGGCAGCAGCTTCACAAGGGGCCC 
CAATTCAAGCAGGTTTTTGAGATCTCCAGTGGAGAAGGGTTTTTAGACATGATTGATA 
AAGAACAGAAAAGCATTGTCATCATGGTTCATATTTATGAGGATGGCATTCCAGGGAC 
CGAAGCCATGAATGGTTGCATGATCTGCCTTGCCGCAGAGTACCCAGCTGTCAAGTTC 
TGCAAGGTGAAGAGCTCAGTTATTGGCGCCAGCAGTCAGTTCACCAGGAATGCCCTTC 
CTGCCCTGCTGATCTATAAGGGGGGTGAATTGATCGGCAATTTTGTTCGTGTTACTGA 
CCAGCTGGGGGATGATTTCTTTGCTGTGGACCTTGAAGCTTTTCTCCAGGAATTTGGA 
TTACTCCCAGAAAAGGAAGTCTTGGTGCTGACATCTGTGCGTAACTCTGCCACGTGTC 
ACAGTGAGGATAGCGACCTGGAAATAGATTGAACTGATAGTCTAGTTGCATAGATTTC 
TCATTGTTTGGGTTGGAATACACGTCATTGTTTATTTTTGTTCCTTTGTCTTCTGGCT 


TTTCAGCTGTTCTTTGTAGTCCCTTTTATTATGCATAAAATAAAGAAATTCTTAGATT 


AAATCAGAATGCTGAATAACCTTGTAGCT -\ xAT7 \> 3TGACTTACAATTGTATAAA 




TTGTTTCCAGTCAATATTTACAAAGCATCCTAAAGACAGGGTCTTGGAAATTGTCTTC 


AGATGATCTTAGAGGTCTCTGCCAAGTCTGAGAGTATAATTCTGTAGGTATTGTGTTA 


TTTGCAACGTAAATAGTGCATTTTCTTAATCAAATGATTGTAAATTATATTTACTTGT 


AATCAGTTCCATAGCTTTAGACGGTGGTTAGATTTTTTTTTTCCCCACCAGGGTCTTG 


TTTAAAGGGGTGAGCCACCGCACCCAGTCCTGAGGGGTGGCCTCTGCTGCTGGATTTC 


ATGTCTTCCTCCAGCATGACTAAGTCTGGAACAGCAGGAAGGGTTGATGCTTACTGAC 


CTGGTGATGTTAGAAGACAAGTAGTTTATGGATTTAAACATTAGAGCTGGAGTGGGGC 


TGGAAATCTTTGTAAAGGAAGTTCTTTCAGTAAGATGCCCCTGCTTGTCTTTGTCTCT 


TTTTTTTAAAACTTTTTTTTTTTTTCATATTGGAAAAGTAATTCATATTCAGTAGAGG 


AAAACTGACCAAAACAGAAGCAAAAATAAGAAAATTAAAATAATCTCTAATCCTACTA 


CCTAGAATAAAACACTATTAATATTTTGGTCTGTTTCCTGCCAAGGTGTTTTCTGTGT 


ATACATGGATATTTTGTTTGTTTTTAAACAAAACGATGGGATCATTCTGAACATACTG 


IT T T TTAATATTCCATCGTATAGATGTGATATAATTTGC 


TTGATGGTTGTCTCTTAAAAAGAAAGATAGCAAATACTTTTTTTAAATTACAAAAGTG 


ATAGATGTTCATTGTAGAAAATGTAATAAACACTGTTAAGACTTAAAAGCCATATAAT 




TTTCTATGTATTTATAATAATTATCATTTGCGTTTTTTTCCTTTTTTTAACTTTAAAA 




CACAATTTTGATTTTCCTGGCTTTTCAGGAATTGACTAACTGTAAAAGAGTCTTGAAA 


GTATTTAGTCAACAAACAGAGTGCATTTTTTTTTTTTTGACTAAGAAAGCTCGTTGTA 




CTTATTTCCTTTCAGTCTTTTTTCAATATTTT~PT 17 _ T TTTACTGAATT 


TAGTTCTGTGCTCTTCCTTATTTAGTGTTGTATCATAAATACTTTGATGTTTCAAACA 


TTCTAAATAAATAATTTTCAGTGGCTTCATAATAAAAAAAAAAAAAAAAAAAAAAAAA 


AAAAAAA 




ORF Start: ATG at 6 


ORF Stop: TGA at 726 




SEQ ID NO: 172 


240 aa <MW at 26S66.9RD 


NOV57a, 


MTTLDDKLLGEKLQYYYSSSEDEDSDHEDKDRGRCAPASSSVPAEAELAGEGISVNTM 
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3 



CG59354-01 Protein Sequence 


QKSIVIMVHIYEDGIPGTEAMNGCMICLAAEYPAVKFCKVKSSVIGASSQFTRNALPA 
LLIYKGGELIGNFVRVTDQLGDDFFAVDLEAFLQEFGLLPEKEVLVLTSVRNSATCHS 
EDSDLEID 




SEQIDNO:173 (893 bp 


NOV57b, 

CG59354-02 DNA Sequence 


CACCATGACCACCCTGTATGATAAGTTGCTGGGGGAGAAACTGCAGTACTACTATAGC 
AGCAGTGAAGATGAGGACAGTGACCACGAGGACAAGGACCGAGGCATCTCAGTTAACA 
CAGGCCCAAAAGGTGTGATCAATGACTGGCGCCGCTTCAAGCAGTTGGAGACAGAGCA 
GAGGGAGGAGCAGTGCCGGGAGATGGAAAGGCTGATCAAGAAGCTGTCAATGACTTGC 
AGGTCCCATCTGGATGAAGAGGAGGAGCAACAGAAACAGAAAGACCTCCAGGAGAAGA 
TCAGTGGGAAGATGACTCTGAAGGAGTTTGCCATAATGAATGAGGACCAAGATGATGA 
AGAGTTTCTGCAGCAGTACCGGAAGCAGCGAATGGAAGAGATGCGGCAGCAGCTTCAC 
AAGGGGCCCCAATTCAAGCAGGTTTTTGAGATCTCCAGTGGAGAAGGGTTTTTAGACA 

CAGGGACCGAAGCCATGAATGGTTGCATGATCCGCCTTGCAAGGGGGGTGAATTGATC 
GGCAATTTTGTTCGTGTTACTGACCAGCTGGGGGATGATTTCTTTGCTGTGGACCTTG 
AAGCTTTTCTCCAGGAATTTGGATTACTCCCAGAAAAGGAAGTCTTGGTGCTGACATC 

GATAGTCTAGTTGCATATAGATTTCTCATTGTTTGGGTTGGAATACACCATTGTTTAT 


TTTTGTTCCTTTGTCTTCTGGCTTTTCAGCTGTTCTTTGTAGTCCCTTTTATTATGCA 


TAAAATAAAGAAATTCTTAGATT 




ORF Start: ATG at 5 


ORF Stop: TGA at 749 




SEQIDNO: 174 


248 aa ) MW at 29227.4kD 


NOV57b, 

CG59354-02 Protein Sequence 


MTTLYDKLLGEKLQYYYSSSEDEDSDHEDKDRGISVNTGPKGVINDWRRFKQLETEQR 
EEQCREMERLIKKLSMTCRSHLDEEEEQQKQKDLQEKISGKMTLKEFAIMNEDQDDEE 
FLQQYRKQRMEEMRQQLHKGPQFKQVFEISSGEGFLDMIDKEQKSIVIMVHIYEDGIR 
DRSHEWLHDPPCKGGELIGNFVRVTDQLGDDFFAVDLEAFLQEFGLLPEKEVLVLTSV 
RNSATCHSEDSDLEID 




SEQIDNO: 175 1 891 bp 


NOV57c, 

CG59354-03 DNA Sequence 


CACCATGACCACCCTGTATGATAAGTTGCTGGGGGAGAAACTGCAGTACTACTATAGC 

CAGGCCCAAAAGGTGTGATCAATGACTGGCGCCGCTTCAAGCAGTTGGAGACAGAGCA 
GAGGGAGGAGCAGTGCCGGGAGATGGAAAGGCTGATCAAGAAGCTGTCAATGACTTGC 
AGGTCCCATCTGGATGAAGAGGAGGAGCAACAGAAACAGAAAGACCTCCAGGAGAAGA 
TCAGTGGGAAGATGACTCTGAAGGAGTrTGCCATAATGAATGAGGACCAAGATGATGA 
AGAGTTTCTGCAGCAGTACCGGAAGCAGCGAATGGAAGAGATGCGGCAGCAGCTTCAC 
AAGGGGCCCCAATTCAAGCAGGTTTTTGAGATCTCCAGTGGAGAAGGGTTTTTAGACA 
TGATTGATAAAGAACAGAAAAGCATTGTCATCATGGTTCATATTTATGAGGATGGCAT 
TCCAGGGACCGAAGCCATGAATGGTTGCATGATCCGCCTTGCCGCAGAGTACCCAGCT 

ATGCCCTTCCTGCCCTGCTGATCTATAAGGGGGGTGAATTGATCGGCAATTTTGTTCG 
TGTTACTGACCAGCTGGGGGATGATTTCTTTGCTGTGGACCTTGAAGCTTTTCTCCAG 
GAATTTGGATTACTCCCAGAAAAGGAAGTCTTGGTGCTGACATCTGTGCGTAACTCTG 
CCACGTGTCACAGTGAGGATAGCGACCTGGAAATAGATTGAACTGATAGTCTAGTTGC 
ATAGATTTCTCATTGTTTGGG 




ORF Start: ATG at 5 


ORF Stop: TGA at 851 




SEQIDNO: 176 


282 aa |MW at 32598.5kD 


NOV57c, 

CG593 54-03 Protein Sequence 


MTTLYDKLLGEKLQYYYSSSEDEDSDHEDKDRGISVKTGPKGVINDWRRFKQLETEQR 
EEQCREMERLIKKLSMTCRSHLDEEEEQQKQKDLQEKISGKMTLKEFAIMNEDQDDEE 
FLQQYRKQRMEEMRQQLHKGPQFKQVFEISSGEGFLDMIDKEQKSIVIMVHIYEDGIP 
GTEAMNGCMIRLAAEYPAVKFCKVKSSVIGASSQFTRNALPALLIYKGGELIGNFVRV 
TDQLGDDFFi iLSAFLQEFGLLPE - ] L I E r _ 1 D 



Sequence comparison of the above protein sequences yields the following sequence 
relationships shown in Table 57B. 
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Table 57B. Comparison of NOV57a against NOV57b through NOV57c. 


Protein Sequence 


NOV57a Residues/ 
Match Residues 


Identities/ 
Similarities for the Matched Region 


NOV57b 


58..240 
100..248 


138/183 (75%) 
140/183 (76%) 


NOV 5 7c 


58..240 
100..282 


182/183 (99%) 
182/183 (99%) 



Further analysis of the NOV57a protein yielded the following properties shown in 
Table 57C. 



Table 57C. Protein Sequence Properties NOV57a 


PSort 
analysis: 


0.6500 probability located in cytoplasm; 0.1000 probability located in 
mitochondrial matrix space; 0.1000 probability located in lysosome (lumen); 
0.0000 probability located in endoplasmic reticulum (membrane) 


SignalP 
analysis: 


No Known Signal Sequence Predicted 

^ _ - 



A search of the NOV57a protein against the Geneseq database, a proprietary 
database that contains sequences published in patents and patent publication, yielded 
several homologous proteins shown in Table 57D. 



Table 57D. Geneseq Results lor NO\ 57;i 


Geneseq 
Identifier 


Protein/Organism/Length [Patent 
#, Date] 


NOV57a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Region 


Expect 
Value 


AAE03537 


Human secreted protein variant, 
SEQ ID NO: 228 - Homo sapiens, 
301 aa. [WO200132675-A1, 10- 
MAY-2001] 


1 ..240 
1..301 


226/301 (75%) 
230/301 (76%) 


e-117 


AAY99657 


Human GTPase associated protein- 
8 - Homo sapiens, 301 aa. 
[WO200031263-A2, 02-JUN-2000] 


1..240 
1..301 


226/301 (75%) 
230/301 (76%) 


e-117 


AAE02004 


Fruitfly viral IAP-associated factor 
(VIAF) - Drosophila melanogaster, 
240 aa. [WO200134798-A1, 17- 
M4Y-2001] 


55..214 
59.213 


52/161 (32%) 
86/161 (53%) 


3e-14 
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AAE02003 


Zebrafish viral IAP-associated 
factor (VIAF) - Brachydanio rerio, 
239 aa. [WO200134798-A1, 17- 
MAY-2001] 


| 21. .236 
j 2..237 


69/241 (28%) 
117/241 (47%) 


5e-13 


AAE02002 


Mouse viral IAP-associated factor 
(VIAF) - Mus musculus, 240 aa. 
[WO200134798-A1, 17-MAY- 
2001] 


| 58..240 
1 52..240 

i 

I _ 


* 59/195 (30%) 
99/195 (50%) 


4e-12 


In a BLAST search of public sequence databases, the NOV57a protein was found to 
have homology to the proteins shown in the BLASTP data in Table 57E. 


Table 57E. Public BLASTP Results for NOV57a 


Protein 
Accession 
Number 


Protein/Organism/Length 

1 

-4 


NOV57a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Portion 


Expect 
Value 


Q96AF1 


HYPOTHETICAL 34.3 KDA 
PROTEIN - Homo sapiens 
(Human), 301 aa. 


1..240 
1..301 


226/301 (75%) 
230/301 (76%) 


e-117 


Q13371 


Phosducin-like protein (PHLP) - | 
Homo sapiens (Human), 301 aa. 


1..240 
1..301 


225/301 (74%) 
230/301 (75%) 


e-116 


T17321 


hypothetical protein 1 
DKFZp5 64M 1 863 . 1 - human, | 
301 aa. 


1 ..240 
1..301 


225/301 (74%) 
230/301 (75%) 


e-116 


Q923E8 


RIKEN CDNA 120001 1E13 ] 
GENE - Mus musculus (Mouse), 1 
301 aa. 


1..240 
1..301 


210/301 (69%) 
223/301 (73%) 


e-109 


Q63737 


Phosducin-like protein (PHLP) - s 
Rattus norvegicus (Rat), 301 aa. 


1..240 
1..301 


210/301 (69%) 
223/301 (73%) 


e-108 



PFam analysis predicts that the NOV57a protein contains the domains shown in the 
Table 57F. 
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Table 57F. Domain Analysis of NOV57a 




Pfam Domain 


NOV57a Match 
Region 


Identities/ 
Similarities 
for the Matched 
Region 


Expect 
Value 


Phosducin: domain 1 of 

2 


35..57 


14/23 (61%) 
21/23 (91%) 


8 7e-08 


Phosducin: domain 2 of 
2 


5 8.. 240 


133/183 (73%) 
174/183 (95%) 


9.7e-148 



Example 58. 

The NOV58 clone was analyzed, and the nucleotide and predicted polypeptide 
sequences are shown in Table 58A. 



G 



Table 58 A. NOV58 Sequence- Analysis 




SI y ID NO: 177 


756 bp 


NOV58a, 

CG593 19-01 DNA Sequence 


GGATCCCAATGAAGATACAGAATGGAATGACATTTTAAGAGATTTCGGCATTCTTCCT 
CCTAAAGAAGAGTCAAAAGATGAAATTGAAGAAATGGTTTTACGTTTACAGAAAGAAG 
CAATGGTGAAACCATTTGAAAAGATGACTCTTGCACAGCTAAAGGAAGCTGAAGATGA 
ATTCGATGAAGAAGATATGCAGGCTGTTGAAACATATAGAAAGAAGCGGTTACAGGAA 
TGGAAAGCTCTTAAGAAAAAACAAAAATTTGGAGAATTAAGAGAAATTTCTGGAAATC 
AGTATGTGAATGAAGTCACAAATGCAGAAGAAGATGTGTGGGTTATAATTCATCTATA 
CAGATCAAGCATCC CAATGTGTTTGTTGGTTAACCAGCATCTTAGT CTTCTAGCAAGA 
AAGTTTCCAGAAACTAAATTTGTTAAAGCCATCGTGAATAGCTGTATTCAACACTACC 
ATGACAATTGTTTACCAACAATTTTTGTGTATAAAAATGGTCAGATAGAAGCCAAATT 
CATTGGAATTATAGAATGTGGAGGGATAAATCTCAAGCTGGAAGAACTTGAATGGAAG 
CTAGCAGAAGTTGGAGCAATACAGACTGATTTGGAAGAAAACCCCAGAAAAGACATGG 
TAGATATGATGGTATCTTCAATTAGAAACACTTCTATTCATGATGACAGTGATAGCTC 
CAACAGTGATAATGATACCAAATAGAGAGAAATATTCAATAAATAGCTTTTAGTAAAA 
AA 




ORF Start: GAT at 2 


ORF Stop: TAG at 719 




SEQIDNO: 178 


239 aa |MW at 27811.3kD 


NOV58a, 

CG593 19-01 Protein Sequence 


DPNEDTEWNDILRDFGILPPKEESKDEIEEMVLRLQKEAMVKPFEKMTLAQLKEAEDE 
FDEEDMQAVETYRKKRLQEWKALKKKQKFGELRE I SGNQYVNEVTNAEEDVWVI IHLY 
RSS I PMCLLVNQHLSLLARKFPETKFVKAI VNSC I QHYHDNCLPTIFVYKNGQIEAKF 
IG I IECGGINLKLEELEWKLAEVGAI QTDLEENP RKDMVDMMVSS I RNTS I HDDSDSS 
NSDNDTK 




SPOIDNO: 179 


745 bp 


NOV58D, 

CG593 19-02 DNA Sequence 


GGATCCCAATGAAGATACAGAATGGATCCCAATGAAGATACAGAATGGAATGACATTT 


TAAGAGATTTCGGCATTCTTCCTCCTAAAGAAGAGTCAAAAGATGAAATTGAAGAAAT 
GGTTTTACGTTTACAGAAAGAAGCAATGGTGAAACCATTTGAAAAGATGACTCTTGCA 

CAGCTAAAGGAAGCTGAAGATGAATTTGATGAAGAAGATATGCAGGCTGTTGAAACAT 
ATAGAAAGAAGCGGTTACAGGAATGGAAAGCTCTTAAGAAAAAACAAAAATTTGGAGA 
ATTAAGAGAAATTTCTGGAAATCAGTATGTGAATGAAGTCACAAATGCAGAAGAAGAT 
GTGTGGGTTATAATTCATCTATACAGATCAAGCATCCCAATGTGTTTGTTGGTTAACC 
AGCATCTTAGTCTTCTAGCAAGAAAGTTTCCAGAAACTAAATTTGTTAAAGCCATCGT 
GAATAGCTGTATTCAACACTACCATGACAATTGTTTACCAACAATTTTTGTGTATAAA 
AATGGTCAGATAGAAGCCAAATTCATTGGAATTATAGAATGTGGAGGGATAAATCTCA 
AGCTGGAAGAACTTGAATGGAAGCTAGCAGAAGTTGGAGCAATACAGACTGATTTGGA 
AGAAAACCCCAGAAAAGACATGGTAGATATGATGGTATCTTCAATTAGAAACACTTCT 
ATCCATGATGACAGTGATAGCTCCAACAGTGATAATGATACCAAATAGA 




ORF Start: ATG at 22 


OKI Stop: 1 \(i at 742 




SEQIDNO: 180 


1 iti MWat27942.5kD 
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NOV58b, 

CG593 19-02 Protein Sequence 



MDPNEDTEWNDILRDFGILPPKEESKDEIEEMVLRLQKEAMVKPFEKMTLAQLKEAED 
EFDEEDMQAVETYRKKRLQEWKALKKKQKFGELREISGNQYVNEVTNAEEDVWVIIHL 
YRSSIPMCLLVNQHLSLLARKFPETKFVKAIVNSCIQHYHDNCLPTIFVYKNGQIEAK 
FIGIIECGGINLKLEELEWKLAEVGAIQTDLEENPRKDMVDMMVSSIRNTSIHDDSDS 



Sequence comparison of the above protein sequences yields the following sequence 
relationships shown in Table 58B. 



Table 58B. Comparison of NOV58a against NO V58b. 


Protein Sequence 


NOV58a Residues/ 
Match Residues 


Identities/ 
Similarities for the Matched Region 


NOV58b 


1..239 
2..240 


216/239 (90%) 
216/239 (90%) 



Further analysis of the NOV58a protein yielded the following properties shown in 
Table 58C. 



Table 58C. Protein Sequence Properties NOV58a 



PSort 
analysis: 


0.8800 probability located in nucleus; 0.1000 probability located in 
mitochondrial matrix space; 0.1000 probability located in lysosome (lumen); 
0.0000 probability located in endoplasmic reticulum (membrane) 


SignalP 
analysis: 


No Known Signal Sequence Predicted 



A search of the NOV58a protein against the Geneseq database, a proprietary 
database that contains sequences published in patents and patent publication, yielded 
several homologous proteins shown in Table 58D. 



Table 58D. Geneseq Results for NOV58a 


Geneseq 
Identifier 


Protein/Organism/Length [Patent 
#, Date] 


NOV58a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Region 


Expect 
Value 


AAE02003 


Zebrafish viral IAP-associated 
factor (VIAF) - Brachydanio rerio, 
239 aa. [WO200134798-A1, 17- 
MAY-2001] 


1..237 
3..239 


133/238 (55%) 
181/238(75%) 


3e-75 


AAU27979 


Mouse contig polypeptide sequence 
#1 32 - Mus musculus, 243 aa. 
[WO2001 64834- A2, 07-SEP-2001] 


1..231 
7.. 240 




137/234 (58%) 
176/234 (74%) 


2e-74 
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AAU27807 


Human full-length polypeptide 
sequence #132 - Mus musculus, 
239 aa. [WO200164834-A2, 07- 
SEP-2001] 


1..231 
3..236 


137/234 (58%) 
176/234 (74%) 


2e-74 


AAE02001 


Human viral IAP-associated factor 
(VIAF) - Homo sapiens, 239 aa. 
[WO200134798-A1, 17-MAY- 
2001] 


1..231 
3..236 


137/234 (58%) 
176/234 (74%) 


2e-74 


AAB68507 


Human GTP-binding associated 
protein #7 - Homo sapiens, 239 aa. 
[ WO200I05970-A2, 25-JAN-2001 | 


1..231 
3..236 


137/234 (58%) 
176/234 (74%) 


2e-74 



In a BLAST search of public sequence databases, the NOV58a protein was found to 
have homology to the proteins shown in the BLASTP data in Table 58E. 



Table 58E. Public BLASTP Results for NOV58a 



Protein 
Accession 
Number 


Protein/Organism/Length 


NOV58a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Portion 


Expect 
Value 


Q9CQU4 


1700010B22RIK l>RO 1 EIN - Mus 
musculus (Mouse), 240 aa. 


1..239 
3..240 


208/239 (87%) 
229/239 (95%) 


e-121 


Q9WUP3 


PDCL2 - Mus musculus (Mouse), 
238 aa (fragment). 


1..239 
1..238 


207/239 (86%) 
228/239 (94%) 


e-121 


Q9DA99 


17 116K07RIK PROTEIN - Mus 

musculus (Mouse), 1 92 aa. 


47..239 
1..192 


. 165/193 (85%) 
183/193 (94%) 


3e-94 


CAC40345 


SEQUENCE 5 FROM PATENT 
WOO 134798 - Brachydanio rerio 
(Zebrafish) (Zebra danio), 239 aa. 


1..237 
3-239 


133/238 (55%) 
181/238 (75%) 


le-74 


Q9H2J4 


HTPHLP (UNKNOWN) 
(PROTEIN FOR MGC:3062) - 
Homo sapiens (Human), 239 aa. 


1-231 
3-236 


137/234 (58%) 
176/234 (74%) 




8e-74 



PFam analysis predicts that the NOV58a protein contains the domains shown in the 
Table 58F. 
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Table 58F. Domain Analysis of NOV58a 


Pfam Domain 


NOV58a Match 
Region 


Identities/ 
Similarities 
for the Matched 
Region 


Expect 
Value 


Phosducin: domain 1 of 
1 


60.. 175 


32/120(27%) 
55/120 (46%) 


5.8 



Example 59. 

The NOV59 clone was analyzed, and the nucleotide and predicted polypeptide 
sequences are shown in Table 59A. 



Table 59A. NOV59 Sequence \nalysis 




SEQIDNO: 181 


981 bp 


NOV59a, 

CG59576-01 DNA Sequence 


GCC 7TTTTT^T( T' T 1 ^ T 1 l "I 

CCATCATTGGTAGTCTTATGGTGTTCTTTGCCATCAAACTGGATTTCTGCCTGCACAG 

CTCCTTCTATTTCTTCATCAGTGTCCTCTCCTTCCTAGAGATCTGGTATACCACCATC 

ACCATCCCCAAGATGTTCTTCAACCTAGCCAGTGAGCAGAAGACCACCTCCCTGGATG 

GTTGCCTATTGCAGATGTATTTCTTTTACTCCCTCGGCATCACTGAGGTTTGCTTGCT 

CACCACCAGGGCTATGGACAGATACCTGGCCATCTGTAATCACCTTTGCTACCCCACA 

GTCACGACACCTCAGCTCTACACTCAGGTGATTCTAGGTTGTTGCATCTGTGGCTTCT 

TCACGCTGCTCCCTGAGATTGCTTGGATATCCACACTGCCATTTTGTGGTCCAAATCA 

AATCCACAACATTTTCTGTGACCTTGATCCTATCCTGAATCTAGCATGTGTAGACACT 

GGCCCAGTTGTTTTAATCAAGGTTGTGGACATTGTACATGCTGTGGAGATCATCACAG 

CTATAATGCTTGTGACTTTGGCTTACGTCCAAATTATTGCAGTGATCCTAAGAAACTG 

CTCTGCTGATGGATGCCAAAAGGCATTTTCTACCTATGCTTTCCACCTTGCTATTTTC 

TTAATCTTTTTTGGAAGTGTAGCCCTGATGTACCTGCTCTTCTCTGCCAAGTACTCCT 

TTTTCTGGGACACAACCATCAGCCTAATGTTTGCAGTGCTGTCACCGACAACAATCAT 

CTGTAGTCTGAGGAATAAAGAGATAAAGGAAGCAATAAAAAAGCACATGTGCCAATCA 

ATGATATGCACACATCATGTCAAATAAGACCAAATACACACCTCTTAATTACCAAAGA 




ATATTTATACAAATATTTACATTAATACGTTCAGTGTGTTTGTTGCTGCTGTG 




ORF Start: GCC at 1 


ORF Stop: TAA at 895 




SEQIDNO: 182 


298 aa jMW at 33780.0kD 


NOV59a, 

CG59576-01 Protein Sequence 


ATAPSWLLFFILLLLIYLFTIIGSLMVFFAIKLDFCLHSSFYFFISVLSFLEIWYTTI 
TIPKMFFNIASEQKTTSLDGCLLQMYFFYSLGITEVCLLTTRAMDRYLAICNHLCYPT 
VTTPQLYTQVILGCCICGFFTLLPEIAWISTLPFCGPNQIHNIFCDLDPILNLACVDT 
GPWLIKWDIVHAVEIITAIMLVTLAYVQIIAVILRNCSADGCQKAFSTYAFHLAIF 
LIFFGSVALMYLLFSAKYSFFWDTTISLMFAVLSPTTIICSLRNKEIKEAIKKHMCQS 
MICTHHVK 



Further analysis of the NOV59a protein yielded the following properties shown in 
Table 59B. 



Table 59B. Protein Sequence Properties NOV59a 


PSort 
analysis: 


0.6400 probability located in plasma membrane; 0.4600 probability located in 
Golgi body; 0.3700 probability located in endoplasmic reticulum (membrane); 
0.1000 probability located in endoplasmic reticulum (lumen) 


SignalP 
analysis: 


Likely cleavage site between residues 25 and 26 
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A search of the NOV59a protein against the Geneseq database, a proprietary 
database that contains sequences published in patents and patent publication, yielded 
several homologous proteins shown in Table 59C. 



r -^„ J ._,__._. .^^——..^—^ 

Table 59C. Geneseq Results for NOV59a 



Geneseq 
Identifier 


T NOV59a 

Protein/Organism/Length [Patent j Residues/ 
#, Date] j Match 
| Residues 


Identities/ 
Similarities for 
the Matched 
Region 


Expect 
Value 


AAG72586 


Human OR-like polypeptide query 
sequence, SEQ ID NO: 2267 - 
Homo sapiens, 289 aa. 
[WO200127158-A2, 19-APR-2001] 


7..295 
1..289 


286/289 (98%) 
286/289 (98%) 


e-167 


AAG71784 


Human olfactory receptor 
polypeptide, SEQ ID NO: 1465 - 
Homo sapiens, 289 aa. 
[WO200127158-A2, 19-APR-2001] 


7..295 
1..289 


286/289 (98%) 
286/289 (98%) 


e-167 


AAG71785 


Human olfactory receptor 
polypeptide, SEQ ID NO: 1466 - 
Homo sapiens, 318 aa. 
[WO200127158-A2, 19-APR-2001] 


5..292 
20..311 


175/293 (59%) 
217/293 (73%) 


6e-95 


AAU24721 


Human olfactory receptor 
AOLFR220 - Homo sapiens, 343 aa. 
[WO200168805-A2, 20-SEP-2001] 


7..283 
53.328 


170/279 (60%) 
212/279 (75%) 


4e-94 


AAG71808 


Human olfactory receptor 
polypeptide, SEQ ID NO: 1489 - 
Homo sapiens, 317 aa. 
[ \V( )200 i 27 1 5S-A2, 1 9-APR-2001 ] 


7..283 
29..304 


170/279 (60%) 
212/279 (75%) 


4e-94 



In a BLAST search of public sequence databases, the NOV59a protein was found to 
have homology to the proteins shown in the BLASTP data in Table 59D. 



Table 591). Public BL \S1 P Results for NO'S 59a 


Protein 
Accession 
Number 


Protein/Organism/Length 


NOV59a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Portion 


Expect 
Value 


Q96R35 


OLFACTORY RECEPTOR - 
Homo sapiens (Human), 216 aa 
(fragment). 


50..267 
1.216 


107/218 (49%) • 
146/218(66%) 


7e-55 
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Q9EPG2 


M51 OLFACTORY RECEPTOR - 
Mus musculus (Mouse), 314 aa. 


2..285 
19..303 


115/289 (39%) 
172/289 (58%) 


2e-52 


095007 


Olfactory receptor 6B1 (Olfactory 
receptor 7-3) (OR7-3) - Homo 
sapiens (Human), 311 aa. 


10..285 
28..301 


109/279 (39%) 
170/279 (60%) 


le-51 


Q9QWU6 


OLFACTORY RECEPTOR 17 - 
Mus musculus (Mouse), 327 aa. 


1..289 
20..314 


111/298 (37%) 
171/298 (57%) 


2e-50 


P23270 


Olfactory receptor-like protein 17 - 
Rattus norvegicus (Rat), 327 aa. 


1..289 
20..314 


1 1 1/298 (37%) 
171/298(57%) 


2e-50 



PFam analysis predicts that the NOV59a protein contains the domains shown in the 
Table 59E. 



Table 59E. Domain Analysis of NOV59a 


Pfam Domain 


NOV59a Match Region 


Identities/ 
Similarities 
for the Matched Region 


Expect Value 


7tm_l : domain 1 of 1 


37..164 


30/134(22%) 
90/134 (67%) 


5.4e-13 



Example 60. 



The NOV60 clone was analyzed, and the nucleotide and predicted polypeptide 
sequences are shown in Table 60A. 



Table 60A. NOV60 Sequence Analysis 




SEQ ID NO: 183 


1201 bp 


NOV60a, 

CG59557-01 DNA Sequence 


AGGATAACTTTATATGTTGCAAAATGACTCACATAGTATATTTTATTTAACCAGCCTA 


ATTTCAAGGCTGTT 1 T GTTTTTATTTGTTCTTTGCATGTACT 
TAGAATGCTGACTGTGTTTTATGAGCCAACAAGTGAAACCGCTGAAAATATGGATCCA 
GAGAATCAGACAATGGTGACTGAGTTTTATTTCTCTGATTTTCCTCAATCTAAGAATG 
GCAGCCTCTTATTCTTCATTCCTATGCTCTTTATTTATATATTCATTCTTGTTGGAAA 
TTTCATGATTTTCTTTGCTGTCCGACCGGACCCCCATCTCCATAATCCTATGTACAGT 
TTTATCAGTGTCTTCTCCTTCCTGGAGATTTGGTACACCACCGTGACTATCCCCAAGA 
TGCTCTCCAACCTTCTCAGTGAACAGAAAACCATCTCTTTCATAGGTTGCCTCCTGCA 
GATGTACTTCTTCCACTCACTCGGGGTCACAGAAGCCCTAGTCCTCACAGTGATGGCC 
ATTGACAGGTGTGTAGCCATCTGCAACCCCCTTCGCTATGCAATCACTATGTCCCCTA 
GACTGTGCATCCAGCTCTCCACTGGCTCTTGCATTTTTGGCTTCCTCATGTTACTGCC 
AGAGATTGTGTGCATTTCCACTCTTCCATTCTGTGGCGCCAACCAAATTCATCAACTC 
TTTTGTGACTTTGAACCTGTGCTGCAGTTAGCCTGCACAGATACGTACATAATTCTGG 
TTGAAGATGTGATCCGTGCTATTTCCATTCTGACCTCTGTCTCTGTCATCACCCTTTT 
CTATTTAAGAATCATCACGGTGATCCTGAGGATTCCCTCTGGTGAGAGTCGTCAGAAG 
GCTTTCTTCACATGTGCAGCCCACATTGCTATTTTCTTGCTGTTTTTTGGCAGTGTGT 
CACTCATGTATCTGCGCTTCTCTGTCACATTCCCACCATTACTGGACAAGGCCATTGC 
ACTGATGTTTGCTGTCCTTGCCCTACTTTTCAACCCAGTAATCTATAGTCTGAGGAAC 
AAAGATATGAAAAACGCCACCAAGAAAATCCTCTGTTCTCAAAAGATGTTCAATGCCT 
CTGGGAGCTAATGGAGTTCACACACACCTCTTCAAAGAAATCTCATCATCTCCTTAAG 


'i I rAAAATGCTAACAAATCAGTTTTTTTAAATTACCATGCA 




ORF Start: ATG at 121 


ORF Stop:TAAat 1111 




SEQ ID NO: 1 84 


330 aa MW at 37439 lkD 


NOV60a, 


MLTVFYEPTSETAENMDPENQTMVTEFYFSDFPQS KNGSLLFF I PMLFIYI F I LVGNF 
MIFFAVRPDPHLHNPMYSFISVFSFIiEIWYTTVTIPKMLSNLLSEQKTIS^IG ILLQi 1 
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CG5 9557-01 Protein Sequence 



YFFHSLGVTEALVLTVMAIDRCVAICNPLRYAITMSPRLCIQLSTGSCIFGFLMLIiPE 
IVCISTLPFCGANQIHQLFCDFEPVLQLACTDTYIILVEDVIRAISILTSVSVITLFY 
LRIITVILRIPSGESRQKAFFTCAAHIAIFLLFFGSVSLMYLRFSVTFPPLLDKAIAL 
MFAVLALLFNPVIYSLRNKDMKNATKKILCSQKMFNASGS 



Further analysis of the NOV60a protein yielded the following properties shown in 
Table 60B. 



Table 601$. Protein Sequence Properties NOV60a 


PSort 
analysis: 


0.6000 probability located in plasma membrane; 0.4000 probability located in 
Golgi body; 0.3000 probability located in endoplasmic reticulum (membrane); 
0.0300 probability located in mitochondrial inner membrane 


SignalP 
analysis: 


Likely cleavage site between residues 67 and 68 



A search of the NOV60a protein against the Geneseq database, a proprietary 
database that contains sequences published in patents and patent publication, yielded 
several homologous proteins shown in Table 60C. 



Table 60C. Geneseq Results for NOV60a 



Geneseq 
Identifier 


Protein/Organism/Length [Patent 
#, Date] 


NOV60a 1 Identities/ 
Residues/ | Similarities for 

Match j the Matched 
Residues j Region 


Expect 
Value 


AAG71807 


Human olfactory receptor 
polypeptide, SEQ ID NO: 1488 - 
Homo sapiens, 3 1 9 aa. 
[WO200I27158-A2, 19-APR-2001] 


16..330 
1-315 


313/315 (99%) 
314/315 (99%) 


e-180 


AAG71803 


Human olfactory receptor 
polypeptide, SEQ ID NO: 1484 - 
Homo sapiens, 315 aa. 
[ WO200 1271 58-A2, 1 9-APR-2001] 


16. .329 
1..314 


219/314(69%) 
259/314(81%) 


e-129 
e-128 


AAU24658 


Human olfactory receptor 
AOLFR156 - Homo sapiens, 331 
aa. [WO200168805-A2, 20-SEP- 
2001] 


9..329 
10..330 


218/321 (67%) 
259/321 (79%) 


AAU24721 


Human olfactory receptor 
AOLFR220 - Homo sapiens, 343 
aa. [WO200168805-A2, 20-SEP- 
2001] 


20..329 
33..342 


196/310(63%) 
234/310(75%) 


e- 1 1 1 


AAG71808 


Human olfactory receptor 


20. .323 
9..3 12 


195/304 (64%) 
232/304 (76%) 


e-111 
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Homo sapiens, 3 1 7 aa. 

[\\ 02001271 5 8-A2, 19-APR-200 1 J 

In a BLAST search of public sequence databases, the NOV60a protein was found to 
have homology to the proteins shown in the BLASTP data in Table 60D. 



Table 60D. Public BLASTP Results for NOV60a 


Protein 

Accession 
Number 


Protein/Organism/Length 


NOV60a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Portion 


Expect 
Value 


Q9WU86 


ODORANT RECEPTOR SI - 
Mus musculus (Mouse), 324 aa. 


15..324 
8..320 

— — J 


135/315 (42%) 
188/315 (58%) 


4e-67 


Q9EPG2 


M5 1 OLFACTORY RECEPTOR 
- Mus musculus (Mouse), 3 14 aa. 


20..325 
5. .311 


129/307 (42%) 
189/307 (61%) 


4e-65 


P23270 


Olfactory receptor-like protein 17 
- Rattus norvegicus (Rat), 327 aa. 


24..319 
10..310 


126/301 (41%) 
182/301 (59%) 


8e-65 


Q9QWU6 


OLFACTORY RECEPTOR 17 - 
Mus musculus (Mouse), 327 aa. 


16..319 
L.310 


128/310 (41%) 
184/310(59%) 


9e-64 


013036 


CHICK OLFACTORY 
RECEPTOR 7 - Gallus gallus 
(Chicken), 323 aa. 


16.319 
1..305 


122/305 (40%) 
187/305 (61%) 


le-63 



PFam analysis predicts that the NOV60a protein contains the domains shown in the 
Table 60E. 



Table 60E. Domain Analysis of NOV60a 


Pfam Domain 


1 Identities/ 
NOV60a Match Region j Similarities 

| for the Matched Region 


Expect Value 


7tm_l : domain 1 of 1 


56..304 1 45/270 (17%) 
1 72/270 (64%) 


2.4e-21 



Example 61. 



The NOV61 clone was analyzed, and the nucleotide and predicted polypeptide 
sequences are shown in Table 61 A. 
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Table 61A. NOV61 Sequence Analysis 




SEQIDNO:185 1 1061 bp 


NOV61a, 

CG59555-01 DNA Sequence 




ATGGTCACAGAGTTCCTCCTACTGGGATTTCTCCTGGGCCCAAGGATTCAGATGCTCC 
TCTTTGGGCTCTTCTCCCTGTTCTATATCTTCACCCTGCTGGGGAACGGGGCCATCCT 
GGGGCTCATCTCACTGGACTCCAGACTCCACACCCCCATGTACTTCTTCCTCTCACAC 
CTGGCTGTCGTCGACATCGCCTACACCCGCAACACGGTGCCCCAGATGCTGGCGAACC 
TCCTGCATCCAGCCAAGCCCATCTCCTTTGCTGGCTGCATGACGCAGACCTTTCTCTG 
TTTGAGTTTTGGACACAGCGAATGTCTCCTGCTGGTGCTGATGTCCTACGATCGTTAC 
GTGGCCATCTGCCACCCTCTCCGATACTCCGTCATCATGACCTGGAGAGTCTGCATCA 
CCCTGGCCGTCACTTCCTGGACGTGTGGCTCCCTCCTGGCTCTGGCCCATGTGGTTCT 
CATCCTAAGACTGCCCTTCTCTGGGCCTCATGAAATCAACCACTTCTTCTGTGAAATC 
CTGTCTGTCCTCAGGCTGGCCTGTGCTGACACCTGGCTCAACCAGGTGGTCATCTTTG 
CAGCCTGCGTGTTCTTCCTGGTGGGGCCACCCAGCCTGGTGCTTGTCTCCTACTCGCA 
CATCCTGGCGGCCATCCTGAGGATCCAGTCTGGGGAGGGCCGCAGAAAGGCCTTCTCC 
ACCTGCTCCTCCCACCTCTGCGTGGTGGGACTCTTCTTTGGCAGTGCCATCATCATGT 
ACATGGCCCCCAAGTCCCGCCATCCTGAGGAGCAGCAAAAGGTCTTTTTTCTATTTTA 
CAGTTTTTTCAACCCAACACTTAACCCCCTGATTTACAGCCTGAGGAACGGAGAGGTC 
AAGGGTGCCCTGAGGAGAGCACTGGGCAAGGAAAGTCATTCCTAACTGGTGTGACATT 
TGACTCTCCCTCCTCAGTCATCTCCTGGAATCTTGGTACCAAATACCACCTAAGTTCA 


CTACTCTCTTTATATCA 


— 


ORF Start: ATG at 41 jORF Stop: TAA at 971 




SEQIDNO:186 |310aa ]MW at 34713.8kD 


NOV61a, 

CG59555-01 Protein Sequence 


MGENQTMVTEFLLLGFLLGPRI QMLLFGLFSLFY I FTLLGNGAI LGL I SLDSRLHTPM 
YFFLSHLAWDIAYTRNTVPQMLANLLHPAKPISFAGCMTQTFLCLSFGHSECLLLVL 
MSYDRYVAICHPLRYSVIMTWRVCITLAVTSWTCGSLLALAHWLILRLPFSGPHEIN 
HFFCEILSVLRLACADTWLNQWIFAACVFFLVGPPSLVLVSYSHILAAILRIQSGEG 
RRKAFSTCSSHLCWGLFFGSAIIMYMAPKSRHPEEQQKVFFLFYSFFNPTLNPLIYS 
LI i SVI 3AI RRALGKESHS 



Further analysis of the NOV61a protein yielded the following properties shown in 
Table 61B. 



Table 61B. Protein Sequence Properties NOV61a 



PSort 
analysis: 


0.6400 probability located in plasma membrane; 0.4600 probability located in 
Golgi body; 0.3700 probability located in endoplasmic reticulum (membrane); 
0.1000 probability located in endoplasmic reticulum (lumen) 


SignalP 
analysis: 


Likely cleavage site between residues 43 and 44 



A search of the NOV61a protein against the Geneseq database, a proprietary 
database that contains sequences published in patents and patent publication, yielded 
several homologous proteins shown in Table 61C. 
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Table 61C. Geneseq Results for NOV61a 


Geneseq 
Identifier 


Protein/Organism/Length [Patent 
#, Date] 


NOV61a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Region 


Expect 
Value 


AAM29935 


Peptide #3972 encoded by probe 
for measuring placental gene 
expression - Homo sapiens, 311 aa. 
[WO200157272-A2, 09-AUG- 
2001] 


1..310 
2..311 


310/310(100%) 
310/310(100%) 


0.0 


AAM17409 


Peptide #3843 encoded by probe 
for measuring cervical gene 
expression - Homo sapiens, 311 aa. 
[WO200157278-A2, 09-AUG- 
2001] 


1..310 
2..311 


310/310(100%) 
310/310(100%) 


0.0 


AAG72949 


Human olfactory receptor data 
exploratorium sequence, SEQ ID 
NO: 2631 - Homo sapiens, 314 aa. 
[WO200127158-A2, 19-APR- 
2001] 


1..310 
2..311 


310/310(100%) 
310/310(100%) 


0.0 


AAG72187 


Human olfactory receptor 
polypeptide, SEQ ID NO: 1868 - 
Homo sapiens, 3 1 0 aa. 
[WO200127158-A2, 19-APR- 
2001] 


1..310 
1..310 


310/310(100%) 
310/310(100%) 


0.0 


AAU04577 


Human G-protein coupled receptor 
like protein, GPCR #1 1 - Homo 
sapiens, 308 aa. [WO200 153454- 
A2, 26-JUL-2001] 


1..310 
1..308 


288/310 (92%) 
294/310(93%) 


e-165 



In a BLAST search of public sequence databases, the NOV61a protein was found to 
have homology to the proteins shown in the BLASTP data in Table 6 ID. 





Table 61D. Public BLASTP Results for NOV61a 




- — — __ 

Protein 
Accession 
Number 


1 

Protein/Organism/Length 


NOV61a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Portion 


Expect 
Value 


Q96R46 


OLFACTORY RECEPTOR - Homo 
sapiens (Human), 217 aa (fragment). 


67..283 
1..217 


217/217(100%) 
217/217(100%) 


e-125 


095047 


Olfactory receptor 2A4 - Homo 
sapiens (Human), 3 1 0 aa. 


1..307 
1..307 


217/307 (70%) 
250/307 (80%) 


e-122 
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Q9NQN0 


DJ1005H11.1 (7 

1 KA IN b MbMBKA IN b KbCbr I UK 
(RHODOPSIN FAMILY) 
(OLFACTORY RECEPTOR LIKE) 
PROTEIN)) - Homo sapiens 
(Human), 272 aa (fragment). 


39..307 
1..269 


187 260 ((><>" ,, 
216/269 (79%) 


e-103 


Q9Z1V2 


OLFACTORY RECEPTOR B 12 - 
Mus musculus (Mouse), 223 aa 
(fragment). 


63..285 
1..223 


172/223 (77%) 
190/223 (85%) 


9e-98 


043888 


Ol FACTORY RECEPTOR - Homo 
sapiens (Human), 217 aa (fragment). 


67..282 
1..217 


173/217(79%) 
188/217(85%) 


le-97 



PFam analysis predicts that the NOV61a protein contains the domains shown in the 
Table 6 IE. 



Table 61E. Domain Analysis of NOV61a 


Pfam Domain 


NOV61a Match Region 


Identities/ 
Similarities 
for the Matched Region 


Expect Value 


7tm_l: domain 1 of 1 


40..289 


65/269 (24%) 
188/269 (70%) 


l.le-45 



Example 62. 

The NOV62 clone was analyzed, and the nucleotide and predicted polypeptide 
sequences are shown in Table 62 A. 



Table 62 A. NOV62 Sequence Analysis 


|SEQ ID NO: 187 11201 bp 


NOV62a, 

CG59551-01 DNA Sequence 


AGTTGGTTGTAAATAATTCTGCTTATATTACCTACAGAGTAAACATTATAGCATTATC 


ACTCCAGAATCCTTTGTTTCTATGGTTTCCAGATGTTTCCAATGTCTAGATGTTCCAG 


CTGCCCATCTCTGAGAAATCCAGCTGTGTCTCACAATGGATGCCACAGCCTGTAATGA 


ATCAGTGGATGGCTCACCCGTCTTCTATCTATTGGGCATCCCCTCTCTGCCAGAGACC 
TTCTTCCTCCCTGTGTTTTTTATTTTCCTCCTCTTCTACCTTCTCATCCTGATGGGTA 
ATGCCCTGATCCTGGTGGCCGTGGTGGCAGAGCCCAGCCTCCACAAGCCCATGTACTT 
CTTTCTGATCAATCTCTCCACCTTGGACATCCTTTTCACCACAACCACTGTCCCCAAG 
ATGCTGTCCTTATTCTTGCTTGGGGACCGCTTCCTCAGCTTTTCTTCCTGCTTACTGC 
AGATGTACCTCTTCCAAAGTTTTACATGTTCAGAAGCCTTCATCCTGGTGGTCATGGC 
CTATGACCGCTATGTGGCTATCTGCCACCCACTGCACTACCCTGTCCTCATGAACCCA 
CAGACCAATGCTACCTTGGCAGCCAGTGCCTGGCTAACTGCCCTCCTCCTGCCCATCC 
CAGCAGTAGTAAGGACCTCCCAGATGGCATATAACAGCATTGCCTACATCTACCACTG 
CTTCTGTGATCATCTGGCTGTGGTCCAGGCCTCCTGCTCTGACACCACCCCCCAGACC 
CTCATGGGCTTCTGCATCGCCATGGTGGTGTCCTTCCTCCCCCTTCTCCTGGTGCTTC 
TCTCCTATGTCCACATCCTGGCCTCAGTGCTTCGCATCAGTTCCCTAGAAGGACGGGC 
AAAAGCCTTCTCCACCTGCAGCTCCCACCTTCTGGTCGTGGGCACCTACTACTCATCT 
ATTGCCATAGCCTACGTGGCCTACAGGGCTGACCTGCCCCTTGACTTCCATATCATGG 
GCAATGTGGTATATGCCATTCTCACACCAATTCTCAACCCCCTCATTTACACGCTGAG 
AAACAGGGATGTAAAGGCAGCCATCACCAAAATCATGTCTCAAGACCCAGGCTGTGAC 
AGGAGCATTTGACCTTTAAATGCAGCTAACTCTGCTTCCAGGACACCAAATAACAGTG 


CTTAGCACA-=, T ^ATS AT iJVTGAAATAA 




ORK Start: \TG at 152 


ORF Stop: TGA at 1112 




SEQIDNO: 188 


■;20 aa MW at S 4 >502 6kD 
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|NOV62a, 
JCG59551-01 Protein Sequence 



MDATACNESVDGS PVFYLLG IPSLPETFFLPVFFI FLLF YLLI LMGNALI LVAWAF.P 
SLHKPMYFFLINLSTLDILFTTTTVPKMLSLFLLGDRFLSFSSCLLQMYLFQSFTCSE 
AFILWMAYDRYVAICHPLHYPVLMNPQTNATLAASAWLTALLLPIPAWRTSQMAYN 
SIAYIYHCFCDHLAWQASCSDTTPQTLMGFCIAMWSFLPLLLVLLSYVHILASVLR 
ISSLEGRAKAFSTCSSHLLWGTYYSSIAIAYVAYRADLPLDFHIMGNWYAILTPIL 
L E 1 iAITKIMSQDPGCDRSI 



Further analysis of the NOV62a protein yielded the following properties shown in 
Table 62B. 



Table 62B. Protein Sequence Properties NOV62a 


PSort 
analysis: 


0.6000 probability located in plasma membrane; 0.4000 probability located in 
Golgi body; 0.3000 probability located in endoplasmic reticulum (membrane); 
0.3000 probability located in microbody (peroxisome) 


SignalP 
analysis: 


Likely cleavage site between residues 57 and 58 



A search of the NOV62a protein against the Geneseq database, a proprietary 
database that contains sequences published in patents and patent publication, yielded 
several homologous proteins shown in Table 62C. 



Table 62 C. Geneseq Results for NOV62a 


Geneseq 
Identifier 


Protein/Organism/Length [Patent 
#, Date] 


NOV62a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Region 


Expect 
Value 


AAG72119 


Human olfactory receptor 
polypeptide, SEQ ID NO: 1800 - 
Homo sapiens, 295 aa. 
[WO200127158-A2, 19-APR-2001] 


35..290 
2..257 


213/256 (83%) 
228/256 (88%) 


e-119 


AAU24639 


Human olfactory receptor 
AOLFR134 - Homo sapiens, 325 aa. 
[WO200168805-A2, 20-SEP-2001] 


16..308 
17..308 


129/293 (44%) 
186/293 (63%) 


6e-67 


AAG72479 


Human OR-like polypeptide query 
sequence, SEQ ID NO: 2160 - 
Homo sapiens, 324 aa. 
[WO200127158-A2, 19-APR-2001] 


16..308 
17..308 


129/293 (44%) 
186/293 (63%) 


6e-67 


AAG71590 


Human olfactory receptor 
polypeptide, SEQ ID NO: 1271 - 
Homo sapiens, 324 aa. 
[WO200127158-A2, 19-APR-2001] 


16..308 
17..308 


129/293(44%) 
186/293 (63%) 


6e-67 


AAG71632 


Human olfactory receptor 


16..315 
13. .312 


126/300(42%) 
1 79/300 (5y%) 


3e-64 
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Homo sapiens, 316 aa. 

| WO200 1 271 58-A2, 1 9-APR-2001] 









In a BLAST search of public sequence databases, the NOV 62a protein was found to 
have homology to the proteins shown in the BLASTP data in Table 62D. 



Table 62D. Public BLASTP Results for NOV62a 


Protein 
Accession 
Number 


Protein/Organism/Length 


NOV62a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Portion 


Expect 
Value 


Q9Z236 


OLFACTORY RECEPTOR - 
Rattus norvegicus (Rat), 221 aa 
(fragment). 


70..289 
2..221 


187/220 (85%) 
202/220 (91%) 


e-104 


CAB43131 


OLFACTORY RECEPTOR - 
Stenella coeruleoalba (Striped 
dolphin), 1 72 aa (fragment). 


69.. 240 
1..172 


136/172 (79%) 
148/172 (85%) 


le-73 


Q9EPG2 


M51 OLFACTORY RECEPTOR 
- Mus musculus (Mouse), 314 aa. 


16..3I0 
12..305 


, , , _ 

131/295 (44%) 
191/295 (64%) 


2e-67 


Q9H208 


HP4 OLFACTORY RECEPTOR 
- Homo sapiens (Human), 317 aa 
(fragment). 


16..312 
12..308 


127/297 (42%) 
180/297 (59%) 


3e-65 


Q920G5 ] OLFACTORY RECEPTOR P3 - I 16..308 
I Mus musculus (Mouse), 324 aa. | 19.. 31 1 


126/295 (42%) 
180/295 (60%) 


le-62 



PFam analysis predicts that the NOV62a protein contains the domains shown in the 
Table 62E. 



Table 62E. Domain Analysis of NOV62a 


Pfam Domain 




NOV62a Match Region 


Identities/ 
Similarities 
for the Matched Region 


Expect Value 


7tm_l : domain 1 of 1 


46..295 


58/268 (22%) 
1 79/268 (67%) 


4.6e-38 



Example 63. 



The NOV63 clone was analyzed, and the nucleotide and predicted polypeptide 
sequences are shown in Table 63A. 
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Table 63A. NOV63 Sequence Analysis 




SEQ ID NO: 189 


1042 bp 


NOV63a, 

CG59540-01 DNA Sequence 


« TTT hi T sTTT "T TT T3J JA AA k 


GCCCTGGGTCACAGAATTCATCCTGGTTGGATTCCAGCTCAGTGCAGAGATGGAGATC 
TTTCTCTCTTGCATCTTCTCCCTGTTATATCTCTTCAGTCTACTGAGGAATGGCATGA 
ACATGGGACTCATCTGTCTGGATCCCAGACTACACACCCCCATATACTTCTTCCTGTC 
ACACTTGGCCGTCATTGACATATACTATGCTTCCAACAATTTGCTCAACATGCTGGAA 
AACCTAGTGAAACACAAAAAAACTATCTCGTTCATCTCTTGCATTATGCAGATGGCTT 
TGTATTTGACTTTTGCTGCTGCAGTGTGCATGATTTTGGTGGTGATGTCCTATGACAG 
ATTTGTGGCGATCTGCCATCCCCTGCATTACACTGTCATCATGAACTGGAGAGTGTGC 
ACAGTACTGGCTATTACTTCCTGGGCATGTGGATTTTCCCTGGCCCTCATAAATCTAA 
TTCTCCTTCTAAGGCTGCCCTTCTGTGGGCCCCAGGAGGTGAACCACTTCTTCGGTGA 
AATTCTGTCTGTCCTCAAACTGGCCTGTGCAGACACCTGGATTAATGAAATTTTTGTC 
TTTGCTGGTGGTGTGTTTGTCTTAGTCGGGCCCCTTTCCTTGATGCTGATCTCCTACA 
TGCGCATCCTCTTGGCCATCCTGAAGATCCAGTCAGGCGAGGC-CCACAGAAAGGACTT 




ATGTACATGGCCCCCAAGTCCCGCCATCCCGAGGAGCAGCAGAAGGTCCTTTCCCTGT 
TTTGCAGCCTTTGGAATCAGGTGCTGAACCCCCCTCTGATCTACAGCTTGAGGAATGC 
AGAGGTCAAGAGTGCCCCACAAGAGGGCCACTGAAGAAGGAGAGGCTGATGTTACAAT 
CTCAAAGGCACCACGAGGAGAGGGCCTGCTCCGAC^„ f " ^TTGGCTTTTT 




ORP Start: ATGat45 


ORE Stop: TGA at 960 




SEQ ID NO: 190 


305 aa ;MW at 34554.8RD 


NOV63a, 


MGDKQPWVTEFILVGFQLSAEMEIFLSCIFSLLYLFSLLRNGMNMGLICLDPRLHTPI 
YFFLSHLAVIDIYYASNNLLNMLENLVKHKKTISFISCIMQMALYLTFAAAVCMILW 
MSYDRFVAICHPLHYTVIMNWRVCTVLAITSWACGFSLALINLILLLRLPFCGPQEVN 
HFFGEILSVLKLACADTWINEIFVFAGGVFVLVGPLSLMLISYMRILLAILKIQSGEG 
HRKDFSTCSSHLCWGFFFANAIVMYMAPKSRHPEEQQKVLSLFCSLWNQVLNPPLIY 
LR1 iE 3AP03GH 


CG59540-01 Protein Sequence 

: 



Further analysis of the NOV63a protein yielded the following properties shown in 
Table 63B. 



Table 63B. Protein Sequence Properties NOV63a 


PSort 
analysis: 


0.6000 probability located in plasma membrane; 0.4000 probability located in 
Golgi body; 0.3000 probability located in endoplasmic reticulum (membrane); 
0.3000 probability located in microbody (peroxisome) 


SignalP 

analysis: 


Likely cleavage site between residues 43 and 44 



A search of the NOV63a protein against the Geneseq database, a proprietary 
database that contains sequences published in patents and patent publication, yielded 
several homologous proteins shown in Table 63C. 



Table 63C. Geneseq Results for NOV63a 


r 

Geneseq 
Identifier 


Protein/Organism/Length [Patent 
#, Date] 


NOV63a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Region 


Expect 
Value 


AAU24758 


Human olfactory receptor 


1..300 
1..299 


258/300 (86%) 
275/300 (91%) 


e-146 
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[WO200168805-A2. 20-SEP-2001] 








AAG72952 


Human olfactory receptor data 
exploratorium sequence, SEQ ID 
NO: 2634 - Homo sapiens, 310 aa. 
[WO200127158-A2, 19-APR-2001] 


1..300 
1..299 


272/300 (90%) 


e-144 


AAG72377 


Human OR-like polypeptide query 
sequence, SEQ ID NO: 2058 - 
Homo sapiens, 312 aa. 

rWfY>fif>1 071 ^8 A 9 1Q APT? OAfin 


1..300 
1..299 


Z35/3UU (o5%) 
272/300 (90%) 


e-144 


AAG72169 


Human olfactory receptor 
polypeptide, SEQ ID NO: 1850 - 
Homo sapiens, 312 aa. 
[WO200127158-A2, 19-APR-2001] 


] 300 
1..299 




255/300 (85%) 
272/300 (90%) 


e-144 


AAG71994 


Human olfactory receptor 
polypeptide, SEQ ID NO: 1675 - 
Homo sapiens, 314 aa. 
[WO200127158-A2, 19-APR-2001] 


1..300 
1..299 


225/300 (75%) 
256/300 (85%) 


e-129 



In a BLAST search of public sequence databases, the NOV63a protein was found to 
have homology to the proteins shown in the BLASTP data in Table 63D. 



Table 63D. Public BLASTP Results for NOV63a 


Protein 
Accession 
Number 


Pr otein/O rganism/Len gth 


NOV63a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Portion 


Expect 
Value 


095047 


Olfactory receptor 2A4 - Homo 
sapiens (Human), 3 1 0 aa. 


1..299 
1..298 


173/299 (57%) 
217/299 (71%) 


2e-92 


043885 


OLFACTORY RECEPTOR - 
Homo sapiens (Human), 217 aa 
(fragment). 


67..28 1 
1.-216 


154/216(71%) 
182/216(83%) 


5e-88 


043888 


OLFACTORY RECEPTOR - 
Homo sapiens (Human), 217 aa 
(fragment). 


67..281 
1 ..216 


153/216(70%) 
182/216(83%) 


8e-88 


Q96R48 


OLFACTORY RECEPTOR - 
Homo sapiens (Human), 2 1 7 aa 
(fragment). 


67..28I 
1..216 


153/216(70%) 
181/216(82%) 


2e-87 


Q96R47 


OLFACTORY RECEPTOR - 
Homo sapiens (Human), 215 aa 
(fragment). 


67..281 
1..214 


149/215 (69%) 
175/215(81%) 


3e-84 
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PFam analysis predicts that the NOV63a protein contains the domains shown in the 
Table 63E. 



Table 63E. Domain Analysis of NOV63a 


Pfam Domain 


NOV63a Match Region 


Identities/ 
Similarities 
for the Matched Region 


Expect Value 


7tm_l: domain 1 of 1 


47..290 


55/270 (20%) 
174/270(64%) 


9.7e-25 



Example 64. 

The NOV64 clone was analyzed, and the nucleotide and predicted polypeptide 
sequences are shown in Table 64A. 



Table 64 \ NOV64 Sequence Analysis 




SEQIDNO:191 1 973 bp 


NOV64a, 

CG59280-01 DNA Sequence 


AGGCACTAAATGAATATCTGTTTAATTCATAAAGTAACAGAGTTTCTCTTCTCTGGAT 


ATTCATTGTCATTGGGAATCTTATTGTATTTTTTGCAGTCAGGGTGGATACCCGTCTC 
CACAACCCCATGTATAATTTTATCAGCATTTTCTCATTTCTGGAGATCTGGTACACAA 
CTGCCACAATTCCCAAGATGCTCTCCATCCTCATCAGCAGGCAGAGGACCATCTCCAT 

TTGTTGACCACCATGGCCATTGATAGGTACGTTGCCATCTGTAACCCTCTCCGCTACC 
CAACCATCATGACCCCCGGGCTCTGTGTTCAGCTCTCTGTGGGGTCCTGCATCTTTGG 
CTTTCTTGTGTTGCTCCCAGAGATTGCATGGATTTCCACACTGCCCTTCTGTGGACCC 
AACCAAATCCACCAGATCTTCTGTGATTTTGAACCTGTGCTGCGCTTGGCCTGTACAG 
ACACGTCCATGATTCTGATTGAGGATGTGATCCATGCTGTGGCCATTGTATTCTCTGT 
CCTGATTATTGCCTTTTCTTATATCAGAATCATCACTGTAATCCTGAGGATTCCCTCT 
GTTGAAGGCCGCCAGAAGGCCTTTTCTACCTGTGCCGCCCATCTTAGTGTCTTTCTGA 
TGTTCTATGGCAGTGTATCCCTCATGTACCTGCGTTTCTCTGCCACTTTCCCACCGAT 

ATCTATAGCTTTAGAAATAAGGACATGAAGATTGCAATTAAAAAGCTTTTCTGCCCTC 
AGAAGATGGTTAATTTATCTGTAGATTAATGCTAGCTCATAGGCA 




ORF Start: ATG at 10 


ORF Stop: TAA at 955 




SEQ ID NO 192 


315 aa |MW at 35741 .4kD 


NOV64a, 

CG59280-01 Protein Sequence 


MNICLIHKVTEFLFSGFPQFEDGSLLFFIPLFVIYIFIVIGNLIVFFAVRVDTRLHNP 
MYNFISIFSFLEIWYTTATIPKMLSILISRQRTISMVGCLLQMYFFHSLGNSEGILLT 
TMAIDRYVAI CNPLRYPTIMTPGLCVQLSVGSCIFGFLVLLPEIAWISTLPFCGPNQI 
HQIFCDFEPVLRLACTDTSMILIEDVIHAVAIVFSVLIIAFSYIRIITVILRIPSVEG 
RQKAFSTCAAHLSVFLMFYGSVSLMYLRFSATFPPILDTAVALMFAVLAPFFNPIIYS 
' I IL' FJ liT 




SEQ ID NO: 193 (929 bp 


NOV64b, 

CG59280-02 DNA Sequence 


ATTTTTTGCAGTCAGGGTGGATACCCGTCTCCACAACCCCATGTATAATTTTATCAGC 
ATTTTCTCATTTCTGGAGATCTGGTACACAACTGCCACAATTCCCAAGATGCTCTCCA 

CTTCCATTCACTGGGAAATTCAGAGGGGATTTTGTTGACCACCATGGCCATTGATAGG 
TACGTTGCCATCTGTAACCCTCTCCGCTACCCAACCATCATGACCCCCGGGCTCTGTG 
TTCAGCTCTCTGTGGGGTCCTGCATCTTTGGCTTTCTTGTGTTGCTCCCAGAGATTGC 
ATGGATTTCCACACTGCCCTTCTGTGGACCCAACCAAATCCACCAGATCTTCTGTGAT 
TTTGAACCTGTGCTGCGCTTGGCCTGTACAGACACGTCCATGATTCTGATTGAGGATG 
TGATCCATGCTGTGGCCATTGTATTCTCTGTCCTGATTATTGCCTTTTCTTATATCAG 
AATCATCACTGTAATCCTGAGGATTCCCTCTGTTGAAGGCCGCCAGAAGGCCTTTTCT 
ACCTGTGCCGCCCATCTTAGTGTCTTTCTGATGTTCTATGGCAGTGTATCCCTCATGT 
ACCTGCGTTTCTCTGCCACTTTCCCACCGATTTTGGACACAGCTGTTGCACTGATGTT 
TGCAGTTCTTGCTCCCTTTTTCAACCCTATCATCTATAGCTTTAGAAATAAGGACATG 
AAGATTGCAATTAAAAAGCTTTTCTGCCCTCAGAAGATGGTTAATTTATCTGTAGATT 
■A l i r L ,i r ] 
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|ORF Start: TTC at 3 |ORF Stop: TAA at 870 




SEQIDNoTl94 J289 aa |MW at 32772.9kD 


NOV64b, 

CG59280-02 Protein Sequence 


FFIPLFVIYIFIVIGNLIVFFAVRVDTRLHNPMYNFISIFSFLEIWYTTATIPKMLSI 
LISRQRTISMVGCLLQMYFFHSLGNSEGILLTTMAIDRYVAICNPLRYPTIMTPGLCV 
QLSVGSCIFGFLVLLPEIAWISTLPFCGPNQIHQIFCDFEPVLRLACTDTSMILIEDV 
IHAVAIVFSVLIIAFSYIRIITVILRIPSVEGRQKAFSTCAAHLSVFLMFYGSVSLMY 
LRFSATFPPILDTAVALMFAVLAPFFNPIIYSFRNKDMKIAIKKLFCPQKMVNLSVD 



Sequence comparison of the above protein sequences yields the following sequence 
relationships shown in Table 64B. 



Table 64B. Comparison of NOV64a against NOV64b. 


Protein Sequence 


NOV64a Residues/ 
Match Residues 


Identities/ 
Similarities for the Matched Region 


NOV64b 


27..315 
I..289 


289/289 (100%) 
289/289(100%) 



Further analysis of the NOV64a protein yielded the following properties shown in 
Table 64C. 



Table 64C. Protein Sequence Properties NOV64a 


PSort 
analysis: 


0.6000 probability located in plasma membrane; 0.4000 probability located in 
Golgi body; 0.3000 probability located in endoplasmic reticulum (membrane); 
0.3000 probability located in microbody (peroxisome) 


SignalP 
analysis: 


Likely cleavage site between residues 54 and 55 



A search of the NOV64a protein against the Geneseq database, a proprietary 
database that contains sequences published in patents and patent publication, yielded 
several homologous proteins shown in Table 64D. 



Table 64D. Geneseq Results for NOV64a 


Geneseq 
Identifier 


Protein/Organism/Length [Patent 
#, Date] 


NOV64a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Region 


Expect 
Value 


AAG71805 


Human olfactory receptor 
polypeptide, SEQ ID NO: 1486 - 
Homo sapiens, 256 aa. 
[WO200127158-A2, 19-APR-2001] 


59..314 
1..256 


255/256 (99%) 
255/256 (99%) 


e-145 
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A A OA'"? 

AAG/180J 


Human olfactory receptor 
polypeptide, SEQ ID NO: 1484 - 
Homo sapiens, 3 1 5 aa. 
| WO200 1 27 1 58-A2, 1 9-APR-2001 ] 


9. .311 
9..311 


O /I O A'"* /OAO/ \ 

243/iOj (80%) 
267/303 (87%) 


e-143 


AAU24658 


Human olfactory receptor 
AOLFR156 - Homo sapiens, 331 
aa. [WO200168805-A2, 20-SEP- 
2001] 


9..311 

25..327 

_ — _ \ 


240/303 (79%) 
264/303 (86%) 


e-140 


AAG71807 


Human olfactory receptor 
polypeptide, SEQ ID NO: 1488 - 
Homo sapiens, 319 aa. 
[WO200127158-A2, 19-APR-2001] 


9..313 
9..313 


222/305 (72%) 
259/305 (84%) 


e-131 


AAU24721 


Human olfactory receptor 
AOLFR220 - Homo sapiens, 343 
aa. [WO200168805-A2, 20-SEP- 
2001] 


9..308 
37..336 


209/300 (69%) 
242/300 (80%) 


e-119 


In a BLAST search of public sequence databases, the NOV 64a protein was found to 
have homology to the proteins shown in the BLASTP data in Table 64E. 


Table 64E. Public BLASTP Results for NOV64a 


s 

Protein 
Accession 
Number 


Protein/Organism/Length 


NOV64a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Portion 

— -— — - — - 


Expect 
Value 


Q9EPG2 


M5 1 OLFACTORY RECEPTOR - 
Mus musculus (Mouse), 314 aa. 


1..302 
4..303 


137/303 (45%) 
194/303 (63%) 


2e-71 


Q9EPV0 


M50 OLFACTORY RECEPTOR 
(OLFACTORY RECEPTOR M50) - 
Mus musculus (Mouse), 316 aa. 


6..302 
4..301 


132/298 (44%) 
191/298 (63%) 


3e-71 


Q9EPG1 


M50 OLFACTORY RECEPTOR - 
Mus musculus (Mouse), 316 aa. 


6..302 
4..301 


130/298 (43%) 
190/298 (63%) 


2e-70 


I Q9WU86 


ODORANT RECEPTOR SI - Mus 
musculus (Mouse), 324 aa. 


1..310 
12..321 


133/313 (42%) 
190/313 (60%) 


4e-69 


| Q96KK4 


DJ994E9.5 (OLF \CTORY 
RECEPTOR, FAMILY 10, 
SUBFAMILY C, MEMBER 1 
(HS6M1-17)) - Homo sapiens 
(Human), 306 aa. 


9..314 
2..306 


137/307(44%) 
189/307 (60%) 


9e-68 



PFam analysis predicts that the NOV 64a protein contains the domains shown in the 
Table 64F. 
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Table 64F. Domain Analysis of M>\ 64a 


Pfam Domain 


NOV64a Match Region 


Identities/ 
Similarities 
for the Matched Region 


Expect Value 


7tm_l : domain 1 of 1 


41. .289 


51/269(19%) 
1 79/269 (67%) 


2.2e-33 



Example 65. 

The NOV65 clone was analyzed, and the nucleotide and predicted polypeptide 
sequences are shown in Table 65A. 



Table 65 A. NOV65 Sequence Analysis 




SEQIDNO:195 J 972 bp 


NOV65a, 

CG59568-01 DNA Sequence 


GCATGGTGATCCTGTCCTGGGAAAACCAAACGATGAGAGTGGAATTCGTGCTTCAAGG 
ATTCTCTTCCATCAGACAGTTAAATATTTTCCTCTTTATGATAATTTTAGTTTTCTAC 
ATCTTAACTGTTTCTGGAAACATCCTCATTGTCCTTCTAGTTTTAGTCAGACATCATC 
TCCACACCCCTATGTACTTCCTCCTGGTGAACTTGTCCTGTCTGGAGATCTGGTATAC 
CTCTAACATCATCCCCAAAATGTTGCTGATTATCATAGCTGAAGAGAAGACTATCTCT 
GTGGCTGGCTGGCTGGCACAATTCTACTTCTTCGGATCCCTGGCTGCCACGGAGTGCC 
TCTTGCTCACTGTGATGTCCTATGATCGCTACCTAGCCATCTGCCAGCCTCTTTGCTA 
CCGTGTCCTCATGACTGGCCCCCTTTGCATCAGGCTAGCTGCTGGCTCTTGGTTCTGC 
TGCTTCCTCCTTACAGCAATCACCATGGTCTTGCTATGTAGACTAACCTTCTGTGGAC 
CCTATGAAACTGATCACTTCTTTTGTGACTTCACCCCTCTGGTTCATCTCTCCTGCAT 
GGATACCTCAGTGACTGAGACCATTGCCTTTGCCACCTCTTCTGCAGTAACTCTGATC 
CCATTTCTCCTCATTGTAGCCTCCTACTCCTGCGTCCTTTCTGCTATCCTAAGAATCC 
CATCTTGCACAGGCCAGAAAAAGGCCTTCTCCACCTGCTCTTCCCACCTCACTGTGGT 
CATAGTGTTTTATGGGACACTGATTGCCACATACCTTGTGCCCTCAGCCAACTCATCC 
CAACTCTTGTGCAAAGGGTCCTCTCTGCTCTACATCATCCTGACACCCATGTTTAACC 
CCATCATTTATAGCCTGAGAAATAGAGACATCCATGAAGCTCTGAAGAAGTGCTTGAG 
r r- r l£ A!;-,] 




ORF Start: ATG at 3 |ORF Stop: TAA at 954 




SEQlDNO:196 3l7aa M\V at 357l3.4kD 


NOV65a, 

CG59568-01 Protein Sequence 


MVILSWENQTMRVEFVLQGFSSIRQLNIFLFMIILVFYILTVSGNILIVLLVLVRHHL 
HTPMYFLLVNLSCLEIWYTSNIIPKMLLIIIAEEKTISVAGW1AQFYFFGSLAATECL 
LLTVMSYDRYLAICQPLCYRVLMTGPLCIRLAAGSWFCCFLLTAITMVLLCRLTFCGP 
YETDHFFCDFTPLVHLSCMDTSVTETIAFATSSAVTLIPFLLIVASYSCVLSAILRIP 
SCTGQKKAFSTCSSHLTWIVFYGTLIATYLVPSANSSQLLCKGSSLLYIILTPMFNP 
IIYSLRNRDIHEALKKCLRKKSGVCLR 



Further analysis of the NOV65a protein yielded the following properties shown in 
Table 65B. 



Table 65B. Protein Sequence Properties NOV65a 


PSort 
analysis: 


0.6000 probability located in plasma membrane; 0.4000 probability located in 
Golgi body; 0.3888 probability located in mitochondrial inner membrane; 
0.3030 probability located in mitochondrial intermembrane space 


SignalP 
analysis: 


Likely cleavage site between residues 45 and 46 
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A search of the NOV65a protein against the Geneseq database, a proprietary 
database that contains sequences published in patents and patent publication, yielded 
several homologous proteins shown in Table 65C. 



Table 65C. Geneseq Results for NOV65a 


Geneseq 
Identifier 


Protein/Organism/Length [Patent 
#, Date] 


NOV65a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Region 


Expect 
Value 


AAG72527 


Human OR-like polypeptide query 
sequence, SEQ ID NO: 2208 - 
Homo sapiens, 316 aa. 
[WO200127158-A2, 19-APR-2001] 


1..316 
1..316 


315/316(99%) 
315/316(99%) 


0.0 


AAG72231 


Human olfactory receptor 
polypeptide, SEQ ID NO: 1912 - 
Homo sapiens, 316 aa. 
[WO200127158-A2, 19-APR-2001] 


1..316 
1..316 


315/316(99%) 
315/316(99%) 


0.0 " 


AAG72084 


Human olfactory receptor 
polypeptide, SEQ ID NO: 1765 - 
Homo sapiens, 316 aa. 
[WO200127158-A2, 19-APR-2001] 


1.316 
1.316 


315/316(99%) 
315/316(99%) 


0.0 


AAG72700 


Murine OR-like polypeptide query 
sequence, SEQ ID NO: 2382 - Mus 
musculus, 314 aa. [WO200127158- 
A2, 19-APR-2001] 


1..308 
3..308 


154/308 (50%) 
208/308 (67%) 


2e-83 


AAG71814 


Human olfactory receptor 
polypeptide, SEQ ID NO: 1495 - 
Homo sapiens, 317 aa. 
[WO200127158-A2, 19-APR-2001] 


8..311 
5..308 


142/304 (46%) 
208/304 (67%) 


7e-79 



In a BLAST search of public sequence databases, the NOV65a protein was found to 
have homology to the proteins shown in the BLASTP data in Table 65D. 



Table 65D. Public BLASTP Results for NOV65a 


Protein 
Accession 
Number 


Protein/Organism/Length 


NOV65a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Portion 


Expect 
Value 


Q9GZK7 


Olfactory receptor 1 1A1 
(Hs6Ml-18) - Homo sapiens 
(Human), 315 aa. 


1..308 
1..306 


147/308 (47%) 
202/308 (64%) 


4e-77 
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013036 


CHICK OLFACTORY 
RECEPTOR 7 - Gallus gallus 
(Chicken), 323 aa. 


7.3 1 1 
4..308 


139/305 (45%) 
198/305 (64%) 


le-76 




Q9JKA6 


OLFACTORY RECEPTOR P2 - 
Mus musculus (Mouse), 3 1 5 aa. 


4..313 
1 ..31 1 


143/311 (45%) 
194/311 (61%) 


le-75 


Q9WU86 


ODORANT RECEPTOR SI - 
Mus musculus (Mouse), 324 aa. 


14..308 
21..315 


144/295 (48%) 
189/295 (63%) 


2e-75 


Q9UGF6 


Olfactory receptor 5V1 (Hs6Ml- 
21) - Homo sapiens (Human), 
321 aa. 


7..305 
4..302 


138/299 (46%) 
199/299 (66%) 


5e-75 



PFam analysis predicts that the NOV65a protein contains the domains shown in the 
Table 65E. 



Table 65E. Domain Analysis of NOV65a 


Pfam Domain 


NOV65a Match 
Region 


Identities/ 
Similarities 
for the Matched 
Region 


Expect 
Value 


granulin: domain 1 of 1 


144..155 


7/13 (54%) 
11/13 (85%) 


1.7 


Trypan glycop: domain 1 
of 1 


218..241 


6/24 (25%) 
21/24 (88%) 


7.9 


7tm_l: domain 1 of 1 


- 

44..293 


53/268 (20%) 
172/268 (64%) 


1.5e-31 



Example 66. 

The NOV66 clone was analyzed, and the nucleotide and predicted polypeptide 
sequences are shown in Table 66A. 



Table 66A. NOV66 Sequence Analysis 


— „ „„ : 


SEQIDNO:197 1987 bp 


NOV66a, 

CG5 9224-01 DNA Sequence 


CATCTTCCTATGTGTCATGTCTCCTCTTAATGACACAAAAATGGAAGTCCTTAGATTC 
CTCCTTATCGGGATCACTGGACTGGAGAAAAGTCGCACCTGGATATCCATTCCTTTCT 
TATCTGTGTACCTTCTTTCTTGGATGGGTAATTTTACCGTCCTCTTTTTTATCAAGAC 
AGAGCAAAGCCTCCATGAACCTATGTATTATTTGCTTTCCATGCTCTCCATCTCTGAC 
CTAGGGCTGTCTCTGTCTTCCTTACCCATCACTTTGGGACTATTCCTATTTGATGTCC 
ATGAAATTCATGCAGCTCCATGCTTTGCCCAGGAATTTTTTATCCATCTGTTTACAGT 
CAGTGAAGCCTCTGTACTGTCTGTAATGGCATTTGACTGGTATGTGGCAATCCACAGT 
CCTTTGAGATACAGCACTATCTTAACTAGTCCCAGAGCCATCAAAACAGGGGTTCTTC 

ATATTGTCATCAAAACCTGCTCTCCCACTCCTATTGTCTCCACCAGGATGTCATGAAG 
CTGATGTGTTCTGACAACACAGTCAATGTTGTCTACGGACTCTGTGCAGGACTTTCTA 
CTATGCTGGACTTGGTGTTTATTACCTTCTCCTATATGATTTTAAGGGCTGTACTGGG 
AATTGCTACCCCCAGACAGCAGTTCAAGGCCCTCAACACGTGCATCTCTCACATCTGT 
GCTGTGCTTATCTTCTATGTGCCCACGCTGAGTGCTGCCATGCTCCACCAGTTTGCCA 
GGGATGTGTCTCCTATGATCCACGTCCTCATGGCTGATATTTTTCTGCTGGTGCCACC 
CCTGTTCAATCCCATCGTGTACTGTGTOAAGACCCACCAAATCCGAGAAAAGGTTGTG 
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GGGAAACTTTGTCCAAAAGTAAGTTGATCAAAGG AA 3 A - AATGAATGTATA 
A 




ORF Start: ATG at 17 


ORF Stop: TGA at 953 




SEQIDNO: 198 


312 aa JmW at 35250.7kD 


NOV66a, 

CG5 9224-01 Protein Sequence 


MS PLNDTKME VLRFLLI G I TGLEKSRTWI S I PFLSVYLLSWMGNFTVLFF I KTEQSLH 
EPMYYLLSMLSISDLGLSLSSLPITLGLFLFDVHEIHAAPCFAQEFFIHLFTVSEASV 
LS VMAFDWYVAI US PLRYST ILTSPRAI KTGVLLTSKNVLLILPLP FLLQRLRYCHQN 




LLSHSYCLHQDVMKLMCSDNTVNWYGLCAGLSTMLDLVFITFSYMILRAVLGIATPR 
QQFKALNTCISHICAVLIFYVPTLSAAMLHQFARDVSPMIHVLMADIFLLVPPLLNPI 
VYCVKTHQIREKWGKLCPKVS 



Further analysis of the NOV66a protein yielded the following properties shown in 
Table 66B. 



Table 66B. Protein Sequence Properties NOV66a 


PSort 

analysis: 


0.6000 probability located in plasma membrane; 0.4000 probability located in 
Golgi body; 0.3000 probability located in endoplasmic reticulum (membrane); 
0.2007 probability located in mitochondrial inner membrane 


SignalP 
analysis: 


Likely cleavage site between residues 50 and 51 



A search of the NOV66a protein against the Geneseq database, a proprietary 
database that contains sequences published in patents and patent publication, yielded 
several homologous proteins shown in Table 66C. 



Table 66C. Geneseq Results for NOV66a 



Geneseq 
Identifier 


Protein/Organism/Length [Patent 
#, Date] 


NOV66a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Region 


Expect 
Value 


AAG72488 


Human OR-like polypeptide query 
sequence, SEQ ID NO: 2169 - 
Homo sapiens, 319 aa. 
[WO200127158-A2, 19-APR-2001] 


1.312 
,.3,3 


309/313 (98%) 
309/313 (98%) 


e-176 


AAG71557 


Human olfactory receptor 
polypeptide, SEQ ID NO: 1238 - 
Homo sapiens, 319 aa. 
[WO200127158-A2, 19-APR-2001] 


1..312 
1-313 


309/313 (98%) 
309/313 (98%) 


e-176 


AAU24573 


Human olfactory receptor 
AOLFR63 - Homo sapiens, 3 1 3 aa. 
[WO200168805-A2, 20-SEP-2001] 


1..310 
1..311 


186/311 (59%) 
246/311 (78%) 


e-109 


AAG71558 


Human olfactory receptor 


1.310 
1.311 


185/311 (59%) 
245/311 (78%) 


e-108 
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Homo sapiens, 3 1 3 aa. 
[WO200127158-A2. 19-APR-2001] 






AAU24682 


Human olfactory receptor j 1 ..307 
AOLFR1 81 -Homo sapiens, 312 aa. 1..306 
L WO200 1 68805-A2, 20-SEP-200 1 ] 


188/308 (61%) 
237/308 (76%) 


e-106 



In a BLAST search of public sequence databases, the NOV66a protein was found to 
have homology to the proteins shown in the BLASTP data in Table 66D. 



Table 66D. Public BLASTP Results for NOV66a 


Protein 
Accession 
Number 


Protein/Organism/Length 


NOV66a 

Match 
Residues 


Identities/ 

the Matched 
Portion 


Value 


AAL35109 


PROSTATE-SPECIFIC G 
PROTEIN-COUPLED 
RECEPTOR RA1C - Mus 
musculus (Mouse), 320 aa. 


14..304 
11. .303 


141/293 (48%) 
199/293 (67%) 


2e-77 


088628 


Olfactory receptor 51E2 (G-protein 
coupled receptor RAlc) - Rattus 
norvegicus (Rat), 320 aa. 


14..304 
11. .303 


141/293 (48%) 
200/293 (68%) 


2e-77 


CAC38935 


SEQUENCE 9 FROM PATENT 
WOO 13 1014 - Homo sapiens 
(Human), 3 1 8 aa. 


5..304 
6..306 


145/302 (48%) 
206/302 (68%) 


2e-77 


CAC37756 


SEQUENCE 1 FROM PATENT 
WOO 125434 - Homo sapiens 
(Human), 3 1 7 aa. 


5..304 
5..305 


145/302 (48%) 
206/302 (68%) 


3e-77 


Q9H255 


Olfactory receptor 51E2 (Prostate 
specific G-protein coupled 
receptor) (HPRAJ) - Homo sapiens 
(Human), 320 aa. 


14..304 
11. .303 


139 293 (47%) 
198/293 (67%) 


2e-76 



PFam analysis predicts that the NOV66a protein contains the domains shown in the 
Table 66E. 



Table 66E. Domain Analysis of NO V66a 


Pfam Domain 


NOV66a Match Region 


Identities/ 
Similarities 
for the Matched Region 


Expect Value 


7tm_l: domain 1 of 2 


43..151 


30/111 (27%) 
73/1 1 1 (66%) 


6.3e-14 



292 



7tm_l: domain 2 of 2 


212..292 


16/92(17%) 


0.052 






52/92 (57%) 





Example 67. 

The NOV67 clone was analyzed, and the nucleotide and predicted polypeptide 
sequences are shown in Table 67A. 



Table 67A. NOV67 Sequence Analysis 




SEQIDNO:199 1 994 bp 


novo /a, 

CG59222-01 DNA Sequence 


cacaatgtcti;t:.:tt :-;atA'-;tttt w-ttata -t "■ >:ttcctcctaacgggcctc 
tcaggccttgaaagcagatatgacttgatttccctgcccatcttcttggtttatgcca 
cctcaattgccgggaacattagcatcctcttcattatcagaactgagtcttccctcca 
ccaaccgatgtattactttctgtcaatgctggcattcactgacctgggcctatctaac 
actaccttacctaccatgttcagtgtcttctggttccatgcccgggagatctccttca 
atgcttgtctggtccaaatgtacttcattcatgttttctcgattattgagtcagctgt 
actcctggctatggcctttgactgctttatagcaatctgtgaacccttgcgctatgca 
gccatcctaaccaatgatgtaatcattgggattgggttggcaattgctggaagggcct 
tggctctggtctttccagcttctttcctcttgaagaggcttcaatatcatgatgtcaa 
tattctgtcctacctcttctgcctgcaccaggacctcataaagacgactgtatccaac 
tgtcgagtcagcagcatctatggcctcatggtggtcatctgttccatgggacttgatt 
cagtgcttctcctcctctcctatgtcctcatcctgggcacagtgttgagtatagcctc 
caaggcagagagagtgagagccctcaatacttgcatctcccacatctgtgctgtactc 
accttctatacaccaatgattgggctatctatgatccatcgctatggacagaatgctt 
cctcaattgtccatgtgctgatggccaatgtctacttgctggttccacctctcatgaa 
ccccgttgtctacagtgttaagaccaagcagattcgtgacagaatcttcaataaattc 
aagaaacatgaagtgtagatgacagagattctgaaacataactttccctccattcccc 


ATATATTT 


h— ~ — — — _ 


ORF Start: ATG at 5 


ORF Stop: TAG at 944 




SEQ ID NO: 200 


313 aa MW at 35044.2kD 


NOV67a, 

CG59222-01 Protein Sequence 


MSVFNSSALYPRFLLTGLSGLESRYDLISLPIFLVYATSIAGNISILFIIRTESSLHQ 
PMYYFLSMLAFTDLGLSNTTLPTMFSVFWFHAREISFNACLVQMYFIHVFSIIESAVL 
LAMAFDCFIAICEPLRYAAILTNDVIIGIGLAIAGRALALVFPASFLLKRLQYHDVNI 
LSYLFCLHQDLIKTTVSNCRVSSIYGLMWICSMGLDSVLLLLSYVLILGTVLSIASK 
AERVRALNTCISHICAVLTFYTPMIGLSMIHRYGQNASSIVHVLMANVYLLVPPLMNP 
i r 1 :CFKKH3\. 



Further analysis of the NOV67a protein yielded the following properties shown in 
Table 67B. 



Table 67B. Protein Sequence Properties NOV67a 


PSort 
analysis: 


0.6000 probability located in plasma membrane; 0.4047 probability located in 
mitochondrial inner membrane; 0.4000 probability located in Golgi body; 
0.3480 probability located in mitochondrial intermembrane space 


SignalP 
analysis: 


No Known Signal Sequence Predicted 



A search of the NOV67a protein against the Geneseq database, a proprietary 
database that contains sequences published in patents and patent publication, yielded 
several homologous proteins shown in Table 67C. 
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Table 67C. Geneseq Results for NOV67a 


Geneseq 
Identifier 


Protein/Organism/Length [Patent 
#, Date] 


NOV67a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Region 


Expect 
Value 


AAG72605 


Human OR-like polypeptide query 
sequence, SEQ ID NO: 2286 - 
Homo sapiens, 318 aa. 
[WO200127158-A2, 19-APR-2001] 


1..309 
4..313 


295/310(95%) 
298/310 (95%) 


e-163 


AAG71519 


Human olfactory receptor 
polypeptide, SEQ ID NO: 1200 - 
Homo sapiens, 318 aa. 
| WO200 1 27 1 58-A2, 1 9-APR-200 1 J 


1..309 
4..313 


295/310(95%) 
298/310(95%) 


e-163 


AAU24683 


Human olfactory receptor 
AOLFR182 - Homo sapiens, 314 aa. 
[WO200168805-A2, 20-SEP-2001] 


5. .308 
9..312 


178/304 (58%) 
235/304 (76%) 


e-102 


AAG71715 


Human olfactory receptor 
polypeptide, SEQ ID NO: 1396 - 
Homo sapiens, 314 aa. 
1 WO200 1271 58-A2, 1 9-APR-200 1] 


5..308 
9..312 


178/304 (58%) 
235/304 (76%) 


e-102 


ABB44526 


Human GPCR4a polypeptide SEQ 
ID NO 1 1 - Homo sapiens, 3 1 5 aa. 
| W( )200 1 74904-A2. 1 1 -OCT-200 1 ] 


5..308 
6..309 


169/304 (55%) 
227/304 (74%) 


2e-96 



In a BLAST search of public sequence databases, the NOV67a protein was found to 
have homology to the proteins shown in the BLASTP data in Table 67D. 



Table 67D. Public BLASTP Results for NOV67a 


Protein 
Accession 
Number 


Protein/Organism/Length 


NOV67a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Portion 


Expect 
Value 


Q9H344 


Olfactory receptor 5112 
(HOR5'betal2) - Homo sapiens 
(Human), 312 aa. 


13..308 
12..307 


154/296 (52%) 
221/296 (74%) 


2e-91 


Q9H2C8 


ODORANT RECEPTOR 
HOR3'BETAl - Homo sapiens 
(Human), 321 aa. 


2..308 
10.316 


160/307 (52%) 
216/307 (70%) 


5e-89 


Q9H343 


Olfactory receptor 5111 
(HOR5'betal 1) - Homo sapiens 
(Human), 314 aa. 


5..312 
5..313 


156/309 (50%) 
223/309 (71%) 


9e-89 



294 



AAL35109 


PROSTATE-SPECIFIC G 
PROTEIN-COUPLED 
RECEPTOR RA1C - Mus 
musculus (Mouse), 320 aa. 


13. .309 
11. .307 


148/297 (49%) 
207/297 (68%) 


2e-86 


Q924X8 


OLFACTORY RECEPTOR S85 - 
Mus musculus (Mouse), 314 aa. 


2..304 
3..305 


150/303 (49%) 
221/303 (72%) 


le-85 



PFam analysis predicts that the NOV67a protein contains the domains shown in the 
Table 67E. 



Table 67E. Domain Analysis of NOV67a 


Pfam Domain 


NOV67a Match Region 


Identities/ 
Similarities 
for the Matched Region 


Expect Value 


7tm_l: domain 1 of 1 

I 


42..138 


24/99 (24%) 
67/99 (68%) 


7.8e-14 



Example 68. 

The NOV68 clone was analyzed, and the nucleotide and predicted polypeptide 
sequences are shown in Table 68A. 



Table 68A. NOV68 Sequence Analysis 




SEQIDNO:201 |981 bp 


NOV68a, 

CG59220-01 DNA Sequence 


gcaatga-saaacoS'.-A'-; v t: .- n • • "k;a :—t :r:n cagctggcc 

CCCAGACCCAGACTCTGCTCTTTGTGCTGTTCGTGGTGATTTGCCTCCTGACTGTGAT 

gggaaacctgctgctgctggtggtgattaatgctgattcttgcctccacacacccatg 

TACTTCTTCCTGGGACAATTGTCCTTCTTGGATCTCTGCCATTCCTCTGTCACTGCAC 
CTAAGCTGTTGGAGAACCTCCTGTCTGAGAAGAAAACCATCTCAGTAGAGGGCTGCAT 
GGCTCAGGTCTTCTTTGTGTTTGCCACTGGGGGCACTGAATCCTCCCTGCTTGCTGTG 
ATGGCCTATGACCGCTATGTTGCCATCAGCTCTCCTTTGCTCTATGGCCAAGTGATGA 
ACAGACAGCTGTGTTCAGGGCTGGTGGGGGGCTCATGGGGCTTGGCTTTTCTGGATGC 
CCTCATCAATATCCTTGTAGCTCTCAATTTAGACTTCTGTGAGGCTCAAAATATCCAC 
CACTTCAGCTGTGAGCTGCCCTCTCTCTATCCTTTGTCTTGCTCTGATGTGTCAGCAA 
GTTTTACCACCCTGCTCTGCTCCAGCTTCCTGCATTTCTTTGGAAATTTTCTCATGAT 
ATTCTTGTCTTATATTTGCATTTTGTCCACCATCCTGAGGATCAGCTCCACTACAGGC 
AGAAGCAAAGCCTTCTCCACCTGCTCCTCCCACCTCACTGCAGTGATTTTCTTTTATG 
GCTCCGGATTACTCCGCTATCTCATGCCAAATTCAGGATCCATTCAAGAGCTGATCTT 
CTCCTTGCAGTACAGCGTGATCACTCCCATGCTGAATCTCCTCATTTACAGCCTGAAG 
AACAGGGAGGTGAAGGCAGCTGTGAGAAGAACATTGAGAAAATATTTCTAGTGTTTCA 




ORF Start: ATG at 4 


ORF Stop: TAG at 919 




SEQ ID NO: 202 


305 aa ,MW at 5373: IkD 


NOV68a, 

CG59220-01 Protein Sequence 


MRNRSWPEFVLLGLSAGPQTQTLLFVLFWICLLTVMGNLLLLWINADSCLHTPMY 
FFLGQLSFLDLCHSSVTAPKLLENLLSEKKTISVEGCMAQVFFVFATGGTESSLLAVM 
AYDRYVAISSPLLYGQVMNRQLCSGLVGGSWGLAFLDALINILVALNLDFCEAQNIHH 
FSCELPSLYPLSCSDVSASFTTLLCSSFLHFFGNFLMIFLSYICILSTILRISSTTGR 
SKAFSTCSSHLTAVIFFYGSGLLRYLMPNSGSIQELIFSLQYSVITPMLNLLIYSLKN 
REVKAAVRRTLRKYF 



Further analysis of the NOV68a protein yielded the following properties shown in 
Table 68B. 
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Table 68B. Protein Sequence Properties NOV68a 


PSort 
analysis: 


0.6400 probability located in plasma membrane; 0.4600 probability located in 
Golgi body; 0.3700 probability located in endoplasmic reticulum (membrane); 
0.1000 probability located in endoplasmic reticulum (lumen) 


SignalP 
analysis: 


Likely cleavage site between residues 50 and 51 



A search of the NOV68a protein against the Geneseq database, a proprietary 
database that contains sequences published in patents and patent publication, yielded 
several homologous proteins shown in Table 68C. 



Table 68C. Geneseq Results for NOV68a 



Geneseq 
Identifier 


Protein/Organism/Length [Patent 
#, Date] 


NOV68a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Region 


Expect 
Value 


AAU24771 


Human olfactory receptor 
AOLFR328 - Homo sapiens, 312 aa. 
[WO200168805-A2, 20-SEP-2001] 


3..304 
5..306 


212/302 (70%) 
251/302 (82%) 


e-120 


AAG98585 


Mouse olfactory receptor 7 - Mus 
musculus domesticus, 214 aa. 
[WO200146262-A2. 28-JUN-2001] 


66..279 
1..214 


144/214(67%) 
169/214(78%) 


le-78 


AAG72680 


Murine OR-like polypeptide query 
sequence, SEQ ID NO: 2362 - Mus 
musculus, 337 aa. [WO200127158- 
A2, 19-APR-2001] 


3..305 
20..324 


148/305 (48%) 
201/305 (65%) 


3e-74 


AAG71546 


Human olfactory receptor 
polypeptide, SEQ ID NO: 1227 - 
Homo sapiens, 3 1 5 aa. 
[WO200 1 27 1 58- \2. i 9-APR-200 1 ] 


3..301 
5..306 


143/302(47%) 
201/302 (66%) 


2e-73 


AAG66701 


Human GPCR1 polypeptide - Homo 
sapiens, 311 aa. [WO200 160865- 
A2. 23-AUG-2001| 


3..301 
5..306 


143/302(47%) 
201/302 (66%) 


2e-73 



In a BLAST search of public sequence databases, the NOV68a protein was found to 
have homology to the proteins shown in the BLASTP data in Table 68D. 



296 





Table 68D. Public BLASTP Results for NOV68a 




Protein 
Accession 
Number 


Protein/Organism/Length 


NOV68a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Portion 


Expect 
Value 


Q9JM36 


OLFACTORY RECEPTOR - Mus 
musculus domesticus (western 
European house mouse), 214 aa 
(fragment). 


66..279 
1..214 


144/214 (67%) 
169/214(78%) 


5e-78 


AQA7 1 O 


OLFACTORY RECEPTOR - Mus 
musculus (Mouse), 312 aa. 


3 .299 
5..303 


142/299(47%) 
193/299(64%) 




Q9EPG6 


Bl OLFACTORY RECEPTOR - 
Mus musculus (Mouse), 314 aa. 


3..299 
5 .303 


140/299 (46%) 
196/299 (64%) 


2e-72 




P23266 


Olfactory receptor-like protein F5 - 
Rattus norvegicus (Rat), 3 1 3 aa. 


3 305 
5..309 


142/305 (46%) 
196/305 (63%) 


9e-72 


Q9EQA3 


ODORANT RECEPTOR K30 - 
Mus musculus (Mouse), 311 aa. 


3..305 
5..310 


143/306 (46%) 
202/306 (65%) 


2e-71 



PFam analysis predicts that the NOV68a protein contains the domains shown in the 
Table 68E. 



Table 68E. Domain Analysis of NOV68a 


Pfam Domain 


NOV68a Match Region 


Identities/ 
Similarities 
for the Matched Region 


Expect Value 


7tm_l : domain 1 of 1 


39..286 


54/268(20%) 
169/268 (63%) 


1.7e-29 



Example 69. 

The NOV69 clone was analyzed, and the nucleotide and predicted polypeptide 
sequences are shown in Table 69A. 



Table 69 A. NOV69 Sequence Analysis 




SEQ ID NO: 203 (957 bp 


NOV69a, 

CG592 18-01 DNA Sequence 


" _ATG - - T T n T" - - :■' f : :.\T 


TCTCTGGGAACCTGATCATCATCAGCCTGACCTTGCTGGACACCCGCCTGCAGACATC 

GTGCCCAAGATGCTGCTCAACATTGCCATGGGGACCAAGACCGTTAGCTTTGCTGGGT 
GCATTACCCAGGACTTTTTCTTTCCACATCTTCTGGGGGCCACAGAGTTCTTCCTCCT 
CACAGCCATGGCCTATGACCAGTATATTGCCATCTGCAAGCCCCTCCACTACCCCATG 
CTCATAAGTAGTAGAGTCTGCACACAGCTCATCCTCACCTGCTGGCTACTAGGTTTCT 
CCTTCATCATCATGCCTGTCATCCTGACCAGTCAGCTTCCATTCTGTGATACCCACAT 
CAAGCATTTCTTCTGTGACTACACGCCTCTAATGGAGGTGGTCTGCAGTGGGCCAAAG 
I T ( ATTTI I 11 " 1 HT^TT-r C 
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TCATCACCCTGTCCTATGTCCAGATCATCCAGACAATTGTCAGAATCCCCGCTGTCCA 

TATGACAGCTGCTTCTTTATGTATGTCAAGCCCTCTCCAGGAAAGTGGGTTGATGTCA 
ACAAGGGAGTGTCTCTAATCAATACAATTATTGCCCCACTGTTAAATCCCTTCATCTG 
TACTCTGAGGAACCAACAAGTTAAGCAGGTAATGAAAGACCTAGTCAGAAAAATGACT 
TTGTTCCAAAATAAATAAGGGCCCTAAAA 




ORF Start: ATG at 8 


ORF Stop: TAA at 944 




SEQIDNO: 204 


312 aa jMW at 35358. lkD 


NOV69a, 

CG59218-01 Protein Sequence 


MANQTWTEFFLQGLTDTKELQVAVFLLLLLAYLVTVSGNLIIISLTLLDTRLQTSMY 
LFLQNLSCLEIWFQTVIVPKMLLNIAMGTKTVSFAGCITQDFFFPHLLGATEFFLLTA 
MAYDQYIAICKPLHYPMLISSRVCTQLILTCWLLGFSFIIMPVILTSQLPFCDTHIKH 
FFCDYTPLMEWCSGPKVLEMVDFTLALVALFGTLVLITLSYVQIIQTIVRIPAVQER 
KKAFSTCSSHVIMVTMCYDSCFFMYVKPSPGKWVDVNKGVSLINTI IAPLLNPFICTL 
RMQQ^ i t KDL KMTL ?C J 



Further analysis of the NOV69a protein yielded the following properties shown in 
Table 69B. 



Table 69B. Protein Sequence Properties NOV69a 


PSort 
analysis: 


0.6000 probability located in plasma membrane; 0.4000 probability located in 
Golgi body; 0.3000 probability located in endoplasmic reticulum (membrane); 
0.0300 probability located in mitochondrial inner membrane 


SignalP 
analysis: 


Likely cleavage site between residues 40 and 41 



A search of the NOV69a protein against the Geneseq database, a proprietary 
database that contains sequences published in patents and patent publication, yielded 
several homologous proteins shown in Table 69C. 



Table 69C. Geneseq Results for NOV69a 


Geneseq 
Identifier 


Protein/Organism/Length [Patent 
#, Date] 


NOV69a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Region 


Expect 1 
Value j 


AAG72538 


Human OR-like polypeptide query 
sequence, SEQ ID NO: 2219 - 
Homo sapiens, 3 1 3 aa. 
[WO200127158-A2. 19-APR-2001] 


1..312 
1.313 


284/317(89%) 
293/317(91%) 


e-157 1 


AAG72229 


Human olfactory receptor 
polypeptide, SEQ ID NO: 1910 - 
Homo sapiens, 313 aa. 
[W( 12001 27 158-A2. 19- KPR-2001] 


1..312 
1 ..313 


284/317(89%) 
293/317(91%) 


e-157 I 


AAU24761 


Human olfactory receptor 
AOLFR1 12B - Homo sapiens, 309 


1..306 
1..306 


173/307 (56%) 
227/307 (73%) 


2e-96~j 
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J 2001] 








AAU24765 


Human olfactory receptor 
AOLFR225B - Homo sapiens, 309 
aa. [WO200168805-A2, 20-SEP- 
2001] 


1..306 
1..306 


166/307 (54%) 
227/307 (73%) 


2e-94 


AAG66353 


GPCR partial protein sequence - 
Unidentified, 313 aa. 
[WO200155179-A2, 02-AUG-2001] 


1..309 
1..310 


160/311 (51%) 
209/311 (66%) 


4e-87 



In a BLAST search of public sequence databases, the NOV69a protein was found to 
have homology to the proteins shown in the BLASTP data in Table 69D. 



Table 690. Public HI. VSTP Results for NOV69a 


Protein 
Accession 
Number 


Protein/Organism/Length 


NOV69a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Portion 


Expect 
Value 


O9Z1V0 


OLFACTORY RECEPTOR C6 - 
Mus musculus (Mouse), 313 aa. 


1..309 
1..310 


160/311 (51%) 
209/311 (66%) 


2e-86 




SEQUENCE 18 FROM PATENT 
WO01 64879 - Homo sapiens 
(Human), 331 aa. 


8..306 
12..311 


142/301 (47%) 
200/301 (66%) 


4e-78 


CAC88328 


SEQUENCE 22 FROM PATENT 
WO01 64879 - Homo sapiens 
(Human), 331 aa. 


8..306 
12..311 


142/301 (47%) 
198/301 (65%) 


2e-77 


CAC88327 


SEQUENCE 20 FROM PATENT 
WO01 64879 - Homo sapiens 
(Human), 331 aa. 


8..306 
12..3U 


141/301 (46%) 
198/301 (64%) 


8e-77 


070270 


OLFACTORY RECEPTOR- 
LIKE PROTEIN - Rattus 
norvegicus (Rat), 327 aa. 


3..308 
11-316 


136/307 (44%) 
208/307 (67%) 




4e-76 



PFam analysis predicts that the NOV69a protein contains the domains shown in the 
Table 69E. 
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Table 69E. Domain Analysis of NOV69a 



Pfam Domain 


NOV69a Match Region 


Identities/ 
Similarities 
for the Matched Region 


Expect Value 


7tm_l: domain 1 of 1 


39..244 


47/214 (22%) 
147/214 (69%) 


1.9e-25 



Example 70. 

The NOV70 clone was analyzed, and the nucleotide and predicted polypeptide 
sequences are shown in Table 70A. 



Table 70A. NOV70 Sequence Analysis 




SEQIDNO:205 1 962 bp 


NOV70a, 

CG59216-01 DNA Sequence 


T-TT" T T T T ATG7 T T T T T 
CTCCTTATCGGGATCACTGGACTGGAGAAAAGTCGCACCTGGATATCCATTCCTTTCT 
TATCTGTGTACCTTCTTTCTTGGATGGGTAATTTTACCGTCCTCTTTTTTATCAAGAC 
AGAGCAAAGCCTCCATGAACCTATGTATTATTTGCTTTCCATGCTCTCCATCTCTGAC 
CTAGGGCTGTCTCTGTCTTCCTTACCCATCACTTTGGGACTATTCCTATTTGATGTCC 
ATGAAATTCATGCAGCTCCATGCTTTGCCCAGGAATTTTTTATCCATCTGTTTACAGT 
CAGTGAAGCCTCTGTACTGTCTGTAATGGCATTTGACTGGTATGTGGCAATCCACAGT 
CCTTTGAGATACAGCACTATCTTAACTAGTCCCAGAGCCATCAAAACAGGGGTTCTTC 
TGACTTCCAAGAATGTTCTTTTGATCCTTCCACTGCCCTTTCTCTTGCAAAGGCTGAG 
ATATTGTCATCAAAACCTGCTCTCCCACTCCTATTGTCTCCACCAGGATGTCATGAAG 
CTGATGTGTTCTGACAACACAGTCAATGTTGTCTACGGACTCTGTGCAGGACTTTCTA 
CTATGCTGGACTTGGTGTTTATTACCTTCTCCTATATTATGATTTTAAGGGCTGTACT 
GGGAATTGCTACCCCCAGACAGCAGTTCAAGGCCCTCAACACGTGCATCTCTCACATC 
TGTGCTGTGCTTATCTTCTATGTGCCCACGCTGAGTGCTGCCATGCTCCACCAGTTTG 
CCAGGGATGTGTCTCCTATGATCCACGTCCTCATGGCTGATATTTTTCTGCTGGTGCC 
ACCCCTGTTGAATCCCATCGTGTACTGTGTGAAGACCCACCAAATCCGAGAAAAGGTT 
GT sGGAAi ITTi [ CAAAAGTAAGTTGATCAA 




OPvF Start: ATG at 17 


OPvF Stop: TGA at 956 




SEQ [D N< ): 206 


313 aa • MW at 35363.9kD 


NOV70a, 

CG592 16-01 Protein Sequence 


MSPLNDTKMEVLRFLLIGITGLEKSRTWISIPFLSVYLLSWMGNFTVLFFIKTEQSLH 
EPMYYLLSMLSISDLGLSLSSLPITLGLFLFDVHEIHAAPCFAQEFFIHLFTVSEASV 
LSVMAFDWYVAIHSPLRYSTILTSPRAIKTGVLLTSKNVLLILPLPFLLQRLRYCHQN 
LLSHSYCLHQDVMKLMCSDNTVNWYGLCAGLSTMLDLVFITFSYIMILRAVLGIATP 
RQQFKALNTC I SH I CAVL I FYVPTLSAAMLHQFARDVS PM IHVLMADIFLLVPPLLNP 
/C"K IOI-E "7YGKLCPKVS 



Further analysis of the NOV70a protein yielded the following properties shown in 
Table 70B. 



Table 70B. Protein Sequence Properties NOV70a 


PSort 
analysis: 


0.6000 probability located in plasma membrane; 0.4000 probability located in 
Golgi body; 0.3000 probability located in endoplasmic reticulum (membrane); 
0.2007 probability located in mitochondrial inner membrane 


SignalP 
analysis: 


Likely cleavage site between residues 50 and 51 
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A search of the NOWOa protein against the Geneseq database, a proprietary 
database that contains sequences published in patents and patent publication, yielded 
several homologous proteins shown in Table 70C. 



Table 70C. Geneseq Results for NOWOa 


Geneseq 
Identifier 


Protein/Organism/Length [Patent 
#, Date] 


NOV70a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Region 


Expect 
Value 


AAG72488 


Human OR-like polypeptide query 
sequence, SEQ ID NO: 2169 - 
Homo sapiens, 319 aa. 
[WO200127158-A2, 19-APR-2001] 


1..313 
1..313 


310/313(99%) 
310/313(99%) 


e-178 


AAG71557 


Human olfactory receptor 
polypeptide, SEQ ID NO: 1238 - 
Homo sapiens, 319 aa. 
[WO200127158-A2, 19-APR-2001] 


1-313 
1.313 


310/313(99%) 
310/313(99%) 


e-178 


AAU24573 


Human olfactory receptor 
AOLFR63 - Homo sapiens, 313 aa. 
[WO200168805-A2, 20-SEP-2001] 


1..311 
1.311 


186/311 (59%) 
246/311 (78%) 


e-110 


AAG71558 


Human olfactory receptor 
polypeptide, SEQ ID NO: 1239 - 
Homo sapiens, 313 aa. 
[WO200127158-A2, 19-APR-2001] 


1..311 
1.311 


185/311 (59%) 
245/311 (78%) 


e-109 


AAU24682 


Human olfactory receptor 
AOLFR181 - Homo sapiens, 312 aa. 
[WO200168805-A2, 20-SEP-2001] 


1.308 
1.306 


188/308 (61%) 
238/308 (77%) 


e-107 



In a BLAST search of public sequence databases, the NOV70a protein was found to 
have homology to the proteins shown in the BLASTP data in Table 70D. 



Table 70D. Public BLASTP Results for NOV70a 


Protein 
Accession 
Number 


Protein/Organism/Length 


NOV70a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Portion 


Expect 
Value 


CAC38935 


SEQUENCE 9 FROM PATENT 
WOO 1 3 1 0 1 4 - Homo sapiens 
(Human), 3 1 8 aa. 


5.305 
6.306 


145/302 (48%) 
207/302 (68%) 


5e-79 
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AAL35109 


PROSTATE-SPECIFIC G 
PROTEIN-COUPLED 
RECEPTOR RAIC-Mus 
musculus (Mouse), 320 aa. 


14..305 
11. .303 


141/293 (48%) 7e-79 
199/293 (67%) 


CAC37756 


SEQUENCE 1 FROM PATENT 
WO0125434 - Homo sapiens 
(Human), 317 aa. 


5..305 
5..305 


145/302(48%) j 7e-79 I 
207/302(68%) I 


088628 
Q9H255 


Olfactory receptor 51E2 (G-protein 
coupled receptor RAlc) - Rattus 
norvegicus (Rat), 320 aa. 


14..305 
11-303 


141/293(48%) I 7e-79 
200/293 (68%) 

| 


Olfactory receptor 51E2 (Prostate 
specific G-protein coupled 
receptor) (HPRAJ) - Homo sapiens 
(Human), 320 aa. 


14..305 
11. .303 


139/293 (47%) 7e-78 
198/293 (67%) ! 

! 

1 



PFam analysis predicts that the NOV70a protein contains the domains shown in the 
Table 70E. 



Table 70E. Domain Analysis of NOV70a 


Pfam Domain 


NOV70a Match Region 


Identities/ 
Similarities 
for the Matched Region 


Expect Value 


7tm_l: domain 1 of 2 


43..151 


30/1 1 1 (27%) 
73 1 1 1 (66%) 


6.3e-14 

J 


YCF9: domain 1 of 1 


208..262 


10/59(17%) 
31/59 (53%) 


7.5 


7tm_l : domain 2 of 2 


212..293 


18/93 (19%) 
55/93 (59%) 


0.00034 



Example 71. 

The NOV71 clone was analyzed, and the nucleotide and predicted polypeptide 
sequences are shown in Table 71 A. 



Table 71 A. NOV71 Sequence Analysis 


SEQIDNO:207 J 995 bp 


NOV71a, 

CG592 14-01 DNA Sequence 


GCACAATGTCTGTCTTCAATAGTTCTGCCTTATACCCTCGCTTCCTCCTAACGGGCCT 
CTCAGGCCTTGAAAGCAGATATGACTTGATTTCCCTGCCCATCTTCTTGGTTTATGCC 
ACCTCAATTGCCGGGAACATTAGCATCCTCTTCATTATCAGAACTGAGTCTTCCCTCC 
ACCAACCGATGTATTACTTTCTGTCAATGCTGGCATTCACTGACCTGGGCCTATCTAA 
CACTACCTTACCTACCATGTTCAGTGTCTTCTGGTTCCATGCCCGGGAGATCTCCTTC 
AATGCTTGTCTGGTCCAAATGTACTTCATTCATGTTTTCTCGATTATTGAGTCAGCTG 
TACTCCTGGCTATGGCCTTTGACTGCTTTATAGCAATCTGTGAACCCTTGCGCTATGC 
AGCCATCCTAACCAATGATGTAATCATTGGGATTGGGTTGGCAATTGCTGGAAGGGCC 


TTGGCTCTGGTCTTTCCAGCTTCTTTCCTCTTGAAGAGGCTTCAATATCATGATGTCA 
ATATTCTGTCCTACCTCTTCTGCCTGCACCAGGACCTCATAAAGACGACTGTATCCAA 
CTGTCGAGTCAGCAGCATCTATGGCCTCATGGTGGTCATCTGTTCCATGGGACTTGAT 

t(„,t fmT j i l< r "„ ( rT„ -vr^ ] 
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CCAAGGCAGAGAGAGTGAGAGCCCTCAATACTTGCATCTCCCACATCTGTGCTGTACT 
CACCTTCTATACACCAATGATTGGGCTATCTATGATCCATCGCTATGGACAGAATGCT 
TCCTCAATTGTCCATGTGCTGATGGCCAATGTCTACTTGCTGGTTCCACCTCTCATGA 
ACCCCGTTGTCTACAGTGTTAAGACCAAGCAGATTCGTGACAGAATCTTCAATAAATT 
CAAGAAACATGAAGTGTAGATGACAGAGATTCTGAAACATAACTTTCCCTCCATTCCC 


CATATATTT 




OPvF Start: ATG at 6 


ORF Stop: TAG at 945 




SEQ ID NO: 208 


313 aa |MW at 35044.2kD 


NOV71a. 

CG5 92 14-01 Protein Sequence 


MSVFNSSALYPRFLLTGLSGLESRYDLISLPIFLVYATSIAGNISILFIIRTESSLHQ 
PMYYFLSMLAFTDLGLSNTTLPTMFSVFWFHARE I SFNACLVQMYF IHVFS 1 1 ESAVL 
LAMAFDCFIAICEPLRYAAILTNDVIIGIGLAIAGRALALVFPASFLLKRLQYHDVNI 
LSYLFCLHQDLI KTTVSNCRVS SIYGLMWI CSMGLDSVLLLLS YVLI LGTVLS I ASK 
AERVRALNTC I SHI CAVLTFYTPMIGLSMI HRYGQNAS S I VHVLMANVYLLVP PLMNP 
VV YS VKT KQ I RDR I FNKFKKHE V 



Further analysis of the NOV71a protein yielded the following properties shown in 
Table 71B. 



Table 71B. Protein Sequence Properties NOV71a 


PSort 
analysis: 


0.6000 probability located in plasma membrane; 0.4047 probability located in 
mitochondrial inner membrane; 0.4000 probability located in Golgi body; 
0.3480 probability located in mitochondrial intermembrane space 


| SignalP 
\ analysis: 


No Known Signal Sequence Predicted 



A search of the NOV71a protein against the Geneseq database, a proprietary 
database that contains sequences published in patents and patent publication, yielded 
5 several homologous proteins shown in Table 71C. 



Table 71C. Geneseq Results for NOV71a 


Geneseq 
Identifier 


Protein/Organism/Length 
[Patent #, Date] 


NOV71a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Region 


Expect 
Value 


AAG72605 


Human OR-like polypeptide 
query sequence, SEQ ID NO: 
2286 - Homo sapiens, 318 aa. 
[WO200127158-A2, 19-APR- 
2001] 


1..309 
4..313 


295/310 (95%) 
298/310(95%) 


e-163 


AAG71519 


Human olfactory receptor 
polypeptide, SEQ ID NO: 1200 - 
Homo sapiens, 3 1 8 aa. 
[WO200127158-A2, 19-APR- 
2001] 


I..309 
4..313 


295/310(95%) 
298/310(95%) 


e-163 


AAU24683 


Human olfactory receptor 


5..308 
9..312 


178/304 (58%) 
235/304 (76%) 


e-102 



303 





aa. [WO200168805-A2, 20-SEP- 
2001] 








AAG71715 


Human olfactory receptor 
polypeptide, oJj/v,; iu iNw. IjVO - 

Homo sapiens, 314 aa. 
[WO200127158-A2, 19-APR- 
2001] 


5..308 


178/304 (58%) 


e-102 


ABB44526 


Human GPCR4a polypeptide 
SEQ ID NO 1 1 - Homo sapiens, 
315 aa. [WO200174904-A2, 11- 
C K 1-2001] 


5..308 
6..309 


169/304 (55%) 
227/304 (74%) 


2e-96 



In a BLAST search of public sequence databases, the NOV71a protein was found to 
have homology to the proteins shown in the BLASTP data in Table 7 ID. 



Table 71D. Public BLASTP Results for NOV71a 


Protein 
Accession 
Number 


Protein/Organism/Length 


NOV71a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Portion 


Expect 
Value 


Q9H344 


Olfactory receptor 5 1 12 
(HOR5'betal2) - Homo sapiens 
(Human), 312 aa. 


13.. 3 OS 
12..307 


154/296(52%) 
221/296 (74%) 


2e-91 


Q9H2C8 


ODORANT RECEPTOR 
HOR3'BETAl - Homo sapiens 
(Human), 321 aa. 


2..308 
10..316 


160/307 (52%) 
216/307 (70%) 


5e-89 


Q9H343 


Olfactory receptor 5111 
(HOR5'betal 1) - Homo sapiens 
(Human), 314 aa. 


5..312 
5. .313 


156/309 (50%) 
223/309 (71%) 


9e-89 


AAL35109 


PROSTATE-SPECIFIC G 
PROTEIN-COUPLED 
RECEPTOR RA1C - Mus 
musculus (Mouse), 320 aa. 


13..309 
11. .307 


148/297 (49%) 
207/297 (68%) 


2e-86 


Q924X8 


OLFACTORY RECEPTOR S85 - 
Mus musculus (Mouse), 3 1 4 aa. 


2..304 
3..305 


150/303 (49%) 
221/303 (72%) 


le-85 



PFam analysis predicts that the NOV71a protein contains the domains shown in the 
Table 7 IE. 
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Table 71E. Domain Analysis of NOV71a 


Pfam Domain 


NOV71a Match Region 


Identities/ 
Similarities 
for the Matched Region 


Expect Value 


7tm_l : domain 1 of 1 


42..138 


24/99 (24%) 
67/99 (68%) 


7.8e-14 



Example 72. 

The NOV72 clone was analyzed, and the nucleotide and predicted polypeptide 
sequences are shown in Table 72 A. 



Table 72A. NOV72 Sequence Analysis 




SEQ ID NO: 209 1 1004 bp 


NOV72a, 

CG5921 1-01 DNA Sequence 


CTTCTCATCTTTTCCCTCAAATACTGGGATGTCCATTCTCAATACCTCTGAAATGGAA 
ATCTCTATTTTCTACTTGGTTGGGATCCCAGGTTTGGAGCATGCCAATATTTGGATCT 
CTATCCCCATATGTCTCATGTACACTGTTGCTATCCTAGGGAATTGTACCATTCTGTT 
TTTCATAAAAACAGAGCCTTCTTTGCATGAGCCCATGTACTATTTTCTCTCCATGTTG 

TGTTCAATGCTCCAGGAATTTCCCCTGATGCCTGTATTGCTCAAGAGTTTTTCATCCA 
TGGATTCTCAGCTATGGAGTCATCTGTACTTCTTATAATGTCCTTTGATCGCTTTATT 
GCCATCTGCAACCCCCTGAGATACACTTCCATCCTCACCAGTGCCAGAGTCATTCAAA 
TTGGGCTTGCTTTTTCTCTCAAAAATGTTTTGTTGATCCTCCCATTTCCTTTCACTCT 
AAAACATCTAAAATATTGTAAGAAGAACCTCCTGTCCCAATCCTACTGCCTCCATCAA 
GATGTCATGAAACTGGCCTGCACTGACAACAAGGTCAACATCATCTATGGCTTATTTG 
TGGCTCTCACAGGCATCCTAGACTTGACATTTATTTTCATGTCCTACATGTTGATACT 
GAAAGCAGTGTTGAGCATAGCATCAAGAAAGAAAAGGCTCAAGGTCCTCAATACATGT 
GTTTCCCACATCTGTGCTGTGCTCATCTTCTATGTGCCCATTATCTCCCTAGCTGTCA 
TCTACCGGTTTGCCAAACACAGTTTCCCAATCACTAGGATCCTCATAGCTGATGCTTT 
TCTGCTGGTGCCTCCATTGATGAACCCCATTGTATACTGTGTGAAGAGCCAGCAGATA 
AGAAATCTTGTCTTAGAAAAACTGTGCCAGAAGCAAAGCTGAAGCGGATGCTTAACCA 
CATGATGCTTAACCCAAA 




ORF Start: ATG at 29 


ORF Stop: TGA at 968 




SI-.O ID NO; 210 


313 aa |MW at 35313. lkD 


NOV72a, 

CG59211-01 Protein Sequence 


MSILNTSEMEISIFYLVGIPGLEHANIWISIPICLMYTVAILGNCTILFFIKTEPSLH j 
EPMYYFLSMLALSDLGLSLSSLPTMLRIFLFNAPGISPDACIAQEFFIHGFSAMESSV I 
LLIMSFDRFIAICNPLRYTSILTSARVIQIGLAFSLKNVLLILPFPFTLKHLKYCKKN 1 
LLSQSYCLHQDVMKLACTDNKVNIIYGLFVALTGILDLTFIFMSYMLILKAVLSIASR | 
KKRLKVLNTCVSHI CAVLI FYVP 1 1 SLAV I YRFAKHSF PI TR I L I ADAFLLVPPLMNP 
I VYCVKSQQ I RNL VLE KLCQKQS 



Further analysis of the NOV72a protein yielded the following properties shown in 
Table 72B. 



Table 72B. Protein Sequence Properties NOV72a 


PSort 
analysis: 


0.6000 probability located in plasma membrane; 0.4000 probability located in 
Golgi body; 0.3000 probability located in endoplasmic reticulum (membrane); 
0.0300 probability located in mitochondrial inner membrane 


SignalP 
analysis: 


Likely cleavage site between residues 44 and 45 
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A search of the NOV72a protein against the Geneseq database, a proprietary 
database that contains sequences published in patents and patent publication, yielded 
several homologous proteins shown in Table 72C. 



Table 72C. Geneseq Results for NOV72a 


Geneseq 
Identifier 


Protein/Organism/Length [Patent 
#, Date] 


NOV72a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Region 


Expect 
Value 


AAG71564 


Human olfactory receptor 
polypeptide, SEQ ID NO: 1245 - 
Homo sapiens, 322 aa. 
[WO200127158-A2, 19-APR-2001] 


1.313 
5..317 


312/313(99%) 
312/313 (99%) 


e-177 


AAU24573 


Human olfactory receptor 
AOLFR63 - Homo sapiens, 313 aa. 
[WO200168805-A2, 20-SEP-2001] 


1..312 
1..312 


225/312(72%) 
272/312 (87%) 


e-131 


AAG71721 


Human olfactory receptor 
polypeptide, SEQ ID NO: 1402 - 
Homo sapiens, 316 aa. 
[WO200127158-A2, 19-APR-2001] 


1..311 
1-311 


236/312(75%) 
267/312(84%) 


e-131 


AAU24682 


Human olfactory receptor 
AOLFR181 - Homo sapiens, 312 
aa. [WO200168805-A2, 20-SEP- 
2001] 


1..308 
1..306 




224/308 (72%) 
265/308 (85%) 


e-131 


AAG71701 


Human olfactory receptor 
polypeptide, SEQ ID NO: 1382 - 
Homo sapiens, 312 aa. 
[WO200127158-A2, 19-APR-2001] 


1..308 
1..306 


224/308 (72%) 
265/308 (85%) 


,■.„■„■„ „,,„«, 

e-131 



In a BLAST search of public sequence databases, the NOV72a protein was found to 
have homology to the proteins shown in the BLASTP data in Table 72D. 



Table 72D. Public BLASTP Results for NOV72a 


Protein 
Accession 
Number 


Pr otei n/O rganism/Length 


NOV72a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Portion 


Expect 
Value 

1 


Q9H344 


Olfactory receptor 5112 
(HOR5'betal2) - Homo sapiens 
(Human), 312 aa. 


12..304 
10..303 


152/294 (51%) 
219/294(73%) 


6e-90 

i 
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Q9EQQ7 


MOR 3'BETA4 - Mus musculus 
(Mouse), 319 aa. 


1..309 
1..310 


159/310(51%) 
219/310(70%) 


9e-89 


Q9H343 


Olfactory receptor 5111 
(HOR5'betal 1) - Homo sapiens 
(Human), 314 aa. 


4..313 
4..314 


154/311 (49%) 
226/311 (72%) 


9e-89 


CAC38935 


SEQUENCE 9 FROM PATENT 
WO01 3 1014 - Homo sapiens 
(Human), 3 1 8 aa. 


5..305 
6..306 


153/302 (50%) 
217/302 (71%) 


2e-87 


CAC37756 


SEQUENCE 1 FROM PATENT 
WOO 125434 - Homo sapiens 
(Human), 3 1 7 aa. 


5..305 
5..305 


153/302 (50%) 
217/302(71%) 


3e-87 



PFam analysis predicts that the NOV72a protein contains the domains shown in the 
Table 72E. 



Table 72E. Domain Analysis of NOV72a 


Pfam Domain 


NOV72a Match 
Region 


Identities/ 
Similarities 
for the Matched 
Region 


Expect 
Value 


DUF40: domain 1 of 
1 


109.. 134 


10/26 (38%) 
20/26 (77%) 


0.38 


7tm_l: domain 1 of 2 


43.. 144 | 27/107(25%) 
J 71/107(66%) 


1.6e-15 


7tm_l : domain 2 of 2 


212..293 


16/93 (17%) 
56/93 (60%) 


4.7 


Sina: domain 1 of 1 


300..311 


7/12 (58%) 
10/12(83%) 


1 



Example 73. 

The NOV73 clone was analyzed, and the nucleotide and predicted polypeptide 
sequences are shown in Table 73A. 



Table 73A. NOV73 Sequence Analysis 




SEQIDNO:211 1 1581 bp 


NOV73a, 

CG5 9276-01 DNA Sequence 


CTGGTGGGTTGGCGGCTAAGGGGCGGAGACAAGAGGGGCCGCCACCATCTCCTCCAAT j 


GGAAGGGAGACAGGGGCGGGCTTAATGACGGAAGGAGCATGGCGTGGAGACACCTGAA 


AAAGCGGGCCCAGGATGCTGTGATCATCCTGGGGGGAGGAGGACTTCTCTTCGCCTCC 
TACCTGATGGCCACGGGAGATGAGCGTTTCTATGCTGAACACCTGATGCCGACTCTGC 
AGGGGCTGCTGGACCCGGAGTCAGCCCACAGACTGGCTGTTCGCTTCACCTCCCTGGG 
GCTCCTTCCACGGGCCAGATTTCAAGACTCTGACATGCTGGAAGTGAGAGTTCTGGGC 
CATAAATTCCGAAATCCAGTAGGAATTGCTGCAGGATTTGACAAGCATGGGGAAGCCG 
TGGACGGACTTTATAAGATGGGCTTTGGTTTTGTTGAGATAGGAAGTGTGACTCCAAA 
ACCTCAGGAAGGAAACCCTAGACCCAGAGTCTTCCGCCTCCCTGAGGACCAAGCTGTC 
ATTAACAGGTATGGATTTAACAGTCACGGGCTTTCAGTGGTGGAACACAGGTTACGGG 
CCAGACAGCAGAAGCAGGCCAAGCTCACAGAAGATGGACTGCCTCTGGGGGTCAACTT 
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GGGGAAGAACAAGACCTCAGTGGACGCCGCGGAGGACTACGCAGAAGGGGTGCGCGTA 
CTGGGCCCCCTGGCCGACTACCTGGTGGTGAATGTGTCCAGCCCCAACACTGCCGGGC 
TGCGGAGCCTTCAGGGAAAGGCCGAGCTGCGCCGCCTGCTGACCAAGGTGCTGCAGGA 
GAGGGATGGCTTGCGGAGAGTGCACAGGCCGGCAGTCCTGGTGAAGATCGCTCCTGAC 
CTCACCAGCCAGGATAAGGAGGACATTGCCAGTGTGGTCAAAGAGTTGGGCATCGATG 
GGCTGATTGTTACGAACACCACCGTGAGTCGCCCTGCGGGCCTCCAGGGTGCCCTGCG 
CTCTGAAACAGGAGGGCTGAGTGGGAAGCCCCTCCGGGATTTATCAACTCAAACCATT 
CGGGAGATGTATGCACTCACCCAAGGCAAGGTTTCCCGTCGAGTTCCCATAATTGGGG 
TTGGTGGTGTGAGCAGCGGGCAGGACGCGCTGGAGAAGATCCGGGCAGGGGCCTCCCT 
GGTGCAGCTGTACACGGCCCTCACCTTCTGGGGGCCACCCGTTGTGGGCAAAGTCAAG 
CGGGAACTGGAGGCCCTTCTGAAGGAGCAGGGCTTTGGCGGAGTCACAGATGCCATTG 
GAGCAGATCATCGGAGGATGAGGAAACGGGCAGAGAAGCGGCTGATTGTCCAGTCCCC 
CTGCGTGGAGGCTGCTTGGCTGGGCTCGAGCCCAGCGGTGGTGGGTCAGTTGGGACCT 
GGTGGTCTGCTGGTGGTCAGTTTGGGAATTTCCAGGTACGATTGTTTTCAGGCACTGT 
TCTTTGACTTGGTTGCAGAAAAACAGATTTTGCAACACTTTCCAAGGACACAGTGTTA 
CCACTCCCTCACCCTGCCATGGCCTCTTGGTTCTGCTTTTAACTTCTGAGCCTCAGGG 
AGTCCATCTTGTCTG 




ORF Start: ATG at 97 


ORF Stop: TGAat 1555 




SEQIDNO 212 


486 aa |MW at 52982.6kD 


NOV73a, 

CG59276-01 Protein Sequence 


MAWRHLKKRAQDAVIILGGGGLLFASYLMATGDERFYAEHLMPTLQGLLDPESAHRLA 
VRFTSLGLLPRARFQDSDMLEVRVLGHKFRNPVGIAAGFDKHGEAVDGLYKMGFGFVE 
IGSVTPKPQEGNPRPRVFRLPEDQAVINRYGFNSHGLSWEHRLRARQQKQAKLTEDG 
LPLGVNLGKNKTSVDAAEDYAEGVRVLGPLADYLWNVSSPNTAGLRSLQGKAELRRL 
LTKVLQERDGLRRVHRPAVLVKIAPDLTSQDKEDIASWKELGIDGLIVTNTTVSRPA 
GLQGALRSETGGLSGKPLRDLSTQTIREMYALTQGKVSRRVPIIGVGGVSSGQDALEK 
IRAGASLVQLYTALTFWGPPWGKVKRELEALLKEQGFGGVTDAIGADHRRMRKRAEK 
RLIVQSPCVEAAWLGSSPAWGQLGPGGLLWSJLGISRYDCFQALFFDLVAEKQILQH 
FIT r ^ LIJ ^FUF 



Further analysis of the NOV73a protein yielded the following properties shown in 
Table 73B. 



Table 73B. Protein Sequence Properties NOV73a 


Psort 
analysis: 


0.81 10 probability located in plasma membrane; 0.6400 probability located in 
endoplasmic reticulum (membrane); 0.3700 probability located in Golgi body; 
0.1839 probability located in microbody (peroxisome) 


SignalP 

analysis: 


No Known Signal Sequence Predicted 



A search of the NOV73a protein against the Geneseq database, a proprietary 
database that contains sequences published in patents and patent publication, yielded 
several homologous proteins shown in Table 73 C. 



Table 73( '.Geneseq Results for M>\ 73a 


Geneseq 
Identifier 


Protein/Organism/Length [Patent 
#, Date] 


NOV73a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Region 


Expect 
Value 


AAB70780 


Tobacco dihydro-orotase protein - 
Nicotiana tabacum, 458 aa. 
[WO200118190-A2, 15-MAR-2001] 


36..398 
81..458 


199/383 (51%) 
257/383 (66%) 


e-101 
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AAG01301 


Human secreted protein, SEQ ID 
NO: 5382 - Homo sapiens, 144 aa. 
[EP1033401-A2, 06-SEP-2000] 


1..144 
1.-144 


143/144 (99%) 
144/144 (99%) 


3e-79 


AAG91420 


C glutamicum protein fragment SEQ 
ID NO: 5 1 74 - Corynebacterium 
glutamicum, 371 aa. [EP1 108790- 
A2, 20-JUN-2001] 


76..396 
60. .366 


131/328 (39%) 
190/328 (56%) 


6e-60 


AAB46597 


C. glutamicum dihydroorotate 
dehydrogenase protein - 
Corynebacterium glutamicum, 321 
aa. [DE19929364-A1, 28-DEC- 
2000] 


76..396 
10..316 


131/328 (39%) 
190/328 (56%) 


6e-60 


AAB80123 


Corynebacterium glutamicum MP 
protein sequence SEQ ID NO:980 - 
Corynebacterium glutamicum, 334 
aa. [WO200100843-A2, 04-JAN- 
2001] 


76..396 
23..329 


131/328 (39%) 
190/328 (56%) 


le-59 


In a BLAST search of public sequence databases, the NOV73a protein was found to 
have homology to the proteins shown in the BLASTP data in Table 73D. 




Table 73D. Public BLASTP Results for NOV73a 




Protein 
Accession 
Number 


Protein/Organism/Length 


NOV73a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Portion 


Expect 
Value 


Q02127 


Dihydroorotate dehydrogenase, 
mitochondrial precursor (EC 1 .3.3 . 1 ) 
(Dihydroorotate oxidase) 
(DHOdehase) - Homo sapiens 
(Human), 396 aa (fragment). 


1..399 
2..396 


392/399 (98%) 
394/399 (98%) 


0.0 


PC1219 


dihydroorotate oxidase (EC 1.3.3.1) 
precursor - human, 397 aa. 


1..399 
3..397 


388/399 (97%) 
393/399 (98%) 


0.0 


Q63707 


Dihydroorotate dehydrogenase, 
mitochondrial precursor (EC 1.3.3.1) 
(Dihydroorotate oxidase) 
(DHOdehase) - Rattus norvegicus 
(Rat), 395 aa. 


1..399 
1..395 


350/399 (87%) 
369/399 (91%) 


0.0 


035435 


Dihydroorotate dehydrogenase, 
mitochondrial precursor (EC 1.3.3.1) 
(Dihydroorotate oxidase) 
(DHOdehase) - Mus musculus 
(Mouse), 395 aa. 


1..399 
1..395 


346/399 (86%) 
366/399 (91%) 


0.0 
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Q9FZM9 


DIHYDROOROTATE 


29.398 


206/394 (52%) e-101 




DEHYDROGENASE - Oryza sativa 


79..468 


261/394(65%) 




(Rice), 468 aa. 




1 



PFam analysis predicts that the NOV73a protein contains the domains shown in the 
Table 73E. 



Table 73E. Domain Analysis of NOV73a 


Pfam Domain 


NOV73a Match 
Region 


Identities/ 
Similarities 
for the Matched 
Region 


Expect 
Value 


DHOdehase: domain 1 of 
1 


77..381 


183/331 (55%) 
282/331 (85%) 


1.9e-169 



Example 74. 



The NOV74 clone was analyzed, and the nucleotide and predicted polypeptide 
5 sequences are shown in Table 74A. 



Table 74A. NOV74 Sequence Analysis 




SEQIDNO:213 1 1875 bp 


NOV74a, 

CG59268-01 DNA Sequence 


ATGl": JVT- -1 T - • 7, -1 ; T-- - 
TCTCCACCACAAGCAACGAAGCCTGCAAGCTGTTCGATGCCACGCTGACCCAGTATGT 
AAAATGGACCAATGACAAGAGTCTCGGTGGCATCGAGGGCTGCCTGTCAAAGCTCAAA 
GCAGCAGATCCAACCTTTGTGATGGGCCACGCCATGGCTACTGGCCTTGTGCTGATTG 
GCACTGGAAGCTCCGTGAAGCTGGACAAAGAGCTGGACCTGGCTGTGAAGACAATGGT 
GGAGATTTCAAGAACCCAGCCGCTGACAAGGCGGGAGCAGCTGCACGTGTCTGCAGTA 
GAGACATTTGCCAATGGGAACTTTCCGAAAGCCTGTGAACTATGGGAACAGATTCTCC 
AGGACCACCCGACAGACATGTTGGCCCTGAAATTTTCCCATGATGCTTATTTTTACCT 
GGGCTATCAGGAACAGATGAGAGATTCTGTTGCTCGAATTTACCCCTTCTGGACACCT 
GACATCCCCCTAAGCAGCTATGTGAAAGGCATCTACTCTTTTGGCTTGATGGAAACCA 
ACTTCTACGACCAGGCAGAAAAACTCGCCAAAGAGGCACCAACTCTTTGTCTTCAACA 
CCAGCACCCCACAGACAACTACTGGGCAGGAAAAGCAGGCTGTGATGGGGCCAGGAGT 
GGTAACACATGGGCTCTGTGTCTGCAGCCCCAGGCTGACGCATGGTCGGTGCACACCG 
TCGCTCACATCCACGAGATGAAAGCAGAGATCAAGGATGGGTTGGAATTCATGCAGCA 
CTCAGAGACCTTCTGGAAGGACTCTGATATGTTGGCTTGTCATAACTATTGGCACTGG 
GCTTTATATCTGATTGAGAAGGGTTTAATAAGGAGAACTTTATTCTTCCAGGGCGAAT 
ATGAGGCCGCGCTGACCATCTACGATACCCACATCCTTCCCAGCCTGCAGGCCAACGA 
TGCAATGCTGGACGTGGTGGACAGCTGCTCCATGCTCTACCGCCTGCAGATGGAAGGA 






ACCACATCCTGCTGTTCAATGACGCACACTTCCTGATGGCATCCCTGGGTGCACACGA 
CCCCCAGACCACACAGGAGCTGCTGACCACCCTGCGGGACGCCAGCGAGTATGCAGAG 
GGGCCTTCTCGGGGTGGGGGTCCTCACCCTGCCGAGAGGTGCCAGGCCTTTGCCTGTA 
TTATCAGCAATCCTGACGGTTCTGTTAGATTGGCACTGTTATGCCTGCTTACAGATGA 
GCAAACTGAGGCTGGAAGATCCCCAGGGGAGAACTGCCAGCACCTCCTGGCCCGAGAC 
GTGGGGCTGCCCCTGTGCCAGGCCCTGGTGGAGGCTGAGGACGGGAACCCTGACCGCG 
TCCTGGAGCTGCTCCTGCCCATCCGCTACCGGATCGTCCAGCTCGGTGGGAGCAATGC 
CCAGAGAGACGTCTTCAACCAGCTGCTGATTCACGCGGCCTTAAACTGCACCTCCAGC 
GTCCATAAGAACGTAGCCCGGAGCCTTCTGATGGAGCGTGATGCCTTGAAGCCCAACT 
CGCCCCTGACCGAGCGGCTCATCCGCAAGGCAGCTACCGTCCACCTCATGCAGAAGCC 
TTCTACCCGCCAACCCCCACTGCAGGCTGCTCTCTCCATGGAAGGAGGCGGCGGCCGC 
GATGAGCCTTCAGCCTGCCGGGCAGGGGACGTGAACATGGATGACCCTAAGAAGGAAG 






rGATTT T i rGA 




ORF Start: ATG at 1 |ORF Stop: TGA at 1873 
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SEQIDNO:214 1 624 aa |MW at 69393.3kD 


NOV74a, 

CG59268-01 Protein Sequence 


MAAAS PLRDCQAWKDARLPLSTTSNEACKLFDATLTQYVKWTNDKSLGGI EGCLSKLK 
AADPTFVMGHAMATGLVLIGTGSSVKLDKELDLAVKTMVEISRTQPLTRREQLHVSAV 
ETFANGNFPKACELWEQILQDHPTDMLALKFSHDAYFYLGYQEQMRDSVARIYPFWTP 
DIPLSSYVKGIYSFGLMETNFYDQAEKLAKEAPTLCLQHQHPTDNYWAGKAGCDGARS 
GNTWALCLQPQADAWSVHTVAHIHEMKAEIKDGLEFMQHSETFWKDSDMLACHNYWHW 
ALYLIEKGLIRRTLFFQGEYEAALTIYDTHILPSLQANDAMLDWDSCSMLYRLQMEG 
VSVGQRWQDVLPVARKHSRDHILLFNDAHFLMASLGAHDPQTTQELLTTLRDASEYAE 
GPSRGGGPHPAERCQAFACIISNPDGSVRIALLCLLTDEQTEAGRSPGENCQHLLARD 
VGLPLCQALVEAEDGNPDRVLELLLPIRYRIVQLGGSNAQRDVFNQLLIHAALNCTSS 
VHKNVARSLLMERDALKPNSPLTERLIRKAATVHLMQKPSTRQPPLQAALSMEGGGGR 
DEPSACRAGDVKMDDPKKBGKSLLLRRCCCSGCSVEMEGDLMFP 



Further analysis of the NOV74a protein yielded the following properties shown in 
Table 74B. 



Table 74B. Protein Sequence Properties NOV74a 


PSort 
analysis: 


0.4328 probability located in mitochondrial matrix space; 0.3000 probability 
located in microbody (peroxisome); 0.1 137 probability located in 
mitochondrial inner membrane; 0.1 137 probability located in mitochondrial 
intermembrane space 


SignalP 
analysis: 


No Known Signal Sequence Predicted 



A search of the NOV74a protein against the Geneseq database, a proprietary 
database that contains sequences published in patents and patent publication, yielded 
several homologous proteins shown in Table 74C. 



Table 74C. Geneseq Results for \<)V74;t 


Geneseq 
Identifier 


Protein/Organism/Length 
[Patent #, Date] 


NOV74a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Region 


Expect [ 
Value i 


AAM41338 


Human polypeptide SEQ ID NO 
6269 - Homo sapiens, 478 aa. 
[WO200153312-A1, 26-JUL- 
2001] 


1..559 
10..478 


4(>v559 (82%) 
466/559 (82%) 


0.0 


AAM39552 


Human polypeptide SEQ ID NO 
2697 - Homo sapiens, 453 aa. 
[WO200153312-A1, 26-JUL- 
2001] 


1..529 
1..439 


434/529 (82%) 
437/529 (82%) 


0.0 


AAG02871 


Human secreted protein, SEQ ID 
NO: 6952 - Homo sapiens, 104 aa. 
[EP1033401-A2, 06-SEP-2000] 


1..102 
1..102 


102/102(100%) 
102/102(100%) 


le-52 | 
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AAM40893 


Human polypeptide SEQ ID NO 
5824 - Homo sapiens, 746 aa. 
[WO200153312-A1, 26-JUL- 
2001] 


568..604 
1..37 


32/37 (86%) 
32/37 (86%) 


2e-10 


AAM40892 


Human polypeptide SEQ ID NO 
5823 - Homo sapiens, 746 aa. 
[WO200153312-A1, 26-JUL- 
2001] 


568..604 
1..37 


32/37 (86%) 
32/37 (86%) 


2e-10 


In a BLAST search of public sequence databases, the NOV74a protein was found to 
have homology to the proteins shown in the BLASTP data in Table 74D. 


Table 74D. Public BLASTP Results for NOV74a 


Protein 
Accession 
Number 


Protein/Organism/Length 


NOV74a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Portion 


Expect 
Value 


AAH18918 


HYPOTHE riC \l 45.7 KDA 
[ PROTEIN - Homo sapiens 
i (Human), 404 aa. 


66..559 
1..404 


399/494 (80%) 
402/494(80%) j 




0.0 


Q9NW P8 


KAIA2372 PROTEIN - Homo 
sapiens (Human), 336 aa. 


1..352 
1..310 


305/352 (86%) 
308/352 (86%) 


e-172 


Q9XW02 


Y54G1 1 A.4 PROTEIN - 
Caenorhabditis elegans, 497 aa. 


4..556 
6..458 


165/557 (29%) 
256/557(45%) ! 


3e-61 


Q9XW01 


Y54G1 1 A.7 PROTEIN - 
Caenorhabditis elegans, 407 aa. 


4..347 
6..305 


122/347 (35%) 
177/347(50%) 


7e-53 


Q98CS1 


MLR5032 PROTEIN - 
Rhizobium loti (Mesorhizobium 
loti), 440 aa. 


60..553 
46..435 


145/496 (29%) 
215/496(43%) 


le-43 



PFam analysis predicts that the NOV74a protein contains the domains shown in the 
Table 74E. 



Table 74E. Domain Analysis of NOV74a 


Pfam Domain 


NOV74a Match 
Region 


Identities/ 
Similarities 
for the Matched 
Region 


Expect 
Value 


Monooxygenase: domain 1 
of 1 


225..410 


28/238 (12%) 
121/238 (51%) 


6.4 



312 



Example 75. 

The NOV75 clone was analyzed, and the nucleotide and predicted polypeptide 
sequences are shown in Table 75A. 



Table 75A. NOV75 Sequence Analysis 




SEQ ID NO: 215 j 1851 bp 


NOV75a, 

CG59549-01 DNA Sequence 


CAGCTACAGCAAACATCGTTCGAGATGTCCCACCAAGAGGGCAGCACAGGTGGCTTAC 
CAGACTTAGTGACTGAAAGCCTGTTCAGCAGCCCAGAGGAGCAGTCTGGAGTAGCAGC 
GGTGACGGCGGCCTCCTCAGACATTGAAATGGCAGCCACAGAGCCATCGACCGGAGAT 

CAGACTCAGAAAGTTCAAGTGAAGACGTCGAACTTGAAAGCATGGGTGAAGGTTTATT 
TGGTTACCCGTTAGTGGGAGAGGAGACAGAAAGGGAGGAGGAAGAAGAAGAGATGGAG 
GAGGAAGGGGAGGAGGAAGAACAGCCTCGGATGTGTCCACGATGCGGTGGCACCAACC 

ATCTGCCCTGCCCCGATCTCGCTGGCAAGTCCTTACTGCTCTTCGCCAGCGGCAGCTG 
GGTTCAAGTGCCCGCTTTGTATATGAGGCCTGTGGGGCAAGAACCTTTGTGCAGCGTT 
TCCGCCTGCAGTATCTTCTTGGAAGCCATGCCGGTTCTGTCAGTACCATACACTTTAA 
CCAGCGTGGCACCCGACTGGCCAGTAGCGGTGATGACTTAAGGGTGATAGTGTGGGAC 
TGGGTGCGGCAGAAGCCAGTACTGAACTTTGAGAGTGGTCACGATATTAATGTCATCC 




ACAGGTACGGGTAGCAGAACTAATTAATGCATCATATTGCGAGAATACTAAGCGTGTG 
GCCAAGCACAGGGGACCTGCCCACGAGTTGGCTCTGGAGCCAGACTCTCCTTATAAGT 
TCCTCACTTCAGGTGAAGATGCCGTTGTGTTCACCATTGACCTCAGGCAAGACCGGCC 
AGCTTCAAAAGTTGTGGTAACAAGAGAAAATGATAAGAAAGTCGGACTGTATACAATC 
TCTATGAATCCTGCCAATATTTACCAATTTGCAGTGGGTGGACATGATCAGTTTGTAA 
GGATTTATGACCAGAGGAGAATTGATAAGAAAGAAAACAATGGAGTACTCAAGAAATT 
CACTCCTCATCATCTGGTTTATTGTGATTTCCCAACAAACATCACCTGCGTTGTGTAC 
AGCCACGATGGCACAGAGCTCCTGGCCAGCTACAATGATGAAGATATTTACCTCTTCA 
ACTCCTCTCTCAGTGATGGTGCTCAATATGTTAAGAGATATAAGGGGCACAGAAATAA 
TGACACAATCAAATGTGTTAATTTCTATGGCCCCCGGAGTGAGTTTGTCGTGAGCGGT 
AGTGATTGTGGGCACGTCTTCTTCTGGGAGAAATCATCCTCCCAGATCATCCAGTTCA 
TGGAGGGGGACAGAGGAGATATAGTAAACTGTCTTGAACCCCACCCTTACCTACCTGT 

GCCACTGAGCTTACTGGGTTAAAAGATGTGATTAAGAAGAACAAGCAGGAGCGAGATG 
AAGACAACTTGAACTATACGGACTCGTTTGACAACCGCATGCTTCGGTTCTTCGTGCG 
TCACCTGTTACAGAGAGCTCATCAACCCGGCTGGAGAGATCATGGAGCTGAGTTCCCA 
GATGAAGAAGAGTTGGATGAGTCTTCCAGCACCTCAGATACATCCGAGGAGGAGGGCC 
AAGATCGAGTGCAGTGCATACCATCCTGAAGGCCTCATATCCAGTCCAGCTAG 


|ORF Start: ATG at 25|ORF Stop: TGA at 1825 


SEQ ID NO: 216 600 aa |MW at 67372.4RD 


NOV75a, 

CG59549-01 Protein Sequence 


MSHQEGSTGGLPDLVTESLFSSPEEQSGVAAVTAASSDIEMAATEPSTGDGGDTRDGG 
FLNDASTENQNTDSESSSEDVELESMGEGLFGYPLVGEETEREEEEEEMEEEGEEEEQ 
PRMCPRCGGTNHDQCLLDEDQALEEWISSETSALPRSRWQVLTALRQRQLGSSARFVY 
EACGARTFVQRFRLQYLLGSHAGSVSTIHFNQRGTRLASSGDDLRVIVWDWVRQKPVL 
NFESGHDINVIQAKFFPNCGDSTLAMCGHDGQVRVAELINASYCENTKRVAKHRGPAH 
ELALEPDSPYKFLTSGEDAWFTIDLRQDRPASKVWTREKDKKVGLYTISMNPANIY 
QFAVGGHDQFVRIYDQRRIDKKENNGVLKKFTPHHLVYCDFPTNITCWYSHDGTELL 
ASYNDEDIYLFNSSLSDGAQYVKRYKGHRNNDTIKCVNFYGPRSEFWSGSDCGHVFF 
WEKSSSQIIQFMEGDRGDIVNCLEPHPYLPVLATSGLDQHVRIWTPTAKTATELTGLK 
DVIKKNKQERDEDNLNYTDSFDNRMLRFFVRHLLQRAHQPGWRDHGAEFPDEEELDES 
8 S T S DT S E E 3 GQD R V . C I P S 



Further analysis of the NOV75a protein yielded the following properties shown in 
Table 75B. 



Table 75B. Protein Sequence Properties NOV75a 


PSort 
analysis 


0.6500 probability located in cytoplasm; 0.1000 probability located in 
mitochondrial matrix space; 0.1000 probability located in lysosome (lumen); 
0.0442 probability located in microbody (peroxisome) 
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I SignalP 
I analysis: 



No Known Signal Sequence Predicted 



A search of the NOV75a protein against the Geneseq database, a proprietary 
database that contains sequences published in patents and patent publication, yielded 
several homologous proteins shown in Table 75C. 



Table 75C. Geneseq Results for NOV75a 


Geneseq 
Identifier 


Protein/Organism/Length [Patent 
#, Date] 


NOV75a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Region 


Expect 
Value 


AAR85870 


WD-40 domain-contg. Mus 
musculus protein - Mus musculus, 
816 aa. [W09521252-A2, 10- 
AUG-1995] 


95..589 


295/495 (59%) 


e-179 


AAM73935 


Human bone marrow expressed 
probe encoded protein SEQ ID NO: 
34241 - Homo sapiens, 164 aa. 
[WO200157276-A2, 09-AUG- 
2001] 


1..157 
8.. 164 


157/157(100%) 
157/157 (100%) 


2e-87 


AAM61216 


Human brain expressed single exon 
probe encoded protein SEQ ID NO: 
33321 - Homo sapiens, 164 aa. 
[WO200157275-A2, 09-AUG- 
2001] 


1..157 
8..164 


157/157(100%) 
157/157(100%) 


2e-87 | 


AAM34114 


Peptide #8151 encoded by probe 
for measuring placental gene 
expression - Homo sapiens, 164 aa. 
[WO200157272-A2, 09-AUG- 
2001] 


I..157 
8.. 164 


157/157(100%) 
157/157(100%) 


2e-87 | 


AAB57007 


Human prostate cancer antigen 
protein sequence SEQ ID NO: 1585 
- Homo sapiens, 214 aa. 
[\\ O?00055 1 74-A 1,21 -SEP-2000] 


408..600 
22..214 


144/194(74%) 
162/194 (83%) 


2e-80 | 



In a BLAST search of public sequence databases, the NOV75a protein was found to 
have homology to the proteins shown in the BLASTP data in Table 75D. 
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1 able 75D. Public BLASTP Results for NOV 75a 


} 

Protein j 
Accession j 
Number 


Protein/Organism/Length 


NOV75a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for j 
the Matched 
Portion 


Expect 
Value 


Q12839 


H326 PROTEIN - Homo sapiens 
(Human), 597 aa. 


1..600 
1..597 


408/604(67%) 1 
471/604 (77%) 


0.0 


Q01078 


PROTEIN PC326 - Mus musculus 
(Mouse), 747 aa. 


95..589 
264..746 


295/495 (59%) 
372/495 (74%) 


e-178 


Q9W091 


CG8001 PROTEIN - Drosophila 
melanogaster (Fruit fly), 748 aa. 


68..587 
209..711 


178/533 (33%) 
280/533 (52%) 


le-77 


Q96E00 


UNKNOWN (PROTEIN FOR 
MGC:9478) - Homo sapiens 
(Human), 273 aa. 


1..246 
1..243 


141/249 (56%) 
173/249 (68%) 


8e-66 


Q9M1E5 


HYPOTHETICAL 54.0 KDA 
PROTEIN - Arabidopsis thaliana 
(Mouse-ear cress), 481 aa. 


183. .536 
42..419 


136/382 (35%) 
209/382 (54%) 


2e-62 



PFam analysis predicts that the NOV75a protein contains the domains shown in the 
□ Table 75E. 

W 



Table 75E. Domain Analysis of NOV75a 



Pfam Domain 


NOV75a Match Region . 


Identities/ 
Similarities 
for the Matched Region 


Expect Value 


WD40: domain 1 of 7 


188..224 


13/37 (35%) 
29/37 (78%) 


0.0016 


WD40: domain 2 of 7 


231. .269 


12/39 (31%) 
26/39 (67%) 


11 


WD40:domain3of7 


278..315 


9/38 (24%) 
24/38 (63%) 


2.2e+02 


WD40: domain 4 of 7 


326..363 


8/38(21%) J 8.8 
27/38 (71%) j 


S~ — — — 

WD40: domain 5 of 7 


382..418 


5/37 (14%) 
27/37 (73%) 


12 


WD40: domain 6 of 7 


429..466 


6/38 (16%) 
26/38 (68%) 


18 


WD40: domain 7 of 7 


473. .509 


11/37(30%) 
22/37(59%) 


0.51 
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Example 76. 

The NOV76 clone was analyzed, and the nucleotide and predicted polypeptide 
sequences are shown in Table 76A. 



Tabic 76A. NOV76 Sequence Analysis 




SEQIDNO:217 ! 7497 bp 


NOV76a, 

CG5 964 1-01 DNA Sequence 


ATGGTCTTGCTTCTTTGTCTATCTTGTCTGATTTTCTCCTGTCTGACCTTTTCCTGGT 
TAAAAATCTGGGGGAAAATGACGGACTCCAAGCCGATCACCAAGAGTAAATCAGAAGC 
AAACCTCATCCCGAGCCAGGAGCCCTTTCCAGCCTCTGATAACTCAGGGGAGACACCG 
CAGAGAAATGGGGAGGGCCACACTCTGCCCAAGACACCCAGCCAGGCCGAGCCAGCCT 
CCCACAAAGGCCCCAAAGATGCCGGTCGGCGGAGAAACTCCCTACCACCCTCCCACCA 
GAAGCCCCCAAGAAACCCCCTTTCTTCCAGTGACGCAGCACCCTCCCCAGAGCTTCAA 
GCCAACGGGACTGGGACACAAGGTCTGGAGGCCACAGATACCAATGGCCTGTCCTCCT 
CAGCCAGGCCCCAGGGCCAGCAAGCTGGCTCCCCCTCCAAAGAAGACAAGAAGCAGGC 
AAACATCAAGAGGCAGCTGATGACCAACTTCATCCTGGGCTCTTTTGATGACTACTCC 
TCCGACGAGGACTCTGTTGCTGGCTCATCTCGTGAGTCTACCCGGAAGGGCAGCCGGG 
CCAGCTTGGGGGCCCTGTCCCTGGAGGCTTATCTGACCACAAGGCCGAGCATGTCGGG 
ACTCCACCTGGTGAAGAGGGGACGGGAACACAAGAAGCTGGACCTGCACAGAGACTTT 
ACCGTGGCTTCTCCCGCTGAGTTTGTCACACGCTTTGGGGGGGATCGGGTCATCGAGA 
AGGTGCTTATTGCCAACAACGGGATTGCCGCCGTGAAGTGCATGCGCTCCATCCGCAG 
GTGGGCCTATGAGATGTTCCGCAACGAGCGGGCCATCCGGTTTGTTGTGATGGTGACC 
CCCGAGGACCTTAAGGCCAACGCAGAGTACATCAAGATGGCGGATCATTACGTCCCCG 
TCCCAGGAGGGCCCAATAACAACAACTATGCCAACGTGGAGCTGATTGTGGACATTGC 
CAAGAGAATCCCCGTGCAGGCGGTGTGGGCTGGCTGGGGCCATGCTTCAGAAAACCCT 
AAACTTCCGGAGCTGCTGTGCAAGAATGGAGTTGCTTTCTTAGGCCCTCCCAGTGAGG 
CCATGTGGGCCTTAGGAGATAAGATCGCCTCCACCGTTGTCGCCCAGACGCTACAGGT 
CCCAACCCTGCCCTGGAGTGGAAGCGGCCTGACAGTGGAGTGGACAGAAGATGATCTG 
CAGCAGGGAAAAAGAATCAGTGTCCCAGAAGATGTTTATGACAAGGGTTGCGTGAAAG 
ACGTAGATGAGGGCTTGGAGGCAGCAGAAAGAATTGGTTTTCCATTGATGATCAAAGC 
TTCTGAAGGTGGCGGAGGGAAGGGAATCCGGAAGGCTGAGAGTGCGGAGGACTTCCCG 
ATCCTTTTCAGACAAGTACAGAGTGAGATCCCAGGCTCGCCCATCTTTCTCATGAAGC 
TGGCCCAGCACGCCCGTCACCTGGAAGTTCAGATCCTCGCTGACCAGTATGGGAATGC 
TGTGTCTCTGTTTGGTCGCGACTGCTCCATCCAGCGGCGGCATCAGAAGATCGTTGAG 
GAAGCACCGGCCACCATCGCCCCGCTGGCCATATTCGAGTTCATGGAGCAGTGTGCCA 
TCCGCCTGGCCAAGACCGTGGGCTATGTGAGTGCAGGGACAGTGGAATACCTCTATAG 
TCAGGATGGCAGCTTCCACTTCTTGGAGCTGAATCCTCGCTTGCAGGTGGAACATCCC 
TGCACAGAAATGATTGCTGATGTTAATCTGCCGGCCGCCCAGCTACAGATCGCCATGG 
GCGTGCCACTGCACCGGCTGAAGGATATCCGGCTTCTGTATGGAGAGTCACCATGGGG 
AGTGACTCCCATTTCTTTTGAAACCCCCTCAAACCCTCCCCTCGCCCGAGGCCACGTC 
ATTGCCGCCAGAATCACCAGCGAAAACCCAGACGAGGGTTTTAAGCCGAGCTCCGGGA 
CTGTCCAGGAACTGAATTTCCGGAGCAGCAAGAACGTGTGGGGTTACTTCAGCGTGGC 
CGCTACTGGAGGCCTGCACGAGTTTGCGGATTCCCAATTTGGGCACTGCTTCTCCTGG 
GGAGAGAACCGGGAAGAGGCCATTTCGAACATGGTGGTGGCTTTGAAGGAACTGTCCA 
TCCGAGGCGACTTTAGGACTACCGTGGAATACCTCATTAACCTCCTGGAGACCGAGAG 
CTTCCAGAACAACGACATCGACACCGGGTGGTTGGACTACCTCATTGCTGAGAAAGTG 
CAGGCGGAGAAACCGGATATCATGCTTGGGGTGGTATGCGGGGCCTTGAACGTGGCCG 
ATGCGATGTTCAGAACGTGCATGACAGATTTCTTACACTCCCTGGAAAGGGGCCAGGT 
CCTCCCAGCGGATTCACTACTGAACCTCGTAGATGTGGAATTAATTTACGGAGGTGTT 
AAGTACATTCTCAAGGTGGCCCGGCAGTCTCTGACCATGTTCGTTCTCATCATGAATG 
GCTGCCACATCGAGATTGATGCCCACCGGCTGAATGATGGGGGGCTCCTGCTCTCCTA 
CAATGGGAACAGCTACACCACCTACATGAAGGAAGAGGTTGACAGTTACCGAATTACC 
ATCGGCAATAAGACGTGTGTGTTTGAGAAGGAGAACGATCCTACAGTCCTGAGATCCC 
CCTCGGCTGGGAAGCTGACACAGTACACAGTGGAGGATGGGGGCCACGTTGAGGCTGG 
GAGCAGCTACGCTGAGATGGAGGTGATGAAGATGATCATGACCCTGAACGTTCAGGAA 
AGAGGCCGGGTGAAGTACATCAAGCGTCCAGGTGCCGTGCTGGAAGCAGGCTGCGTGG 
TGGCCAGGCTGGAGCTCGATGACCCTTCTAAAGTCCACCCGGCTGAACCGTTCACAGG 
AGAACTCCCTGCCCAGCAGACACTGCCCATCCTCGGAGAGAAACTGCACCAGGTCTTC 
CACAGCGTCCTGGAAAACCTCACCAACGTCATGAGTGGCTTTTGTCTGCCAGAGCCCG 
TTTTTAGCATAAAGCTGAAGGAGTGGGTGCAGAAGCTCATGATGACCCTCCGGCACCC 
GTCACTGCCGCTGCTGGAGCTGCAGGAGATCATGACCAGCGTGGCAGGCCGCATCCCC 
GCCCCTGTGGAGAAGTCTGTCCGCAGGGTGATGGCCCAGTATGCCAGCAACATCACCT 
CGGTGCTGTGCCAGTTCCCCAGCCAGCAGATAGCCACCATCCTGGACTGCCATGCAGC 
CACCCTGCAGCGGAAGGCTGATCGAGAGGTCTTCTTCATCAACACCCAGAGCATCGTG 
CAGTTGGTCCAGAGATACCGCAGCGGGATCCGCGGCTATATGAAAACAGTGGTGTTGG 
ATCTCCTGAGAAGATACTTGCGTGTTGAGAGCAAGGCAAGAGATGCTGATGCCAACAC 
CAGTGGGATGGTGGGGGGCGTGAGGAGCCTGAGCTTTACCTCTGTGTGGTGTTTTGTC 
1 : ' JC'GAATCCCACTACGACAAGTGTGTGATAAACCTCAGGGAGCAGTTCAAGCCAG 
ACATGTCCCAGGTGCTGGACTGCATCTTCTCCCACGCACAGGTGGCCAAGAAGAACCA 
GCTGGTGATCATGTTGATCGATGAGCTGTGTGGCCCAGACCCTTCCCTGTCGGACGAG 
T MlT > r I / 
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CCCTCAGAGCCCGGCAGATCCTGATTGCCTCCCACCTCCCCTCCTACGAGCTGCGGCA 
TAACCAGGTGGAGTCCATTTTCCTGTCTGCCATTGACATGTACGGCCACCAGTTCTGC 
CCCGAGAACCTCAAGAAATTAATACTTTCGGAAACAACCATCTTCGACGTCCTGCCTA 
CTTTCTTCTATCACGCAAACAAAGTCGTGTGCATGGCGTCCTTGGAGGTTTACGTGCG 
GAGGGGCTACATCGCCTATGAGTTAAACAGCCTGCAGCACCGGCAGCTCCCGGACGGC 
ACCTGCGTGGTAGAATTCCAGTTCATGCTGCCGTCCTCCCACCCAAACCGGATGACCG 
TGCCCATCAGCATCACCAACCCTGACCTGCTGAGGCACAGCACAGAGCTCTTCATGGA 
CAGCGGCTTCTCCCCACTGTGCCAGCGCATGGGAGCCATGGTAGCCTTCAGGAGATTC 
GAGGACTTCACCAGAAATTTTGATGAAGTCATCTCTTGCTTCGCCAACGTGCCCAAAG 
ACACCCCCCTCTTCAGCGAGGCCCGCACCTCCCTATACTCCGAGGATGACTGCAAGAG 
CCTCAGAGAAGAGCCCATCCACATTCTGAATGTGTCCATCCAGTGTGCAGACCACCTG 
GAGGATGAGGCACTGGTGCCGATTTTACGGACATTCGTACAGTCCAAGAAAAATATCC 
TTGTGGATTATGGACTCCGACGAATCACATTCTTGATTGCCCAAGAGTTTGCAGAAGA 
TCGCATTTACCGTCACTTGGAACCTGCCCTGGCCTTCCAGCTGGAACTTAACCGGATG 
CGTAACTTCGATCTGACCGCCGTGCCCTGTGCCAACCACAAGATGCACCTTTACCTGG 
GTGCTGCCAAGGTGAAGGAAGGTGTGGAAGTGACGGACCATAGGTTCTTCATCCGCGC 
CATCATCAGGCACTCTGACCTGATCACAAAGGAAGCCTCCTTCGAATACCTGCAGAAC 
GAGGGTGAGCGGCTGCTCCTGGAGGCCATGGACGAGCTGGAGGTGGCGTTCAATAACA 
CCAGCGTGCGCACCGACTGCAACCACATCTTCCTCAACTTCGTGCCCACTGTCATCAT 
GGACCCCTTCAAGATCGAGGAGTCCGTGCGCTACATGGTTATGCGCTACGGCAGCCGG 
CTGTGGAAACTCCGTGTGCTACAGGCTGAGGTCAAGATCAACATCCGCCAGACCACCA 
CCGGCAGTGCCGTTCCCATCCGCCTGTTCATCACCAATGAGTCGGGCTACTACCTGGA 
CATCAGCCTCTACAAAGAAGTGACTGACTCCAGATCTGGAAATATCATGTTTCACTCC 
TTCGGCAACAAGCAAGGGCCCCAGCACGGGATGCTGATCAATACTCCCTACGTCACCA 
AGGATCTGCTCCAGGCCAAGCGATTCCAGGCCCAGACCCTGGGAACCACCTACATCTA 
TGACTTCCCGGAAATGTTCAGGCAGGCAAGTCCGGCGGCTCAGACGCGGGTACATGTG 
CACAATGTGCAGGCTCTCTTTAAACTGTGGGGCTCCCCAGACAAGTATCCCAAAGACA 
TCCTGACATACACTGAATTAGTGTTGGACTCTCAGGGCCAGCTGGTGGAGATGAACCG 
ACTTCCTGGTGGAAATGAGGTGGGCATGGTGGCCTTCAAAATGAGGTTTAAGACCCAG 
GAGTACCCGGAAGGACGGGATGTGATCGTCATCGGCAATGACATCACCTTTCGCATTG 
rCTTTGGCCCTGGAGAGGACCTTCTGTACCTGCGGGCATCCGAGATGGCCCGGGC 
AGAGGGCATTCCCAAAATTTACGTGGCAGCCAACAGTGGCGCCCGTATTGGCATGGCA 
GAGGAGATCAAACACATGTTCCACGTGGCTTGGGTGGACCCAGAAGACCCCCACAAAA 
AAAAAAAAACAGTGGCTTTCAGTGCAGGGAACTGGATTCGTAGCCTCACTAAAGTATT 
TTTTAAGGGATTTAAATACCTGTACCTGACTCCCCAAGACTACACCAGAATCAGCTCC 
CTGAACTCCGTCCACTGTAAACACATCGAGGAAGGAGGAGAGTCCAGATACATGATCA 
CGGATATCATCGGGAAGGATGATGGCTTGGGCGTGGAGAATCTGAGGGGCTCAGGCAT 
GATTGCTGGGGAGTCCTCTCTGGCTTACGAAGAGATCGTCACCATTAGCTTGGTGACC 
TGCCGAGCCATTGGGATTGGGGCCTACTTGGTGAGGCTGGGCCAGCGAGTGATCCAGG 
TGGAGAATTCCCACATCATCCTCACAGGAGCAAGTGCTCTCAACAAGGTCCTGGGAAG 
AGAGGTCTACACATCCAACAACCAGCTGGGTGGCGTTCAGATCATGCATTACAATGGT 
GTCTCCCACATCACCGTGCCAGATGACTTTGAGGGGGTTTATACCATCCTGGAGTGGC 
TGTCCTATATGCCAAAGGATAATCACAGCCCTGTCCCTATCATCACACCCACTGACCC 
CATTGACAGAGAAATTGAATTCCTCCCATCCAGAGCTCCCTACGACCCCCGGTGGATG 
CTTGCAGGAAGGCCTCACCCAACTCTGAAGGGAACGTGGCAGAGCGGATTCTTTGACC 
ACGGCAGTTTCAAGGAAATCATGGCACCCTGGGCGCAGACCGTGGTGACAGGACGAGC 
AAGGCTTGGGGGGATTCCCGTGGGAGTGATTGCTGTGGAGACACGGACTGTGGAGGTG 
GCAGTCCCTGCAGACCCTGCCAACCTGGATTCTGAGGCCAAGATAATTCAGCAGGCAG 
GACAGGTGTGGTTCCCAGACTCAGCCTACAAAACCGCCCAGGCCGTCAAGGACTTCAA 
CCGGGAGAAGTTGCCCCTGATGATCTTTGCCAACTGGAGGGGGTTCTCCGGTGGCATG 
AAAGACATGTATGACCAGGTGCTGAAGTTTGGAGCCTACATCGTGGACGGCCTTAGAC 
AATACAAACAGCCCATCCTGATCTATATCCCGCCCTATGCGGAGCTCCGGGGAGGCTC 
CTGGGTGGTCATAGATGCCACCATCAACCCGCTGTGCATAGAAATGTATGCAGACAAA 
GAGAGCAGGGGTGGTGTTCTGGAACCAGAGGGGACAGTGGAGATTAAGTTCCGAAAGA 
AAGATCTGATAAAGTCCATGAGAAGGATCGATCCAGCTTACAAGAAGCTCATGGAACA 
GCTAGGGGAACCTGATCTCTCCGACAAGGACCGAAAGGACCTGGAGGGCCGGCTAAAG 
GCTCGCGAGGACCTGCTGCTCCCCATCTACCACCAGGTGGCGGTGCAGTTCGCCGACT 
TCCATGACACACCCGGCCGGATGCTGGAGAAGGGCGTCATATCTGACATCCTGGAGTG 
GAAGACCGCACGCACCTTCCTGTATTGGCGTCTGCGCCGCCTCCTCCTGGAGGACCAG 
GTCAAGCAGGAGATCCTGCAGGCCAGCGGGGAGCTGAGTCACGTGCATATCCAGTCCA 
TGCTGCGTCGCTGGTTCGTGGAGACGGAGGGGGCTGTCAAGGCCTACTTGTGGGACAA 
CAACCAGGTGGTTGTGCAGTGGCTGGAACAGCACTGGCAGGCAGGGGATGGCCCGCGC 
TCCACCATCCGTGAGAACATCACGTACCTGAAGCACGACTCTGTCCTCAAGACCATCC 
GAGGCCTGGTTGAAGAAAACCCCGAGGTGGCCGTGGACTGTGTGATATACCTGAGCCA 
GCACATCAGCCCAGCTGAGCGGGCGCAGGTCGTTCACCTGCTGTCTACCATGGACAGC 
CCGGCCTCCACCTGA 



OKI ' Start: ATG at 1 )ORF Stop: TGA at 7495 





SEQ ID NO: 218 


2498 aa (MW at 280484.4kD 


NOV76a, 

CG59641-01 Protein Sequence 


MVLLLCLSCLIFSCLTFSWLKIWGKMTDSKPITKSKSEANLIPSQEPFPASDNSGETP 
QRNGEGHTLPKTPSQAEPASHKGPKDAGRRRNSLPPSHQKPPRNPLSSSDAAPSPELQ 
ANGTGTQGLEATDTNGLSSSARPQGQQAGSPSKEDKKQANIKRQLMTNFILGSFDDYS 
SDEDSVAGSSRESTRKGSRASLGALSLEAYLTTRPSMSGLHLVKRGREHKKLDLHRDF 
TVASPAEFVTRFGGDRVIEKVLIANNGIAAVKCMRSIRRWAYEMFRNERAIRFWMVT 
P EDLKANAE Y I KMADH YVP VPGG PNNNNYANVELI VD I AKRI P VQAVWAGWGHAS ENP 
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KLPELLCKNGVAFLGPPSEAMWALGDKIASTWAQTLQVPTLPWSGSGLTVEWTEDDL 
QQGKRISVPEDWDKGCVKDVDEGLEAAERIGFPLMIKASEGGGGKGIRKAESAEDFP 
ILFRQVQSEIPGSPIFLMKLAQHARHLEVQILADQYGNAVSLFGRDCSIQRRHQKIVE 
EAPAT I APLAI FE FMEQCAI RLAKTVGYVSAGTVE YLYSQDGSFHFLELNPRLQVEHP 
CTEMIADVNLPAAQLQIAMGVPLHRLKDIRLLYGESPWGVTPISFETPSNPPLARGHV 
IAARITSENPDEGFKPSSGTVQELNFRSSKNVWGYFSVAATGGLHEFADSQFGHCFSW 
GENREEAI SNMWALKELS I RGDFRTTVEYL I WLLETESFQNNDIDTGWLDYLI AEKV 
QAEKPDIMLGWCGALNVADAMFRTCMTDFLHSLERGQVLPADSLLNLVDVELIYGGV 
KYILKVARQSLTMFVLIMNGCHIEIDAHRLNDGGLLLSYNGNSYTTYMKEEVDSYRIT 
IGNKTCVFEKENDPTVLRSPSAGKLTQYTVEDGGHVEAGSSYAEMEVMKMIMTLNVQE 
RGRVKYIKRPGAVLEAGCWARLELDDPSKVHPAEPFTGELPAQQTLPILGEKLHQVF 
HSVLENLTNVMSGFCLPEPVFS I KLKEWVQKLMMTLRHPSLPLLELQE IMTSVAGR I P 
APVEKSVRRVMAQYASNITSVLCQFPSQQIATILDCHAATLQRKADREVFFINTQSIV 
QLVQRYRSGIRGYMKTWLDLLRRYLRVESKARDADANTSGMVGGVRSLSFTSVWCFV 
SPESHYDKOTINLREQFKPDMSQVLDCIFSHAQVAKKNQLVIMLIDELCGPDPSLSDE 
LISILNELTQLSKSEHCKVALRARQILIASHLPSYELRHNQVESIFLSAIDMYGHQFC 
PENLKKLILSETTIFDVLPTFFYHANKWCMASLEVYVRRGYIAYELNSLQHRQLPDG 
TCWEFQFMLPSSHPNRMTVPISITNPDLLRHSTELFMDSGFSPLCQRMGAMVAFRRF 
EDFTRNFDEVISCFANVPKDTPLFSEARTSLYSEDDCKSLREEPIHILNVSIQCADHL 
EDEALVPILRTFVQSKKNILVDYGLRRITFLIAQEFAEDRIYRHLEPALAFQLELNRM 
RNFDLTAVPCANHKMHLYLGAAKVKEGVEVTDHRFFIRAIIRHSDLITKEASFEYLQN 
EGERLLLEAMDELEVAFNNTSVRTDCNHIFLNFVPTVIMDPFKIEESVRYMVMRYGSR 
LWKLRVLQAEVKINIRQTTTGSAVPIRLFITNESGYYLDISLYKEVTDSRSGNIMFHS 
FGNKQGPQHGMLINTPYVTKDLLQAKRFQAQTLGTTYIYDFPEMFRQASPAAQTRVHV 
HNVQALFKLWGSPDKYPKDILTYTELVLDSQGQLVEMNRLPGGNEVGMVAFKMRFKTQ 
EYPEGRDVIVIGNDITFRIGSFGPGEDLLYLRASEMARAEGIPKIYVAANSGARIGMA 
EEIKHMFHVAWVDPEDPHKKKKTVAFSAGNWIRSLTKVFFKGFKYLYLTPQDYTRISS 
LNSVHCKHIEEGGESRYMITDIIGKDDGLGVENLRGSGMIAGESSLAYEEIVTISLVT 
CRAIGIGAYLVRLGQRVIQVENSHIILTGASALNKVLGREVYTSNNQLGGVQIMHYNG 
VSHITVPDDFEGVYTILEWLSYMPKDNHSPVPIITPTDPIDREIEFLPSRAPYDPRWM 
LAGRPHPTLKGTWQSGFFDHGSFKE IMAPWAQTWTGRARLGGI PVGVIAVETRTVEV 
AVPADPANLDSEAKI I QQAGQVWFPDSAYKTAQAVKDFNREKLPLMI FANWRGFSGGM 
KDMYDQVLKFGAYIVDGLRQYKQPILIYIPPYAELRGGSWWIDATINPLCIEMYADK 
ESRGGVLEPEGTVEIKFRKKDLIKSMRRIDPAYKKLMEQLGEPDLSDKDRKDLEGRLK 
AREDLLLPI YHQVAVQFADFHDTPGRMLEKGVI SD I LEWKTARTFLYWRLRRLLLEDQ 
VKQEILQASGELSHVHIQSMLRRWFVETEGAVKAYLWDN3SIQVWQWLEQHWQAGDGPR 
STIRENITYLKHDSVLKTIRGLVEENPEVAVDCVIYLSQHISPAERAQWHLLSTMDS 
j PAST 



Further analysis of the NOV76a protein yielded the following properties shown in 
Table 76B. 



Table 76B. Protein Sequence Properties NOV76a 


PSort 
analysis: 


0.6850 probability located in endoplasmic reticulum (membrane); 0.6400 
probability located in plasma membrane; 0.4600 probability located in Golgi 
body; 0.1000 probability located in endoplasmic reticulum (lumen) 


SignalP 
analysis: 


Likely cleavage site between residues 25 and 26 



A search of the NOV76a protein against the Geneseq database, a proprietary 
database that contains sequences published in patents and patent publication, yielded 
several homologous proteins shown in Table 76C. 
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Table 76C. Geneseq Results for NOV76a | 


Geneseq 
Identifier 


Protein/Organism/Length 
[Patent #, Date] 


NOV76a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Region 


Expect 
Value 


AAU32848 


Novel human secreted protein 
#3339 - Homo sapiens, 2486 aa. 
[WO200179449-A2, 25-OCT- 
2001] 


26..2498 
1..2486 


2316/2555 (90%) 
2339/2555 (90%) 


0.0 


AAR05707 

j 


Acetyl-CoA-carboxylase - Gallus 
sp, 2324 aa. [JP02057179-A, 26- 
FEB-1990] 


163..2498 
17..2324 


1728/2375 (72%) 
2003/2375 (83%) 


0.0 

1 


AAB86033 


Bovine acetyl-coenzyme A 
carboxylase-alpha protein 
fragment - Bos taurus, 2288 aa. 
[DEI 9946 173-A1, 05-APR-2001] 


204..2497 
14..2288 


1719/2342 (73%) 
1969/2342 (83%) 


0.0 


AAR9881 1 


Erysiphe graminis acetyl 
coenzyme A carboxylase - 
Erysiphe graminis f.sp.hordei, 
2273 aa. [FR2727129-A1, 24- 
MAY-1996] 


235..2490 
42..2271 


1045/2326 (44%) 
1432/2326 (60%) 


0.0 


AAY24150 


Candida albicans acetyl CoA 
carboxylase - Candida albicans, 
2270 aa. [W09932635-A1, 01- 
JUL-1999] 


239..2489 
8S..2269 


1015/2300 (44%) 
1396/2300 (60%) 


0.0 



In a BLAST search of public sequence databases, the NOV76a protein was found to 
have homology to the proteins shown in the BLASTP data in Table 76D. 



Table 76D. Public BLASTP Results for NOV76a 



Protein 
Accession 
Number 


Protein/Organism/Length 


NOV76a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Portion 


Expect 
Value 


000763 


Acetyl-CoA carboxylase 2 (EC 
6.4.1.2) (ACC-beta) [Includes: 
Biotin carboxylase (EC 6.3.4.14)] 
- Homo sapiens (Human), 2483 aa. 


1..2498 
1..2483 


2349/2528 (92%) 
2384/2528 (93%) 


0.0 


070151 


ACETYL-COA CARBOXYLASE 
- Rattus norvegicus (Rat), 2456 aa. 


1 ..2497 
1..2455 


2068/2524 (81%) 
2224/2524 (87%) 


0.0 
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CAA48770 


ACETYL-COA CARBOXYLASE 
(EC 6.4.1.2) - Homo sapiens 
(Human). 2339 aa. 


163..2498 
17..2339 


1921/2390 (80%) 
2086/2390 (86%) 


0.0 




Acetyl-CoA carboxylase (EC 
6.4.1.2) (ACC) [Includes: Biotin 
carboxylase (EC 6.3.4.14)] - 
Gallus gallus (Chicken), 2324 aa. 


1 i a no 

17..2324 


1 1 52.115 1 j ( /Z /o) 
2004/2375 (83%) 


0.0 


PI 1497 


Acetyl-CoA carboxylase 1 (EC 
6.4.1.2) (ACC-alpha) [Includes: 
Biotin carboxylase (EC 6.3.4.14)] 
- Rattus norvegicus (Rat), 2345 aa. 


163..2497 
17..2345 


1736/2396 (72%) 
1993/2396 (82%) 


0.0 



PFam analysis predicts that the NOV76a protein contains the domains shown in the 
Table 76E. 



Table 76E. Domain Analysis of NOV76a 


Pfam Domain 


NOV76a Match 
Region 


Identities/ 
Similarities 
for the Matched 
Region 


Expect 
Value 


CPSase L chain: domain 1 
of 1 


249.372 


49/132 (37%) 
117/132 (89%) 


2.2e-57 


CPSase L D2: domain 1 of 
1 


374..619 


102/253 (40%) 
218/253 (86%) 


6.6e-U8 


Biotin carb C: domain 1 of 
1 


640. .747 


40/118(34%) 
100/118 (85%) 


1.9e-53 


biotin lipoyl: domain 1 of 1 


885..951 


22/75 (29%) 
56/75 (75%) 


6.5e-17 


Carboxyl trans: domain 1 of 
2 


1783..1878 


31/100(31%) 
88/100 (88%) 


7.4e-34 


GTP cyclohydrol: domain 1 
of 1 


2287..2304 


6/18(33%) 
13/18(72%) 


6.6 


Carboxyl trans: domain 2 of 
2 


1897..2374 


191/504 (38%) 
447/504 (89%) 


4.1e-258 



Example 77. 



The NOV77 clone was analyzed, and the nucleotide and predicted polypeptide 
sequences are shown in Table 77A. 
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Table 77A. NOV77 Sequence Analysis 




SEQIDNO:219 j 1624 bp 


NOV77a, 

CG59630-01 DNA Sequence 


CGCGCGCCGGGGATGGAGCCGCAGCCCGGCGGCGCCCGGAGCTGCCGGCGCGGGGCCC 
CCGGCGGCGCCTGCGAGCTGGGCCCGGCGGCCGAGGCGGCGCCCATGAGCCTCGCCAT 
CCACAGCACCACGGGCACCCGCTACGACCTGGCCGTGCCGCCCGACGAGACGGTGGAG 
GGGCTGCGCAAGCGGTTGTCCCAGCGCCTCAAAGTGCCCAAGGAGCGCCTGGCTCTTC 
TCCACAAAGACACGCGGCTCAGTTCGGGGAAGCTGCAGGAGTTCGGCGTGGGTGATGG 
CAGCAAGCTGACCTTGGTACCCACCGTGGAAGCGGGCCTCATGTCTCAGGCCTCAAGG 
CCGGAACAGTCCGTGATGCAAGCTCTCGAGAGTCTCACGGAGACGCAGCCCCCAGCGG 
CGCCCGGGCCGGGCCGGGCTGGCGGAGGAGGCTTCCGGAAATACAGATTCATTTTATT 
TAAGCGTCCGTGGCACCGACAGGGACCCCAGAGCCCAGAGAGGGGCGGCGAGAGGCCC 
CAGGTCAGTGACTTCCTGTCGGGCCGTTCGCCACTGACACTGGCCTTGCGTGTGGGCG 
ACCACATGATGTTCGTGCAGCTGCAGCTCGCGGCCCAGCACGCTCCACTGCAACACCG 
CCATGTGCTGGCCGCTGCGGCCGCCGCCGCTGCTGCGCGGGGGGACCCCAGCATAGCC 
TCCCCCGTGTCCTCGCCCTGCCGGCCGGTGTCCAGTGCCGCCCGAGTCCCCCCGGTGC 
CCACCAGCCCGTCCCCTGCATCTCCCTCGCCCATCACAGCCGGCTCCTTCCGGTCCCA 
CGCAGCCTCCACCACCTGCCCGGAGCAGATGGACTGCTCCCCCACGGCCAGCAGCAGT 
GCCAGTCCTGGTGCCAGCACCACGTCTACCCCAGGGGCCAGCCCTGCCCCCCGCTCCC 
GAAAACCCGGCGCCGTCATCGAGAGCTTTGTGAATCACGCCCCGGGGGTCTTCTCAGG 
GACCTTCTCTGGCACGCTACACCCCAACTGCCAAGACAGCAGCGGGCGGCCGCGGCGT 
GACATCGGCACCATCCTGCAGATCCTGAACGACCTCCTGAGCGCCACCCGGCACTACC 
AGGGCATGCCCCCTTCGCTGGCCCAGCTCCGCTGCCACGCCCAGTGCTCCCCGGCCTC 
ACCGGCCCCCGACCTGGCCCCCAGAACTACCTCCTGCGAGAAGCTCACGGCTGCCCCC 
TCAGCCTCCCTGCTGCAGGGCCAGAGCCAGATCCGCATGTGCAAGCCCCCGGGTGACC 
GGCTTCGGCAGACAGAAAACCGCGCCACGCGCTGCAAGGTGGAACGGCTGCAGCTGCT 
TCTGCAGCAGAAACGGCTCCGTAGAAAGGCCCGGCGGGACGCGCGGGGTCCGTACCAC 
TGGTCACCCAGCCGCAAGGCCGGCCGCAGCGACAGCAGTAGCAGCGGGGGCGGCGGCA 
GCCCCAGCGAGGCCTCCGGCTTGGGCCTCGACTTCGAGGACTCCGTGTGGAAGCCAGA 


"j " :tcg cccci :cc 


AGGGCGGCGGGGP r ^CCCGGAGAGAAC 




ORFStart:ATGatl3 


ORF Stop: TAG at 1546 




SEQ ID NO: 220 


511 aa MW at 53949.3kD 


NOV77a, 

CG5 963 0-01 Protein Sequence 


MEPQPGGARSCRRGAPGGACELGPAAEAAPMSLAIHSTTGTRYDLAVPPDETVEGLRK 
RLSQRLKVPKERLALLHKDTRLSSGKLQEFGVGDGSKLTLVPTVEAGLMSQASRPEQS 
VMQALESLTETQPPAAPGPGRAGGGGFRKYRFILFKRPWHRQGPQSPERGGERPQVSD 
FLSGRSPLTLALRVGDHMMFVQLQLAAQHAPLQHRHVLAAAAAAAAARGDPSIASPVS 
SPCRPVSSAARVPPVPTSPSPASPSPITAGSFRSHAASTTCPEQMDCSPTASSSASPG 
ASTTSTPGASPAPRSRKPGAVIESFVNHAPGVFSGTFSGTLHPNCQDSSGRPRRDIGT 
ILQILNDLLSATRHYQGMPPSLAQLRCHAQCSPASPAPDLAPRTTSCEKLTAAPSASL 
LQGQSQIRMCKPPGDRLRQTENRATRCKVERLQLLLQQKRLRRKARRDARGPYHWSPS 
i MPDIKSE 



Further analysis of the NOV77a protein yielded the following properties shown in 
Table 77B. 



Table 77B. Protein Sequence Properties NOV77a 


PSort 
analysis: 


0.3000 probability located in microbody (peroxisome); 0.3000 probability 
located in nucleus; 0.1526 probability located in lysosome (lumen); 0.1000 
probability located in mitochondrial matrix space 


SignalP 
analysis: 


No Known Signal Sequence Predicted 



A search of the NOV77a protein against the Geneseq database, a proprietary 
database that contains sequences published in patents and patent publication, yielded 
several homologous proteins shown in Table 77C. 
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Table 77C. Geneseq Results for NOV77a 


Geneseq 
Identifier 


Protein/Organism/Length [Patent 
#, Date] 


NOV77a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Region 


Expect 
Value 


AAB56832 


Human prostate cancer antigen 
protein sequence SEQ ID NO: 1410 - 
Homo sapiens, 236 aa. 
[WO200055174-A1, 21-SEP-2000] 


267..493 
1..227 


189/227(83%) 
195/227 (85%) 


e-104 



In a BLAST search of public sequence databases, the NOV77a protein was found to 
have homology to the proteins shown in the BLASTP data in Table 77D. 



Table 77D. Public BLASTP Results for NOV77a 


Protein 
Accession 
Number 


Protein/Organism/Length 


NOV77a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Portion 


Expect 
Value 


Q9JJJ6 


MIDNOLIN - Mus musculus 
(Mouse), 508 aa. 


1..51 1 
1..508 


475/514(92%) 
486/514(94%) 


0.0 


Q96BW8 


SIMILAR TO MIDNOLIN - 
Homo sapiens (Human), 1 77 aa 
(fragment). 


338..511 
4.. 177 


174/174(100%) 
174/174(100%) 


2e-97 


Q9W2S4 


CG9732 PROTEIN - Drosophila 
melanogaster (Fruit fly), 989 aa. 


213..363 
524..677 


58/155 (37%) 
80/155 (51%) 


6e-18 


AAL40834 


BPLF 1 - Human herpesvirus 4 
(Epstein-Barr virus), 3179 aa. 


200..406 
320..530 


64/223 (28%) 
95/223 (41%) 


2e-07 


Q9BKV7 


PPG3 - Leishmania major, 1325 
aa. 


213. .328 
984..1104 


37/121 (30%) 
66/121 (53%) 


2e-06 



PFam analysis predicts that the NOV77a protein contains the domains shown in the 
Table 77E. 
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Table 77E. Domain Analysis of NOV77a 


Pfam Domain 


NOV77a Match 
Region 


Identities/ 
Similarities 
for the Matched 
Region 


Expect 
Value 


ubiquitin: domain 1 of 1 


3 I. .99 


19/79 (24%) 
46/79 (58%) 


0.00033 


PI3 PI4 kinase: domain 1 
of 1 


411. .427 


7/18(39%) 
14/18(78%) 


1.5 



Example 78. 

The NOV78 clone was analyzed, and the nucleotide and predicted polypeptide 
sequences are shown in Table 78A. 



Table 78A. NO\ 78 Sequence Analysis 




SEQIDNO:221 1 1034 bp 


NOV78a, 

CG59561-01 DNA Sequence 


----- - ATGT T - . - - , -T - 

GATCTGCCGGATTATGCGGGACGCTAATGTGGCCCGCAATGTCTACGGCGGGACCATC 
CTGAAGATGATCAAAGAGGCGGGCGCCATCATCAGCACCCGGCATTGCAATCCGCAGA 
ACGGGGATCGCTGTGTGGCCGCTCTGGCTCGGGTCGAGTGCACCCACTTCCTGTGGCC 

TCTGTGGAGGTGCAGGTCAACATGATGTCCGAAAACATCCTCACAGGTGCCAAAAAGC 
TGACCAATAAGGCCACCCTCTGGTATGCGCCCCTGTCGCTGACGAACGTGGACAAGGT 
CCTCGAAGAGCCTCCTGTTGTGTATTTCCGGCAGGAGCAGGAGGAGGAGGGCCAGAAG 
CGGTACAAAACCCAGAAGCTGGAGCGCATGGAGACCAACTGGAGGAACGGGGACATCG 
TCCAGCCAGTCCTCAACCCAGAGCCGAACACTGTCAGCTACAGCCAGTCCAGCTTGAT 
CCACCTGGTGGGGCCTTCAGACTGTACCCTGCACAGCTTCGTGCATGAAGGGGTGACC 
ATGAAGGTCATGGACGAGGTCGCCGGGATCTTGGCTGCACGCCACTGCAAGACCAACC 
TCGTCACAGCCTCCATGGAGGCCATTAATTTTGACAACAAGATCAGAAAAGGCTGCAT 
CAAGACCATCTCCGGACGCATGACCTTCACGAGCAATAAGTCCGTAGAGATCGAGGTC 
TTGGTGGATGCCGACTGTGTTGTGGACAGCTCTCAGAAGCGCTACAGGGCCGCCAGTG 
TCTTCACCTATGTGTCGCTGAGCCAGGAAGGCAGGTCGCTGCCCATGCCCCAGCTCGT 
GCCGGAGACCCAGGACGAGAAGGGCTTTGAGGCCTGGCTCGGTGGCTCACGCCTATAA 
1 




ORE Start: ATG at 21 [ORE Stop: TAA at 984 




SEQ ID NO: 222 T 321 aa MW at 35738. 7kD 


NOV78a, 

CG59561-01 Protein Sequence 


MSGPDIKTPTAIQICRIMRDANVARNVYGGTILKMIKEAGAIISTRHCNPQNGDRCVA 
ALARVECTHFLWPMCIGEVAHVSAEITYTSKHSVEVQVNMMSENILTGAKKLTNKATL 
WYAPLSLTNVDKVLEEPPWYFRQEQEEEGQKRYKTQKLERMETNWRNGDIVQPVLNP 
EPNTVSYSQSSLIHLVGPSDCTLHSFVHEGVTMKVMDEVAGILAARHCKTNLVTASME 
AINFDNKIRKGCIKTISGRMTFTSNKSVEIEVLVDADCWDSSQKRYRAASVFTYVSL 
SQEGRSLPM r _, E l EAULGGSRL 



Further analysis of the NOV78a protein yielded the following properties shown in 
Table 78B. 
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Table 78B, Protein Sequence Properties NOV78a 


PSort 
analysis: 


0.8000 probability located in microbody (peroxisome); 0.1000 probability 
located in mitochondrial matrix space; 0.1000 probability located in lysosome 
(lumen); 0.0000 probability located in endoplasmic reticulum (membrane) 


SignalP 
analysis: 


No Known Signal Sequence Predicted 



A search of the NOV78a protein against the Geneseq database, a proprietary 
database that contains sequences published in patents and patent publication, yielded 
several homologous proteins shown in Table 78C. 



Table 78C. Geneseq Results for NOV78a 


Geneseq 


Protein/Organism/Length [Patent 
#, Date] 


NOV78a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for 

Region 


Expect 


AAW74896 


Human secreted protein encoded by 
gene 169 clone HPTTU1 1 - Homo 
sapiens, 339 aa. [W09839448-A2, 
11 -SEP- 1998] 


1..310 
1..313 


273/313 (87%) 
292/313 (93%) 


e-154 


I AAY71115 


Human Hydrolase protein-13 
(HYDRL-13) - Homo sapiens, 375 
aa. [WO200028045-A2, 18-MAY- 
2000] 


1..310 
33..316 


247/313 (78%) 
266/313 (84%) 


e-133 


AAY35275 


Chlamydia pneumoniae 
transmembrane protein sequence - 
Chlamydia pneumoniae, 155 aa. 
[WO9927105-A2, 03-JUN-1999] 


187..310 
16..138 


35/124 (28%) 
72/124 (57%) 


le-09 


AAG92590 


C glutamicum protein fragment 
SEQ ID NO: 6344 - 
Corynebacterium glutamicum, 339 
aa. [EP1 1 08790- \2. 20-JUN-2001] 


24..309 
35. .307 


69/296 (23%) 
112/296 (37%) 


7e-08 

. J 


AAB76624 


Corynebacterium glutamicum MCT 
protein SEQ ID NO:230 - 
Corynebacterium glutamicum, 339 
aa. [WO200100805-A2, 04-JAN- 
2001] 


24..309 
35..307 


69/296 (23%) 
112/296 (37%) 


7e-08 1 



In a BLAST search of public sequence databases, the NOV78a protein was found to 
have homology to the proteins shown in the BLASTP data in Table 78D. 
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Table 7SD. Public BLASTP Results for NOV78a 


Protein 
Accession 
Number 


Protein/Organism/Length 


NOV78a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Portion 


Expect 
Value 


000154 


Cytosolic acyl coenzyme A thioester 
hydrolase (EC 3.1.2.2) (Long chain 
acyl-CoA thioester hydrolase) (CTE- 
II) (Brain acyl-CoA hydrolase) 
(BACH) - Homo sapiens (Human), 
338 aa. 


1..310 
1..313 


274/313 (87%) 
293/313 (93%) 


e-154 


Q91V12 


ACYL-COA HYDROLASE 
(HYPOTHETICAL 37.6 KDA 
PROTEIN) - Mus musculus (Mouse), 
338 aa. 


1..310 
1..313 


265/313 (84%) 
287/313 (91%) 


e-150 


Q64559 


Cytosolic acyl coenzyme A thioester 
hydrolase (EC 3.1.2.2) (Long chain 
acyl-CoA thioester hydrolase) (CTE- 
II) (Brain acyl-CoA hydrolase) 
(BACH) (ACT) (LACH1) (ACH1) - 
Rattus norvegicus (Rat), 338 aa. 


1..310 
1..313 


263/313 (84%) 
286/313 (91%) 


e-149 


JC5416 


palmitoyl-CoA hydrolase (EC 
3.1.2.2), hepatic - rat, 343 aa. 


12..310 
17..318 


251/302 (83%) 
276/302 (91%) 


e-142 


Q9Y541 


DJ202O8.3.1 (HBACH (BRAIN 
ACYL-COA HYDROLASE (ACYL 
COENZYME A THIOESTER 
HYDROLASE, EC 3.1.2.2)) 
(ISOFORM 1)) - Homo sapiens 
(Human), 237 aa (fragment). 


1..202 
33..236 


181/204 (88%) 
190/204 (92%) 


e-100 



PFam analysis predicts that the NOV78a protein contains the domains shown in the 
Table 78E. 



Table 78E. Domain Analysis of NOV78a 


Pfam Domain 


NOV78a Match 
Region 


Identities/ 
Similarities 
for the Matched 
Region 


Expect 
Value 


Acyl-CoA hydro: domain 1 
of 1 


165..305 


46/147 (31%) 
131/147(89%) 


1 le-47 



Example 79. 

The NOV79 clone was analyzed, and the nucleotide and predicted polypeptide 
sequences are shown in Table 79A. 
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Table 79A. NOV79 Sequence Analysis 




SEQ ID NO: 223 |4203 bp 


NOV79a, 

CG59452-01 DNA Sequence 


AATGTGATGGGATCACTAGCATGTCTGCGGAGAGCGGCCCTGGGACGAGATTGAGAAA 
TCTGCCAGTAATGGGGGATGGACTAGAAACTTCCCAAATGTCTACAACACAGGCCCAG 
GCCCAACCCCAGCCAGCCAACGCAGCCAGCACCAACCCCCCGCCCCCAGAGACCTCCA 
ACCCTAACAAGCCCAAGAGGCAGACCAACCAACTGCAATACCTGCTCAGAGTGGTGCT 
CAAGACACTATGGAAACACCAGTTTGCATGGCCTTTCCAGCAGCCTGTGGATGCCGTC 
AAGCTGAACCTCCCTGATTACTATAAGATCATTAAAACGCCTATGGATATGGGAACAA 
TAAAGAAGCGCTTGGAAAACAACTATTACTGGAATGCTCAGGAATGTATCCAGGACTT 
CAACACTATGTTTACAAATTGTTACATCTACAACAAGCCTGGAGATGACATAGTCTTA 
ATGGCAGAAGCTCTGGAAAAGCTCTTCTTGCAAAAAATAAATGAGCTACCCACAGAAG 
AAACCGAGATCATGATAGTCCAGGCAAAAGGAAGAGGACGTGGGAGGAAAGAAACAGG 
TACAGCAAAACCTGGCGTTTCCACGGTACCAAACACAACTCAAGCATCGACTCCTCCG 
CAGACCCAGACCCCTCAGCCGAATCCTCCTCCTGTGCAGGCCACGCCTCACCCCTTCC 
CTGCCGTCACCCCGGACCTCATCGTCCAGACCCCTGTCATGACAGTGGTGCCTCCCCA 
GCCACTGCAGACGCCCCCGCCAGTGCCCCCCCAGCCACAACCCCCACCCGCTCCAGCT 
CCCCAGCCCGTACAGAGCCACCCACCCATCATCGCGGCCACCCCACAGCCTGTGAAGA 
CAAAGAAGGGAGTGAAGAGGAAAGCAGACACCACCACCCCCACCACCATTGACCCCAT 
TCACGAGCCACCCTCGCTGCCCCCGGAGCCCAAGACCACCAAGCTGGGCCAGCGGCGG 
GAGAGCAGCCGGCCTGTGAAACCTCCAAAGAAGGACGTGCCCGACTCTCAGCAGCACC 
CAGCACCAGAGAAGAGCAGCAAGGTCTCGGAGCAGCTCAAGTGCTGCAGCGGCATCCT 
CAAGGAGATGTTTGCCAAGAAGCACGCCGCCTACGCCTGGCCCTTCTACAAGCCTGTG 
GACGTGGAGGCACTGGGCCTACACGACTACTGTGACATCATCAAGCACCCCATGGACA 
TGAGCACAATCAAGTCTAAACTGGAGGCCCGTGAGTACCGTGATGCTCAGGAGTTTGG 
TGCTGACGTCCGATTGATGTTCTCCAACTGCTATAAGTACAACCCTCCTGACCATGAG 
GTGGTGGCCATGGCCCGCAAGCTCCAGGATGTGTTCGAAATGCGCTTTGCCAAGATGC 
CGGACGAGCCTGAGGAGCCAGTGGTGGCCGTGTCCTCCCCGGCAGTGCCCCCTCCCAC 
CAAGGTTGTGGCCCCGCCCTCATCCAGCGACAGCAGCAGCGATAGCTCCTCGGACAGT 
GACAGTTCGACTGATGACTCTGAGGAGGAGCGAGCCCAGCGGCTGGCTGAGCTCCAGG 
AGCAGCTCAAAGCCGTGCACGAGCAGCTTGCAGCCCTCTCTCAGCCCCAGCAGAACAA 
AC C AAAG AAAAAG GAG AAAG AC AAG AAG G AAAAGAAAAAAG AAAAG C ACAAAAGG AAA 
GAGGAAGTGGAAGAGAATAAAAAAAGCAAAGCCAAGGAACCTCCTCCTAAAAAGACGA 
AGAAAAATAATAGCAGCAACAGCAATGTGAGCAAGAAGGAGCCAGCGCCCATGAAGAG 
CAAGCCCCCTCCCACGTATGAGTCGGAGGAAGAGGACAAGTGCAAGCCTATGTCCTAT 
GAGGAGAAGCGGCAGCTCAGCTTGGACATCAACAAGCTCCCCGGCGAGAAGCTGGGCC 
GCGTGGTGCACATCATCCAGTCACGGGAGCCCTCCCTGAAGAATTCCAACCCCGACGA 
GATTGAAATCGACTTTGAGACCCTGAAGCCGTCCACACTGCGTGAGCTGGAGCGCTAT 
GTCACCTCCTGTTTGCGGAAGAAAAGGAAACCTCAAGCTGAGAAAGTTGATGTGATTG 
CCGGCTCCTCCAAGATGAAGGGCTTCTCGTCCTCAGAGTCGGAGAGCTCCAGTGAGTC 
CAGCTCCTCTGACAGCGAAGACTCCGAAACAGAGATGGCTCCGAAGTCAAAAAAGAAG 
GGGCACCCCGGGAGGGAGCAGAAGCAGCACCATCATCACCACCATCAGCAGATGCAGC 
AGGCCCCGGCTCCTGTGCCCCAGCAGCCGCCCCCGCCTCCCCAGCAGCCCCCACCGCC 
TCCACCTCCGCAGCAGCAACAGCAGCCGCCACCCCCGCCTCCCCCACCCTCCATGCCG 
CAGCAGGCAGCCCCGGCGATGAAGTCCTCGCCCCCACCCTTCATTGCCACCCAGGTGC 
CCGTCCTGGAGCCCCAGCTCCCAGGCAGCGTCTTTGACCCCATCGGCCACTTCACCCA 
GCCCATCCTGCACCTGCCGCAGCCTGAGCTGCCCCCTCACCTGCCCCAGCCGCCTGAG 
CACAGCACTCCACCCCATCTCAACCAGCACGCAGTGGTCTCTCCTCCAGCTTTGCACA 
ACGCACTACCCCAGCAGCCATCACGGCCCAGCAACCGAGCCGCTGCCCTGCCTCCCAA 
GCCCGCCCGGCCCCCAGCCGTGTCACCAGCCTTGACCCAAACACCCCTGCTCCCACAG 
CCCCCCATGGCCCAACCCCCCCAAGTGCTGCTGGAGGATGAAGAGCCACCTGCCCCAC 
CCCTCACCTCCATGCAGATGCAGCTGTACCTGCAGCAGCTGCAGAAGGTGCAGCCCCC 
TACGCCGCTACTCCCTTCCGTGAAGGTGCAGTCCCAGCCCCCACCCCCCCTGCCGCCC 
CCACCCCACCCCTCTGTGCAGCAGCAGCTGCAGCAGCAGCCGCCACCACCCCCACCAC 
CCCAGCCCCAGCCTCCACCCCAGCAGCAGCATCAGCCCCCTCCACGGCCCGTGCACTT 
GCAGCCCATGCAGTTTTCCACCCACATCCAACAGCCCCCGCCACCCCAGGGCCAGCAG 
CCCCCCCATCCGCCCCCAGGCCAGCAGCCACCCCCGCCGCAGCCTGCCAAGCCTCAGC 
AAGTCATCCAGCACCACCATTCACCCCGGCACCACAAGTCGGACCCCTACTCAACCGG 
TCACCTCCGCGAAGCCCCCTCCCCGCTTATGATACATTCCCCCCAGATGTCACAGTTC 
CAGAGCCTGACCCACCAGTCTCCACCCCAGCAAAACGTCCAGCCTAAGAAACAGGTAA 
CTGGCAGGGCTGGGCCAAGTCCTGTGGGCCAGGGCCGGGGGTGCCTGCCCACCTCACC 
GGCCGCTGTGCCTGTGCCATCCCAGGAGCTGCGTGCTGCCTCCGTGGTCCAGCCCCAG 

TTATGTTCCAGGGCCGGAAATGAAGCCTGTGGATGTCGGGAGGCCTGTGATCCGGCCC 
CCAGAGCAGAACGCACCGCCACCAGGGGCCCCTGACAAGGACAAACAGAAACAGGAGC 
CGAAGACTCCAGTTGCGCCCAAAAAGGACCTGAAAATCAAGAACATGGGCTCCTGGGC 
CAGCCTAGTGCAGAAGCATCCGACCACCCCCTCCTCCACAGCCAAGTCATCCAGCGAC 
AGCTTCGAGCAGTTCCGCCGCGCCGCTCGGGAGAAAGAGGAGCGTGAGAAGGCCCTGA 
AGGCTCAGGCCGAGCACGCTGAGAAGGAGAAGGAGCGGCTGCGGCAGGAGCGCATGAG 
GAGCCGAGAGGACGAGGATGCGCTGGAGCAGGCCCGGCGGGCCCATGAGGAGGCACGT 
CGGCGCCAGGAGCAGCAGCAGCAGCAGCGCCAGGAGCAACAGCAGCAGCAGCAACAGC 
AAGCAGCTGCGGTGGCTGCCGCCGCCACCCCACAGGCCCAGAGCTCCCAGCCCCAGTC 
CATGCTGGACCAGCAGAGGGAGTTGGCCCGGAAGCGGGAGCAGGAGCGAAGACGCCGG 
GAAGCCATGGCAGCTACCATTGACATGAATTTCCAGAGTGATCTATTGTCAATATTTG 
AGAAAM I T " 
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1 ORF Start: ATG at j ORF Stop: TGA at 4 1 9 1 

[21 1 




SEQ ID NO: 224 1 1 390 aa | MW at 1 54728.4kD 


NOV79a, 

CG59452-01 Protein 
Sequence 


L LE^SCMSTTQAQAQPQPANAASTHPPPPETSNPNKPKR 
QTNQLQYLLRWLKTLWKHQFAWPFQQPVDAVKLNLPDYYKIIKTPMDMGTIKKRLEN 
NYYWNAQECIQDFNTMFTNCYIYNKPGDDIVLMAEALEKLFLQKINELPTEETEIMIV 
QAKGRGRGRKETGTAKPGVSTOPNTTQASTPPQTQTPQPNPPPVQATPHPFPAVTPDL 
IVQTPVMTWPPQPLQTPPPVPPQPQPPPAPAPQPVQSHPPIIAATPQPVKTKKGVKR 
KADTTTPTTIDPIHEPPSLPPEPKTTKLGQRRESSRPVKPPKKDVPDSQQHPAPEKSS 
KVSEQLKCCSG I LKEMFAKKHAAYAWPF YKPVDVEALGLHDYCDI I KHPMDMSTI KSK 
LEAREYRDAQEFGADVRLMFSNCYKYNPPDHEWAMARKLQDVFEMRFAKMPDEPEEP 
WAVSSPAVPPPTKWAPPSSSDSSSDSSSDSDSSTDDSEEERAQRLAELQEQLKAVH 
EQLAALSQPQQNKPKKKEKDKKEKKKEKHKRKEEVEENKKSKAKEPPPKKTKKNNSSN 
SNVSKKEPAPMKSKPPPTYESEEEDKCKPMSYEEKRQLSLDINKLPGEKLGRWHIIQ 
SREPSLKNSNPDEIEIDFETLKPSTLRELERYVTSCLRKKRKPQAEKVDVIAGSSKMK 
GFSSSESESSSESSSSDSEDSETEMAPKSKKKGHPGREQKQHHHHHHQQMQQAPAPVP 
QQPPPPPQQPPPPPPPQQQQQPPPPPPPPSMPQQAAPAMKSSPPPFIATQVPVLEPQL 
PGSVFDPIGHFTQPILHLPQPELPPHLPQPPEHSTPPHLNQHAWSPPALHNALPQQP 
SRPSNRAAALPPKPARPPAVSPALTQTPLLPQPPMAQPPQVLLEDEEPPAPPLTSMQM 
QLYLQQLQKVQPPTPLLPSVKVQSQPPPPLPPPPHPSVQQQLQQQPPPPPPPQPQPPP 
QQQHQPPPRPVHLQPMQFSTHIQQPPPPQGQQPPHPPPGQQPPPPQPAKPQQVIQHHH 
SPRHHKSDPYSTGHLREAPSPLMIHSPQMSQFQSLTHQSPPQQNVQPKKQVTGRAGPS 
PVGQGRGCLPTSPAAVPVPSQELRAASWQPQPLVWKEEKIHSPIIRSEPFSPSLRP 
EPPKHPESIKAPVYVPGPEMKPVDVGRPVIRPPEQNAPPPGAPDKDKQKQEPKTPVAP 
KKDLKIKNMGSWASLVQKHPTTPSSTAKSSSDSFEQFRRAAREKEEREKALKAQAEHA 
EKEKERLRQERMRSREDEDALEQARRAHEEARRRQEQQQQQRQEQQQQQQQQAAAVAA 
AATPQAQSSQPQSMLDQQRELARKREQERRRREAT4AATIDMNFQSDLLSIFEENLF 



Further analysis of the NOV79a protein yielded the following properties shown in 
Table 79B. 



Table 79B. Protein Sequence Properties NOV79a 


PSort 

analysis: 


0.9800 probability located in nucleus; 0.3000 probability located in 
microbody (peroxisome); 0.1000 probability located in mitochondrial matrix 
space; 0.1000 probability located in lysosome (lumen) 


SignalP 
analysis: 


No Known Signal Sequence Predicted 



A search of the NOV79a protein against the Geneseq database, a proprietary 
database that contains sequences published in patents and patent publication, yielded 
several homologous proteins shown in Table 79C. 



Table 79C. Geneseq Results for NOV79a 



Geneseq 
Identifier 


Protein/Organism/Length 
[Patent #, Date] 


NOV79a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Region 


Expect 
Value 


AAY57898 


Human transmembrane protein 
HTMPN-22 - Homo sapiens, 688 
aa. [W09961471-A2, 02-DEC- 
1999] 


1..667 
1..667 


667/667 (100%) 
667/667(100%) 


0.0 
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AAY07027 


Breast cancer associated antigen 
precursor sequence - Homo 
sapiens, 754 aa. [WO9904265-A2, 
28-JAN-1999] 


44.. 724 
13..708 


407/732 (55%) 
487/732 (65%) 


0.0 


AAY07114 


WO9904265 Seq ID No: 685 - 
Homo sapiens, 947 aa. 
[WO9904265-A2, 28-JAN-1999] 


35..73S 
4..686 


357/761 (46%) 
444/761 (57%) 


e-170 


AAW81168 


Transcriptional regulatory factor 
RTNG3 - Homo sapiens, 947 aa. 
[WO9848015-A1, 29-OCT-1998] 


35..7S8 
4..686 


357/761 (46%) 
444/761 (57%) 


e-170 


AAU16206 


Human novel secreted protein, Seq 
ID 1 159 - Homo sapiens, 235 aa. 
[WO200155322-A2, 02-AUG- 
2001] 


51. .255 
1..203 


118/206 (57%) 
137/206 (66%) 


2e-59 


In a BLAST search of public sequence databases, the NOV79a protein was found to 
have homology to the proteins shown in the BLASTP data in Table 79D. 


Table 79D. Public BLASTP Results for iN(>\ 7<> : , 


Protein 
Accession 
Number 


Protein/Organism/Length 


NOV79a ! 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Portion 


Expect 
Value 


060885 


Bromodomain-containing protein 
4 (HUNK1 protein) - Homo 
sapiens (Human), 1362 aa. 


1..1390 
1..1362 


1357 1391 (97%) 
1360/1391 (97%) 


0.0 


Q9ESU6 


CELL PR( H 11 1 RATK >N 
RELATED PROTEIN CAP - 
Mus musculus (Mouse), 1400 aa. 


1-1390 
1..1400 


1318/1400 (94%) 
1338/1400 (95%) 


0.0 


AAL67833 


BROMODOMAIN- 
CONTAINING PROTEIN BRD4 
LONG VARIANT - Mus 
musculus (Mouse), 1400 aa. 


1-1390 
1..1400 


1318/1400 (94%) 
1338/1400 (95%) 


0.0 


060433 


R3 1 546_1 - Homo sapiens 
(Human), 73 1 aa (fragment). 


1..719 
12. .730 


719/719(100%) 
719/719(100%) 


0.0 


AAL67834 


BROMODOMAIN- 
CONTAINING PROTEIN BRD4 
SHORT VARIANT - Mus 
musculus (Mouse), 723 aa. 


1..719 
1..720 


694/720 (96%) | 
700/720 (96%) 


0.0 



PFam analysis predicts that the NOV79a protein contains the domains shown in the 
Table 79E. 
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Table 79E. Domain Analysis of NOV79a 


Pfam Domain 


NOV79a Match 
Region 


Identities/ 
Similarities 
for the Matched 
Region 


Expect 
Value 


bromodomain: domain 1 
of 2 


63..152 


42/92 (46%) 
82/92 (89%) 


8.6e-45 


bromodomain: domain 2 
of 2 


356..445 


40/92 (43%) 
81/92 (88%) 


3e-40 



Example 80. 



The NOV80 clone was analyzed, and the nucleotide and predicted polypeptide 
sequences are shown in Table 80A. 



Table 80A. NOV80 Sequence Analysis 



SEQ ID NO: 225 



(1776 bp 



NOV80a, 

CG59572-01 DNA Sequence 



TGGTTCGTTTATTCCTGGGGTTGTCATATC ATGGCTTATAATGACACAGACAGAAACC 



AGACTGAGAAGCTCCTAAAAAGAGTACGAGAACTGGAGCAAGAGGTGCAAAGACTTAA 
AAAGGAACAGGCCAAAAATAAGGAGGACTCAAACATTAGAGAAAATTCAGCAGGAGCT 
GGAAAAACTAAGCGTGCATTTGATTTCAGTGCTCATGGCCGAAGACACGTAGCCCTAA 
GAATAGCCTATATGGGCTGGGGATACCAGGGCTTTGCTAGTCAGGAAAACACAAATAA 
TACCATTGAAGAGAAACTGTTTGAAGCTCTAACCAAGACTCGACTAGTAGAAAGCAGA 



AGGTGATCTCACTTGACCTTCGCTCTCAGTTTCCAAGGGGCAGGGATTCCGAGGACTT 
TAATGTAAAAGAGGAGGCTAATGCTGCTGCTGAAGAGATCCGTTATACCCACATTCTC 
AATCGGGTACTCCCTCCAGACATCCGTATATTGGCCTGGGCCCCTGTAGAACCAAGCT 
TCAGTGCTAGGTTCAGCTGCCTTGAGCGGACTTACCGCTATTTTTTCCCTCGTGCTGA 
TTTAGATATTGTAACCATGGATTATGCAGCTCAGAAGTATGTTGGCACCCATGATTTC 
AGGAACTTGTGTAAAATGGATGTAGCCAACGGTGTGATTAATTTTCAGAGGACTATTC 
TATCTGCTCAAGTACAGCTAGTGGGCCAGAGCCCAGGTGAGGGGAGATGGCAAGAACC 
TTTCCAGTTATGTCAGTTTGAAGTGACTGGCCAGGCATTCCTTTATCATCAAGTCCGA 
TGTATGATGGCTATCCTCTTTCTGATTGGCCAAGGAATGGAGAAGCCAGAGATTATTG 
ATGAGCTGCTGAATATAGAGAAAAATCCCCAAAAGCCTCAATATAGTATGGCTGTAGA 
ATTTCCTCTAGTCTTATATGACTGTAAGTTTGAAAATGTCAAGTGGATCTATGACCAG 
GAGGCTCAGGAGTTCAATATTACCCACCTACAACAACTGTGGGCTAATCATGCTGTCA 



aataggaccaaagatggatggaatgacagaatggggaaatgttaagccctctgtcata 
aagcagaccagtgcctttgtagaaggagtgaagatgcgcacatataagcccctcatgg 
accgtcctaaatgccaaggactggaatcccggatccagcattttgtacgtaggggacg 
aattgagcacccacatttattccatgaggaagaaacaaaagccaaaagggactgtaat 
gacacactagaggaagagaatactaatttggagacaccaacgaagagggtctgtgttg 
acacagaaattaaaagcatcatttaa ccatagacaatttgccaggatctaggaaccac 
ctaatggtaggtggacagaaaaggaaaaaaaaaaaaatttacttgcaagtactaggaa 



TTCAGATGATCAGCTCTTAAAAAAAAAAAAAAAGCAAAAAGACTAAAGCCCTATTAAG 
GAAGTTATTGCTTTAATAAGAAATTTCAAATATTCTCTTATCCCGGTCCAAAAGGATT 
AAGCGATTAAAGAACGTAAAATGGi 



rGTAT? AAATTCA I 1 J 'GCAAG AA 



ORF Start: ATG at 31 jORF Stop: TAA at 1474 
SEQ ID NO: 226 (481 aa |m\V at 55646.8kD~ 



NOV80a, 

CG59572-01 Protein Sequence 



MAYNDTDRNQTEKLLKRVRELEQEVQRLKKEQAKNKEDSNIRENSAGAGKTKRAFDFS 
AHGRRHVALR I AYMGWG YQGFASQENTNNTI EEKLFEALTKTRLVE SRQTSNYHRCGR 
TDKGVSAFGQVISLDLRSQFPRGRDSEDFNVKEEANAAAEEIRYTHILNRVLPPDIRI 
LAWAPVEPSFSARFSCLERTYRYFFPRADLDIVTMDYAAQKYVGTHDFRNLCKMDVAH 
GVINFQRTILSAQVQLVGQSPGEGRWQEPFQLCQFEVTGQAFLYHQVRCMMAILFLIG 
QGMEKPEIIDELLNIEKNPQKPQYSMAVEFPLVLYDCKFENVKWIYDQEAQEFNITHL 
QQLWANHAVKTHMLYSMLQGLDTVPVPCGIGPKMDGMTEWGNVKPSVIKQTSAFVEGV 
KMRTYKPLMDRPKCQGLESRIQHFVRRGRIEHPHLFHEEETKAKRDCNDTLEEENTNL 
ETPTKRVCVDTEI KSI I 



SEQ ID NO: 227 



|1508 bp 



NOV80b, 

CG59572-02 DNA Sequence 


GAACTGGAGCAAGAGGTGCAAAGACTTAAAAAGGAACAGGCCAAAAATAAGGAGGACT 
CAAACATTAGAGAAAATTCAGCAGGAGCTGGAAAAACTAAGCGTGCATTTGATTTCAG 
TGCTCATGGCCGAAGACACGTAGCCCTAAGAATAGCCTATATGGGCTGGGGATACCAG 
GGCTTTGCTAGTCAGGAAAACACAAATAATACCATTGAAGAGAAACTGTTTGAAGCTC 
TAACCAAGACTCGACTAGTAGAAAGCAGACAGACATCCAACTATCACCGATGTGGGAG 
AACAGATAAAGGAGTTAGTGCCTTTGGACAGGTGATCTCACTTGACCTTCGCTCTCAG 
TTTCCAAGGGGCAGGGATTCCGAGGACTTTAATGTAAAAGAGGAGGCTAATGCTGCTG 
CTGAAGAGATCCGTTATACCCACATTCTCAATCGGGTACTCCCTCCAGACATCCGTAT 
ATTGGCCTGGGCCCCTGTAGAACCAAGCTTCAGTGCTAGGTTCAGCTGCCTTGAGCGG 
ACTTACCGCTATTTTTTCCCTCGTGCTGATTTAGATATTGTAACCATGGATTATGCAG 
CTCAGAAGTATGTTGGCACCCATGATTTCAGGAACTTGTGTAAAATGGATGTAGCCAA 
CGGTGTGATTAATTTTCAGAGGACTATTCTATCTGCTCAAGTACAGCTAGTGGGCCAG 
AGCCCAGGTGAGGGGAGATGGCAAGAACCTTTCCAGTTATGTCAGTTTGAAGTGACTG 
GCCAGGCATTCCTTTATCATCAAGTCCGATGTATGATGGCTATCCTCTTTCTGATTGG 
CCAAGGAATGGAGAAGCCAGAGATTATTGATGAGCTGCTGAATATAGAGAAAAATCCC 
CAAAAGCCTCAATATAGTATGGCTGTAGAATTTCCTCTAGTCTTATATGACTGTAAGT 
TTGAAAATGTCAAGTGGATCTATGACCAGGAGGCTCAGGAGTTCAATATTACCCACCT 
ACAACAACTGTGGGCTAATCATGCTGTCAAAACTCACATGTTGTATAGTATGCTACAA 
GGACTGGACACTGTTCCAGTACCCTGTGGAATAGGACCAAAGATGGATGGAATGACAG 
AATGGGGAAATGTTAAGCCCTCTGTCATAAAGCAGACCAGTGCCTTTGTAGAAGGAGT 
GAAGATGCGCACATATAAGCCCCTCATGGACCGTCCTAAATGCCAAGGACTGGAATCC 
CGGATCCAGCATTTTGTACGTAGGGGACGAATTGAGCACCCACATTTATTCCATGAGG 
AAGAAACAAAAGCCAAAAGGGACTGTAATGACACACTAGAGGAAGAGAATACTAATTT 
GGAGACACCAACGAAGAGGGTCTGTGTTGACACAGAAATTAAAAGTATCATTTAACCA 
TAGACAATTTGCCAGGATCTAGGAACCACCTAATGGTAGGTGGACAGAaAAGGAAAAA 




ORF Start: ATG at 2 


ORF Stop: TAA at 1445 




SEQ ID NO: 228 


481 aa jMW at 55646.8kD 


NOV80b, 

CG59572-02 Protein Sequence 


MAYNDTDRNQTEKLLKRVRELEQEVQRLKKEQAKNKEDSNIRENSAGAGKTKRAFDFS 
AHGRRHVALRIAYMGWGYQGFASQEKTNNTIEEKLFEALTKTRLVESRQTSNYHRCGR 
TDKGVSAFGQVI SL.DLRSQFPRGRDSEDFNVKEEANAAAEE I RYTH ILNRVLPPDI RI 
LAWAPVEPSFSARFSCLERTYRYFFPRADLDIVTMDYAAQKYVGTHDFRNLCKMDVAN 
GVINFQRTILSAQVQLVGQSPGEGRWQEPFQLCQFEVTGQAFLYHQVRCMMAILFLIG 
QGMEKPEIIDELLNIEKNPQKPQYSMAVEFPLVLYDCKFENVKWIYDQEAQEFNITHL 
QQLWANHAVKTHMLYSMLQGLDTVPVPCGIGPKMDGMTEWGNVKPSVIKQTSAFVEGV 
KMRTYKPLMDRPKCQGLESRIQHFVRRGRIEHPHLFHEEETKAKRDCNDTLEEENTNL 
T DTEIKSII 



Sequence comparison of the above protein sequences yields the following sequence 
relationships shown in Table 80B. 



Table 80B. Comparison of NOV80a against NO V80b. 


Protein Sequence 


NOV80a Residues/ 
Match Residues 


Identities/ 
Similarities for the Matched Region 


NOV80b 


1..481 
1..481 


459/481 (95%) 
459/481 (95%) 



Further analysis of the NOV80a protein yielded the following properties shown in 
Table 80C. 



Table 80C. Protein Sequence Properties NOV80a 


PSort 
analysis: 


0.6500 probability located in cytoplasm; 0.1000 probability located in 
mitochondrial matrix space; 0.1000 probability located in lysosome (lumen); 
0.0142 probability located in microbody (peroxisome) 


SignalP 
analysis: 


No Known Signal Sequence Predicted 
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A search of the NOV80a protein against the Geneseq database, a proprietary 
database that contains sequences published in patents and patent publication, yielded 
several homologous proteins shown in Table 80D. 





Table 80D. Geneseq Results for NOV80a 




Geneseq 
Identifier 


Protein/Organism/Length 
[Patent #, Date] 


NOV80a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Region 


, 

Expect 
Value 


AAM79457 


Human protein SEQ ID NO 3103 - 
Homo sapiens, 490 aa. 
[WO200157190-A2, 09-AUG- 
2001] 


1..481 
10..490 


478/481 (99%) 
480/481 (99%) 


0.0 


AAM78473 


Human protein SEQ ID NO 1 135 - 
Homo sapiens, 481 aa. 
[WO200157190-A2, 09-AUG- 
2001] 


1..481 
1..481 


- 

478/481 (99%) 
480/481 (99%) 


0.0 


AAG64907 


Human depressed growth rate 
protein DEG1 - Homo sapiens, 
248 aa. [CN 12960 14-A, 23-MAY- 
2001] 


209..431 
1..223 


223/223 (100%) 
223/223 (100%) 


e-132 


AAG02637 


Human secreted protein, SEQ ID 
NO: 6718 - Homo sapiens, 96 aa. 
[EP1033401-A2, 06-SEP-2000] 


361. .456 
1..96 


96/96 (100%) 
96/96 (100%) 


5e-53 


AAB96592 


Putative P. abyssi pseudourydilate 
synthase I - Pyrococcus abyssi, 
263 aa. [FR2792651-A1, 27-OCT- 
2000] 


65..367 
3..261 


79/305 (25%) 
140/305 (45%) 


4e-16 



In a BLAST search of public sequence databases, the NOV80a protein was found to 
have homology to the proteins shown in the BLASTP data in Table 80E. 
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Table 80E. Public BLASTP Results for NOV80a 


Protein 
Accession 
Number 


Protein/Organism/Length 


NOV80a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Portion 


Expect 
Value 


Q9BZE2 


FKSG32 - Homo sapiens (Human), 
481 aa. 


1..481 
1..481 


481/481 (100%) 
481/481 (100%) 


0.0 


Q96J23 



Q96NB4 


HYPOTHETICAL 55.6 KDA 
PROTEIN - Homo sapiens (Human), 
481 aa. 


1..481 
1..481 


478/481 (99%) 
480/481 (99%) 


0.0 


CDNA FLJ31 140 FIS, CLONE 
IMR322001218, HIGHLY 
SIMILAR TO MUS MUSCULUS 
PSEUDOURIDINE SYNTHASE 3 
(PUS3) MRNA - Homo sapiens 
(Human), 481 aa. 


1..481 
1..481 


478/481 (99%) 
479/481 (99%) 


0.0 


Q9JI38 


PSEUDOURIDINE SYNT1 IASE 3 - 
Mus musculus (Mouse), 481 aa. 


5..480 
4..480 


407 479 (84%) 
434/479 (89%) 


0.0 


Q9D0F7 


2610020J05RIK PROTEIN - Mus 
musculus (Mouse), 316 aa. 


5..314 
4..315 


276/312 (88%) 
291 512 (92%) 


e-158 



PFam analysis predicts that the NOV80a protein contains the domains shown in the 
Table 80F. 



Table 80F. Domain Analysis of NOVSOa 


Pfam Domain 


NOV80a Match 
Region 


Identities/ j 
Similarities Expect 
for the Matched j Value 
Region 


PseudoU synth 1 : domain 1 
of 1 


88. .307 


70/249 (28%) ] 4.7e-57 
176/249 (71%) | 



Example 81. 

The NOV81 clone was analyzed, and the nucleotide and predicted polypeptide 
sequences are shown in Table 81 A. 



Table S 1 A. NOV81 Sequence Analysis 




SEQ ID NO: 229 |3080 bp 


NOVSla, 

CG59522-01 DNA Sequence 


TTCCAGCCGGCAGGATGGAGGACGAGGAAGGCCCTGAGTATGGCAAACCTGACTTTGT 
GCTTTTGGACCAAGTGACCATGGAGGACTTCATGAGGAACCTGCAGCTCAGGTTCGAG 
AAGGGCCGCATCTACACCTACATCGGTGAGGTGCTGGTGTCCGTGAACCCCTACCAGG 
AGCTGCCCCTGTATGGGCCTGAGGCCATCGCCAGGTACCAGGGCCGTGAGCTCTATGA 
j [ , J It 1 , ^ T ; ( 
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TCCAGGGACACCTGCATCGTCATCTCAGrr - , . , , 

GTAAGCACATCATGCAGTACATCGCTGCTGTCACCAATCCAAGCCAGAGGGCTGAGGT 

GGAGAGGGTCAAGGACGTGCTGCTCAAGTCCACCTGTGTGCTGGAGGCCTTTGGCAAT 

GCCCGCACCAACCGCAATCACAACTCCAGCCGCTTTGGCAAGTACATGGACATCAACT 

TTGACTTCAAGGGGGACCCGATCGGAGGACACATCCACAGCTACCTACTGGAGAAGTC 

TCGGGTCCTCAAGCAGCACGTGGGTGAAAGAAACTTCCACGCCTTCTACCAATTGCTG 

AGAGGCAGTGAGGACAAGCAGCTGCATGAACTGCACTTGGAGAGAAACCCTGCTGTAT 

ACAATTTCACACACCAGGGAGCAGGACTCAACATGACTGTGAGTGATGAGCAGAGCCA 

CCAGGCAGTGACCGAGGCCATGAGGGTCATCGGCTTCAGTCCTGAAGAGGTGGAGTCT 

GTGCATCGCATCCTGGCTGCCATATTGCACCTGGGAAACATCGAGTTTGTGGAGACGG 

AGGAGGGTGGGCTGCAGAAGGAGGGCCTGGCAGTGGCCGAGGAGGCACTGGTGGACCA 

TGTGGCTGAGCTGACGGCCACACCCCGGGACCTCGTGCTCCGCTCCCTGCTGGCTCGC 

ACAGTTGCCTCGGGAGGCAGGGAACTCATAGAGAAGGGCCACACTGCAGCTGAGGCCA 

GCTATGCCCGGGATGCCTGTGCCAAGGCAGTGTACCAGCGGCTGTTTGAGTGGGTGGT 

GAACAGGATCAACAGTGTCATGGAACCCCGGGGCCGGGATCCTCGGCGTGATGGCAAG 

GACACAGTCATTGGCGTGCTGGACATCTATGGCTTCGAGGTGTTTCCCGTCAACAGTT 

TCGAGCAGTTCTGCATCAACTACTGCAACGAGAAGCTGCAGCAGCTATTCATCCAGCT 

CATCCTGAAGCAGGAACAGGAAGAGTACGAGCGCGAGGGCATCACCTGGCAGAGCGTT 

GAGTATTTCAACAACGCCACCATTGTGGATCTGGTGGAGCGGCCCCACCGTGGCATCC 

TGGCCGTGCTGGACGAGGCCTGCAGCTCTGCTGGCACCATCACTGACCGAATCTTCCT 

GCAGACCCTGGACATGCACCACCGCCATCACCTACACTACACCAGCCGCCAGCTCTGC 

CCCACAGACAAGACCATGGAGTTTGGCCGAGACTTCCGGATCAAGCACTATGCAGGGG 

ACGTCACGTACTCCGTGGAAGGCTTCATCGACAAGAACAGAGATTTCCTCTTCCAGGA 

CTTCAAGCGGCTGCTGTACAACAGCACGGACCCCACTCTACGGGCCATGTGGCCGGAC 

GGGCAGCAGGACATCACAGAGGTGACCAAGCGCCCCCTGACGGCTGGCACACTCTTCA 

AGAACTCCATGGTGGCCCTGGTGGAGAACCTTGCCTCCAAGGAGCCCTTCTACGTCCG 

CTGCATCAAGCCCAATGAGGACAAGGTAGCTGGGAAGCTGGATGAGAACCACTGTCGC 

CACCAGGTCGCATACCTGGGGCTGCTGGAGAATGTGAGGGTCCGCAGGGCTGGCTTCG 

CTTCCCGCCAGCCCTACTCTCGATTCCTGCTCAGGTACAAGATGACCTGTGAATACAC 

ATGGCCCAACCACCTGCTGGGCTCCGACAAGGCAGCCGTGAGCGCTCTCCTGGAGCAG 

CACGGGCTGCAGGGGGACGTGGCCTTTGGCCACAGCAAGCTGTTCATCCGCTCACCCC 

GGACACTGGTCACACTGGAGCAGAGCCGAGCCCGCCTCATCCCCATCATTGTGCTGCT 

ATTGCAGAAGGCATGGCGGGGCACCTTGGCGAGGTGGCGCTGCCGGAGGCTGAGGGCT 

ATCTACACCATCATGCGCTGGTTCCGGAGACACAAGGTGCGGGCTCACCTGGCTGAGC 

TGCAGCGGCGATTCCAGGCTGCAAGGCAGCCGCCACTCTACGGGCGTGACCTTGTGTG 

GCCGCTGCCCCCTGCTGTGCTGCAGCCCTTCCAGGACACCTGCCACGCACTCTTCTGC 

AGGTGGCGGGCCCGGCAGCTGGTGAAGAACATCCCCCCTTCAGACATGCCCCAGATCA 

AGGCCAAGGTGGCCGCCATGGGGGCCCTGCAAGGGCTTCGTCAGGACTGGGGCTGCCG 

ACGGGCCTGGGCCCGAGACTACCTGTCCTCTGCCACTGACAATCCCACAGCATCAAGC 

CTGTTTGCTCAGCGACTAAAGACACTTCAGGACAAAGATGGCTTCGGGGCTGTGCTCT 

TTTCAAGCCATGTCCGCAAGGTGAACCGCTTCCACAAGATCCGGAACCGGGCCCTCCT 

GCTCACAGACCAGCACCTCTACAAGCTGGACCCTGACCGGCAGTACCGGGTGATGCGG 

GCCGTGCCCCTTGAGGCGGTGACGGGGCTGAGCGTGACCAGCGGAGGAGACCAGCTGG 

TGGTGCTGCACGCCCGCGGCCAGGACGACCTCGTGGTGTGCCTGCACCGCTCCCGGCC 

GCCATTGGACAACCGCGTTGGGGAGCTGGTGGGCGTGCTGGCCGCACACTGCCGCAGG 

GAGGGCCGCACCCTGGAGGTTCGCGTCTCCGACTGCATCCCACTAAGCCATCGCGGGG 

TCCGGCGCCTCATCTCCGTGGAGCCCAGGCCGGAGCAGCCAGAGCCCGATTTCCGCTG 

CGCTCGCGGCTCCTTCACCCTGCTCTGGCCCAGCCGCTGAGCGCCCGCACCCGCCGCA 

2CCCGA 




ORF Start: ATG at | ORF Stop: TGA at 3054 
15 1 




SEQ ID NO: 230 1013 aa MVV at 1 16044.5kD 


NOV81a, 

CG59522-01 Protein 
Sequence 


MSDEEGPEYGKPDFVLLDQVTMEDFMRNLQLRFEKGRIYTYIGEVLVSVNPYQELPLY 
GPEAIARYQGRELYERPPHLYAVANAAYKAMKHRSRDTCIVISGESGAGKTEASKHIM 
QYIAAVTNPSQRAEVERVKDVLLKSTCVLEAFGNARTNRNHNSSRFGKYMDINFDFKG 
DPIGGHIHSYLLEKSRVLKQHVGERNFHAFYQLLRGSEDKQLHELHLERNPAVYNFTH 
QGAGLNMTVSDEQSHQAVTEAMRVIGFSPEEVESVHRILAAILHLGNIEFVETEEGGL 

ACAKAVYQRLFEWWNRINSVMEPRGRDPRRDGKDTVIGVLDIYGFEVFPVNSFEQFC 
INYCNEKLQQLFIQLILKQEQEEYEREGITWQSVEYFNNATIVDLVERPHRGILAVLD 
EACSSAGTITDRIFLQTLDMHHRHHLHYTSRQLCPTDKTMEFGRDFRIKHYAGDVTYS 
VEGFIDKNRDFLFQDFKRLLYNSTDPTLRAMWPDGQQDITEVTKRPLTAGTLFKNSMV 
ALVENLASKEPFYVRCIKPNEDKVAGKLDENHCRHQVAYLGLLENVRVRRAGFASRQP 
YSRFLLRYKMTCEYTWPNHLLGSDKAAVSALLEQHGLQGDVAFGHSKLFIRSPRTLVT 
LEQSRARLI PI IVLLLQKAWRGTLARWRCRRLRAI YTIMRWFRRHKVRAHLAELQRRF 
QAARQPPLYGRDLVWPLPPAVLQPFQDTCHALFCRWRARQLVKNIPPSDMPQIKAKVA 
AMGALQGLRQDWGCRRAWARDYLSSATDNPTASSLFAQRLKTLQDKDGFGAVLFSSHV 
RKVNRFHK I RNRALLLTDQHLYKLDPDRQYRVMRAVP LEAVTGLSVTSGGDQLWLHA 
RGQDDLWCLHRSRPPLDNRVGELVGVLAAHCRREGRTLEVRVSDCIPLSHRGVRRLI 
i E ,_~£ r LFi .= L £ rLL> 
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Further analysis of the NOV81a protein yielded the following properties shown in 
Table 81B. 



Table SIB. Protein Sequence Properties NOVSla 


PSort 
analysis: 


0.8800 probability located in nucleus; 0.3902 probability located in 
microbody (peroxisome); 0.2210 probability located in lysosome (lumen); 
0.1000 probability located in mitochondrial matrix space 


SignalP 
analysis: 


No Known Signal Sequence Predicted 



A search of the NOV81a protein against the Geneseq database, a proprietary 
database that contains sequences published in patents and patent publication, yielded 
several homologous proteins shown in Table 81C. 



Table SIC. Geneseq Results for NOVSla 


Geneseq 
Identifier 




Protein/Organism/Length 
[Patent #, Date] 


NOV81a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Region 


Expect 
Value 


AAU23125 


Novel human enzyme 
polypeptide #21 1 - Homo 
sapiens, 1 026 aa. 
[WO200155301-A2, 02-AUG- 
2001] 


1..1013 
9.. 1026 


1009/1018(99%) 
1011/1018(99%) 


0.0 


AAU23128 


Novel human enzyme 
polypeptide #214 - Homo 
sapiens, 909 aa. [WO200155301- 
A2, 02-AUG-2001] 


1..853 
9..866 


851/858(99%) 
851/858 (99%) 


0.0 


AAM80123 


Human protein SEQ ID NO 3769 
- Homo sapiens, 764 aa. 
[WO200157190-A2, 09-AUG- 
2001] 


243..1011 
1..762 


438/769 (56%) 
570/769 (73%) 


0.0 


AAM79139 


Human protein SEQ ID NO 1801 
- Homo sapiens, 753 aa. 
[WO200157190-A2, 09-AUG- 
2001] 


254..1011 
1..751 


434/758 (57%) 
564/758 (74%) 


0.0 


AAM39991 


Human polypeptide SEQ ID NO 
3136 - Homo sapiens, 1063 aa. 
[WO200153312-A1, 26-JUL- 
2001] 


10..933 
47..986 


410/966 (42%) 
556/966 (57%) 


0.0 
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In a BLAST search of public sequence databases, the NOV81a protein was found to 
have homology to the proteins shown in the BLASTP data in Table 8 ID. 



Table 81D. Public BLASTP Results for NOV81a 


Protein 
Accession 
Number 


Protein/Organism/Length 


NOV81a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Portion 


Expect 
Value 


Q63357 


MYOSIN I - Rattus norvegicus 
(Rat), 1006 aa. 


1..1011 
1..1004 


606/1011 (59%) 
780/1011 (76%) 


0.0 


A53933 


myosin I myr 4 - rat, 1006 aa. 


1..1011 
1..1004 


604/1011 (59%) 
778/1011 (76%) 


0.0 


Q96RI6 


UNCONVENTIONAL MYOSIN 
1G VALINE FORM - Homo 
sapiens (Human), 633 aa 
(fragment). 


33..646 
1..619 


612/619 (98%) 
612/619 (98%) 


0.0 


Q96RI5 


UNCONVENTIONAL MYOSIN 
1G METHONINE FORM - Homo 
sapiens (Human), 633 aa 
(fragment). 


33..646 
1..619 


611/619(98%) 
612/619(98%) 


0.0 


Q23978 


Myosin IA (MIA) (Brush border 
myosin IA) (BBMIA) - Drosophila 
melanogaster (Fruit fly), 1011 aa. 


8..1012 
6..1007 


503/1017(49%) 
686/1017(66%) 


0.0 



PFam analysis predicts that the NOV81a protein contains the domains shown in the 
Table 8 IE. 



Table 81E. Domain Analysis of NOV81a 



Pfam Domain 


P 

NOV81a Match 
Region 


Identities/ 
Similarities 
for the Matched 
Region 


Expect 
Value 


PRK: domain 1 of 1 


97.. 109 


8/13 (62%) 
10/13 (77%) 


3.7 


Vir DNA binding: domain 1 
of 1 


575..592 


5/18(28%) 
14/18(78%) 


8.2 


myosin head: domain 1 of 1 


11. .689 


305/747(41%) 
531/747 (71%) 


8.1e-288 
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Example 82. 

The NOV82 clone was analyzed, and the nucleotide and predicted polypeptide 
sequences are shown in Table 82A. 



Table 82A. NOV82 Sequence Analysis 




SEQIDNO:231 1 1066 bp 


NOV82a, 

CG59520-01 DNA Sequence 


GAACGAATGGGAAACCAGAAATCAGATATTTATGO :aj 3 J ? ^GGATTTCGTTC 
AGCACTACTCCCAGATCGTTAGGGTGCTGACTGAGGATGAGATGGGGCACCCAGAGAC 
AGGAGATGCTACTGCCCGGCTCAAGGAGGTCCTGGAGTACAATGCCATTGGAGGCAAG 
TATCACCGAGGTTTGATGGTGCTAGTAGCGTTCCGGGAGCTGGTGGAGCCGAGGAAAC 
TGGATGCTGATAGTCTCCAGTGGGCACCGACTGTGGGCTGGTATGCGCAACTGCTGCA 
AGCTTTCTTCCTGGTGGCAGATGACATTATGGATTCATCCCTTACCTGCCAGGGACAG 
ATCTCCTGGTATCAGAAGCTGGGCATGGGTTTGGATGCCATCAATGATGCTATCCTTC 
TGGAAGCATGTATCTACTGCCTGCTGAAGCTGTATTGCCGGGAGCAGCCCTATTACCT 
GAACCTGATGGAGCTCTTCCAGCAGAATTCTTATCAGACTGAGATTGGGCAGACCCTC 
GACCTCATCACAACCCCCCAGGGCAATGTGGATCTTCGCAGATGCACCGAAAAAAGGC 
ACAAATCTGTTGTCAAGTACAAGACAGCTTTCTACTCCTTCTACCTTCCTGTAGCTGC 
AGCCATGTACATGTCAAGAATGGATGACAAGAAGGAGCACACCAGTGCCAAGAAGATC 
CTGCTGGAGATTCAAGAGTTCTTTCAGATTCAGGATGATTACCTTGACTTCTCTGGGG 
ACCCCAGTGTGACTGGCAGAGTTGGCAATGACTTCCAGGACAACAAATGCAGCTGGCT 
GGTGGTTCAGTGTCTGCTACAGGCCACTCCAGAACAGTACCAGATCCTGAAGGAAAAT 
TACAGGCAGAAGGAGGCCGAGAAGGTGGCCCGGGTGAAGGCACTATACGAGGAGCTGG 
ATCTGCCAGCCGTGTTCTTGCAGTATGAGAAAGACAGTTACAGCCACGTTATGGGTCT 
CATCGAACAGTACGCAGAGCCCCTGCCCCCAGCCATCTTTCTGGGGCTTGTGCACAAA 
M A' 4A3TGGAAAAA3TGAC 




ORF Start: ATG at 7 


ORF Stop: TGAat 1063 




SEQ ID NO: 232 


352 aa MW at 10740.3kD 


NOV82a, 

CG59520-01 Protein Sequence 


MGNQKSDIYAQAKQDFVQHYSQIVRVLTEDEMGHPETGDATARLKEVLEYNAIGGKYH 
RGLMVLVAFRELVEPRKLDADSLQWAPTVGWYAQLLQAFFLVADDIMDSSLTCQGQIS 
WYQKLGMGLDAINDAILLEACIYCLLKLYCREQPYYLNLMELFQQNSYQTEIGQTLDL 
ITTPQGHVDLRRCTEKRHKSWKYKTAFYSFYLPVAAAMYMSRMDDKKEHTSAKKILL 
EIQEFFQIQDDYLDFSGDPSVTGRVGNDFQDNKCSWLWQCLLQATPEQYQILKENYR 
QKEAEKVARVKALYEELDLPAVFLQ YEKDS YSHVMGL I EQYAEPLP PAI FLGLVHKI Y 
KWKK 



Further analysis of the NOV82a protein yielded the following properties shown in 
Table 82B. 



Table 82B. Protein Sequence Properties NOV82a 


PSort 

analysis: 


0.4066 probability located in microbody (peroxisome); 0.3000 probability 
located in nucleus; 0.1000 probability located in mitochondrial matrix space; 
0.1000 probability located in lysosome (lumen) 


SignalP 
analysis: 


No Known Signal Sequence Predicted 

, „„ 



A search of the NOV82a protein against the Geneseq database, a proprietary 
database that contains sequences published in patents and patent publication, yielded 
several homologous proteins shown in Table 82C. 
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Table 82C. Cicncscq Results for NOV82a 


Geneseq 
Identifier 


Protein/Organism/Length [Patent 
#, Date] 


NOV82a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Region 


Expect 
Value 


AAG29733 


Arabidopsis thaliana protein 
fragment SEQ ID NO: 35427 - 
Arabidopsis thaliana, 342 aa. 
[EP1033405-A2, 06-SEP-2000] 


10..352 
2..342 


147/343 (42%) 
219/343 (62%) 


7e-75 


AAG29732 


Arabidopsis thaliana protein 
fragment SEQ ID NO: 35426 - 
Arabidopsis thaliana, 349 aa. 
[EP1033405-A2, 06-SEP-2000] 


10..352 
9..349 


147/343 (42%) 
219/343 (62%) 


7e-75 


AAG29734 


Arabidopsis thaliana protein 
fragment SEQ ID NO: 35428 - 
Arabidopsis thaliana, 305 aa. 
[EP1033405-A2, 06-SEP-2000] 


47..352 
1..305 


138/306 (45%) 
204/306 (66%) 


4e-73 


AAY43635 


Amino acid sequence of the farnesyl 
pyrophosphate synthase enzyme - 
Phaffia rhodozyma, 355 aa. 
[EP955363-A2, 10-NOV-1999] 


12..352 
11. .355 


145/346(41%) 
208/346 (59%) 


4e-69 


AAB48971 


Sunflower seedling farnesyl 
pyrophosphate synthase (FPS) - 
Helianthus annuus, 341 aa. 
IEP1063297-AL 27-DEC-2000] 


13..352 
6..341 


138/343 (40%) 
204/343 (59%) 


3e-64 



In a BLAST search of public sequence databases, the NOV82a protein was found to 
have homology to the proteins shown in the BLASTP data in Table 82D. 
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Table S2I). Public BLASTP Results for NOV82a 


Protein 
Accession 
Number 


Protein/Organism/Length 


NOV82a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Portion 


Expect 
Value 


Q96G29 


FARNESYL DIPHOSPHATE 
SYNTHASE (FARNESYL 
PYROPHOSPHATE 
SYNTHETASE, 

DIMETHYLALLYLTRANSTRA 
NSFERASE, 

GERANYLTRANSTRANSFERA 
SE) - Homo sapiens (Human), 419 
aa. 


2..352 
69..419 


291/351 (82%) 
317/351 (89%) 


e-168 


PI 4324 


Farnesyl pyrophosphate synthetase 
(FPP synthetase) (FPS) (Farnesyl 
diphosphate synthetase) [Includes: 
Dimethylallyltransferase (EC 
2.5.1.1); Geranyltranstransferase 
(EC 2.5.1.10)] - Homo sapiens 
(Human), 353 aa. 


2..352 
3..353 


291/351 (82%) 
317/351 (89%) 


e-168 


A35726 


farnesyl-pyrophosphate synthetase 
- human, 353 aa. 


2..352 
3..353 


290/351 (82%) 
316/351 (89%) 


e-168 


AAL58886 


FARNESYL DIPHOSPHATE 
SYNTHASE - Bos taurus (Bovine), 
353 aa. 


2..352 
3..353 


270/351 (76%) 
308/351 (86%) 


e-157 


Q14329 


FARNESYL PYROPHOSPHATE 
SYNTHETASE LIKE-4 PROTEIN 
- Homo sapiens (Human), 348 aa. 


6..352 
2..348 


268/347 (77%) 
295/347 (84%) 


e-150 



PFam analysis predicts that the NOV 82a protein contains the domains shown in the 
Table 82E. 



Table 82E. Domain Analysis of NOV82a 


Pfam Domain 


NOV82a Match 
Region 


Identities/ 
Similarities 
for the Matched 
Region 


Expect 
Value 


polyprenyl synt: domain 1 
of 1 


43..3 15 


82/285 (29%) 
237/285 (83%) 


6.3e-91 
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Example 83. 

The NOV83 clone was analyzed, and the nucleotide and predicted polypeptide 
sequences are shown in Table 83A. 



Table 83A. NOV83 Sequence Analysis 




SEQIDNO:233 (411 bp 


NOV83a, 

CG5 975 8-01 DNA Sequence 


TGCCTACCCCGAGACTGCTGCTGTTCGG JACCTG .. 3TGAATGCCCCATCACCATG 


TCTGACCTGGAGGCAAAACCTTCAACTGAGCATTTGGGGGATAAGATAAAAGATGAAG 
ATATTAAACTCAGGGTTATTGGACAGGATAGCAGTGAGATTCATTTCAAAGTGAAAAT 
GACAACACCTCTCAAGAAACTCAAGAAATCGTACTGTCAGAGACAGGGCGTTCCAGTG 
AATTCCCTCAGGTTTCTCTTTGAAGGTCAGAGAATTGCTGATAATCATACTCCAGAAG 
AACTGGGAATGGAGGAAGAAGATGTGATTGAGGTTTATCAGGAACAAATCGGAGGTCA 






TTAAA 




ORF Start: ATG at 56jORF Stop: TAG at 359 




SEQ ID NO: 234 1 101 aa |MW at 1 1526.0RD 


NOV83a, 

CG59758-01 Protein Sequence 


MSDLEAKPSTEHLGDKIKDEDIKLRVIGQDSSEIHFKVKMTTPLKKLKKSYCQRQGVP 
VNSLRFLFEGQRIADNHTPEELGMEEEDVIEVYQEQIGGHSTV 




SEQ ID NO: 235 658 bp 


NOV83b, 

CG59758-02 DNA Sequence 


CTACCCCGAGACTGCTGCTGTTCGGAGACCTGCAGGTGAATGCCCCATCACCATGTCT 


GACCTGGAGGCAAAACCTTCAACTGAGCATTTGGGGGATAAGATAAAAGATGAAGATA 

AACACCTCTCAAGAAACTCAAGAAATCGTACTGTCAGAGACAGGGCGTTCCAGTGAAT 
TCCCTCAGGTTTCTCTTTGAAGGTCAGAGAATTGCTGATAATCATACTCCAGAAGAAC 
TGGGAATGGAGGAAGAAGATGTGATTGAGGTTTATCAGGAACAAATCGGAGGTCATTC 
AACAGTTTAGACAATCGGAGGTCATTCAACAGTTTAGACAATCGGAGGTCATTCAACA 


GTTTAGACAATCGGAGGTCATTCAACAGTTTAGACAATCGGAGGTCATTCAACAGTTT 


AGACAATCGGAGGTCATTCAACAGTTTAGACAATCGGAGGTCATTCAACAGTTTAGAC 


AATCGGAGGTCATTCAACAGTTTAGACAATCGGAGGTCATTCAACAGTTTAGACAATC 


GGAGGTCATTCAACAGTTTAC-ACAATCGGAGGTCATTCJACAGTTTAGACJATCQGAQ 


T 1 * 1 




ORF Start: ATG at 53 


ORF Stop: TAG at 356 




SEQ ID NO: 236 


101 aa |MWatll526.0kD 


NOV83b, 

CG59758-02 Protein Sequence 


MSDLEAKPSTEHLGDKIKDEDIKLRVIGQDSSEIHFKVKMTTPLKKLKKSYCQRQGVP 
VNSLRFLFEGQRIADNHTPEELGMEEEDVIEVYQEQIGGHSTV 



Sequence comparison of the above protein sequences yields the following sequence 
relationships shown in Table 83B. 



Table 83B. Comparison of NOV83a against NOV83b. 


Protein Sequence 


NOV83a Residues/ 
Match Residues 


Identities/ 
Similarities for the Matched Region 


NOV83b 


1.. 101 
1 ..101 


101/101 (100%) 
101/101 (100%) 



Further analysis of the NOV83a protein yielded the following properties shown in 
Table 83C. 
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1 Table 83C. Protein Sequence Properties NOV83a 


PSort 
j analysis: 


0.6500 probability located in cytoplasm; 0.1000 probability located in 
mitochondrial matrix space; 0.1000 probability located in lysosome (lumen); 
0.0000 probability located in endoplasmic reticulum (membrane) 


| SignalP 
| analysis: 


No Known Signal Sequence Predicted 



A search of the NOV83a protein against the Geneseq database, a proprietary 
database that contains sequences published in patents and patent publication, yielded 
several homologous proteins shown in Table 83D. 





Table 83D. Geneseq Results for NOV83a 




[ 

Geneseq 
Identifier 


Protein/Organism/Length 
[Patent #, Date] 


IN KJ V OJa 

Residues/ 

Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Region 


Expect 
Value 


AAM79976 


Human protein SEQ ID NO 3622 - 
Homo sapiens, 125 aa. 
[WO200157190-A2, 09-AUG- 
2001] 


1-101 

25..125 


100/101 (99%) 
100/101 (99%) 


le-52 


AAM78992 


Human protein SEQ ID NO 1654 - 
Homo sapiens, 101 aa. 
[WO200157190-A2, 09-AUG- 
2001] 


1-101 
1 -101 


100/101 (99%) 
100/101 (99%) 


le-52 


AAY49967 


Human sentrin protein sequence - 
Homo sapiens, 101 aa. 
[US5985664-A, 16-NOV-1999] 


1-101 
1..101 


89/101 (88%) 
94/101 (92%) 


2e-45 


AAW87984 


Ubiquitin-like domain of the 
protein SUMOl - Mammalia, 101 
aa. [W09857978-A1, 23-DEC- 
1998] 


1..101 
1..101 


89/101 (88%) 
94/101 (92%) 


2e-45 


AAW60079 


Homo sapiens sentrin- 1 
polypeptide - Homo sapiens, 101 
aa. [WO9820038-A1, 14-MAY- 
1998] 


1..101 
1..101 


89/101 (88%) 
94/101 (92%) 


2e-45 



In a BLAST search of public sequence databases, the NOV83a protein was found to 
have homology to the proteins shown in the BLASTP data in Table 83E. 
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Table 83E. Public BLASTP Results for NOV83a 


Protein 
Accession 
Number 


Protein/Organism/Length 


NOV83a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities 
for the 
Matched 
Portion 


Expect 
Value 

6e-45 


Q93068 


Ubiquitin-like protein SMT3C 
precursor (Ubiquitin-homology domain 
protein PIC1) (Ubiquitin-like protein 
UBL1) (Ubiquitin-related protein 
SUMO-1) (GAP modifying protein 1) 
(GMP1) (Sentrin) - Homo sapiens 
(Human), and, 101 aa. 


1..101 
1..101 


89/101 (88%) 
94/101 (92%) 


Q9MZD5 


SENTRJN - Cervus nippon (Sika deer), 
101 aa. 


1..101 
1.-101 


88/101 (87%) 
93/101 (91%) 


2e-44 


057686 


SUMO-1 PROTEIN - Xenopus laevis 
(African clawed frog), 102 aa. 


1..100 
1..101 


83/101 (82%) 
90/101 (88%) 


2e-39 


Q9PT08 


S\l 1 1 I SIOl \Tlh IEI MED 
PROTEIN 1 - Oncorhynchus mykiss 
(Rainbow trout) (Salmo gairdneri), 1 0 1 
aa. 


1..97 
1..97 


72/97(74%) 
84/97 (86%) 


9e-35 


; Q9D466 1 493341 1G06RIK PROTEIN - Mus j 1..97 
| 1 musculus (Mouse), 1 17 aa. 1 ..96 


68/97 (70%) 
80/97 (82%) 


8e-30 



PFam analysis predicts that the NOV83a protein contains the domains shown in the 



Table 83F. Domain Analysis of NOV83a 


Pfam Domain 


Identities/ 

NOV83a Match Similarities 
Region for the Matched 
Region 


Expect 
Value 


ubiquitin: domain 1 of 
1 


20..95 14/83 (17%) 
[ 66/83 (80%) 


4.7e-18 



Example 84. 



The NOV84 clone was analyzed, and the nucleotide and predicted polypeptide 
sequences are shown in Table 84A. 
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Table 84A. NOV84 Sequence Analysis 




SEQ ID NO: 237 


912 bp 


NOV84a, 

CG59586-01 DNA Sequence 


GGGCGCCCTGCTGGCATCTGAGCTGGGATGGAAATTCTATGATGCTGATGATTATCAC 
CCGGAGGAAAATCGAAGGAAGATGGGAAAAGGCATACCGCTCAATGACCAGGACCGGA 
TTCCATGGCTCTGTAACTTGCATGACATTTTACTAAGAGATGTAGCCTCGGGACAGCG 
TGTGGTTCTAGCCTGTTCAGCCCTGAAGAAAACGTACAGAGACATATTAACACAAGGA 
AAAGATGGTGTAGCTCTGAAGTGTGAGGAGTCGGGAAAGGAAGCAAAGCAGGCTGAGA 
TGCAGCTCCTGGTGGTCCATCTGAGCGGGTCGTTTGAGGTCATCTCTGGACGCTTACT 
CAAAAGAGAGGGACATTTTATGCCCCCTGAATTATTGCAGTCCCAGTTTGAGACTCTG 
GAGCCCCCAGCAGCTCCAGAAAACTTTATCCAAATAAGTGTGGACAAAAATGTTTCAG 
AGATAATTGCTACAATTATGGAAACCCTAAAAATGAAATGACAATGATTTTGTATCAG 
TGGTCCAAACAGAACTAAGCATAAATCATTGTGCCATCCCAAACCTCGTTCCAGCCGC 


CTTGCCCATACTAGATTCTAAATGTTTCTAAAGGCAAACCCCAATGTGTCAAGACAGA 


CTTGTTTAGGTGTAATTTTAGGAATTATGCTGGTTCATCAGGAAGCAGAGGGGGAGTT 


TTAAAAGTCAAGCTTAAATTGAAGTTTAAATTCATCTATAACCAAATCAAATGATCAG 


AGGAAATTCTGTAATCAATGCTGGAAATCGTTACATTGTTTAGAACATTCTTGCTCAT 


GCCTGTATTTGCACAAATAAATGAAACTTCGCTGTAAAAAAA 




ORF Start: ATG at 9 


ORF Stop: TGA at 561 




SLQ ID NO: 238 


184 aa MW at 20352.2RD 


NOV84a, 

CG59586-01 Protein Sequence 


MGSSGCLCFSGSGKSTVGALLASELGWKFYDADDYHPEENRRKMGKGIPLNDQDRIPW 
LCNLHDILLRDVASGQRWLACSALKKTYRDILTQGKDGVALKCEESGKEAKQAEMQL 
LWHLSGSFEVISGRLLKREGHFMPPELLQSQFETLEPPAAPENFIQISVDKNVSEII 
ATIMETLKMK 



Further analysis of the NOV84a protein yielded the following properties shown in 
Table 84B. 



Table 84B. Protein Sequence Properties NOV84a 


PSort 
analysis: 


0.6500 probability located in cytoplasm; 0.1000 probability located in 
mitochondrial matrix space; 0.1000 probability located in lysosome (lumen); 
0.1000 probability located in plasma membrane 


SignalP 
analysis: 


No Known Signal Sequence Predicted 



A search of the NOV84a protein against the Geneseq database, a proprietary 
database that contains sequences published in patents and patent publication, yielded 
5 several homologous proteins shown in Table 84C. 



Table 84C. Geneseq Results for NOV84a 


Geneseq 
Identifier 


Protein/Organism/Length [Patent 
#, Date] 


NOV84a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Region 


Expect 
Value 


AAG73989 


Human colon cancer antigen 
protein SEQ ID NO:4753 - Homo 
sapiens, 193 aa. [WO2001 22920- 
A2, 05-APR-2001] 


10..184 
19..193 


175/175 (100%) 
175/175 (100%) 


le-97 
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AAB58998 


Breast and ovarian cancer 
associated antigen protein sequence 
SEQ ID 706 - Homo sapiens, 193 
aa. [WO200055173-A1, 21-SEP- 
2000] 


10..184 
19. .193 


175/175 (100%) 
1 nil id (iuo%) 


le-97 


AAM89100 


Human immune/haematopoietic 
antigen SEQ ID NO: 16693 - Homo 
sapiens, 133 aa. [WO200157182- 
A2, 09-AUG-2001] 


24.. 126 
22.. 124 


70/103 (67%) 
77/103 (73%) 


le-34 


AAG50675 


Arabidopsis thaliana protein 
fragment SEQ ID NO: 64243 - 
Arabidopsis thaliana, 1 75 aa. 
[EP1033405-A2, 06-SEP-2000] 


10. .179 
4..167 


75/173 (43%) 
102/173 (58%) 


4e-28 


AAG50674 


Arabidopsis thaliana protein 
fragment SEQ ID NO: 64242 - 
Arabidopsis thaliana, 187 aa. 
[EP1033405-A2, 06-SEP-2000] 


10..179 
16.. 179 


75/173 (43%) 
102/173 (58%) 


4e-28 


In a BLAST search of public sequence databases, the NOV84a protein was found to 
have homology to the proteins shown in the BLASTP data in Table 84D. 


Table 84D. Public BLASTP Results for NOV84a 


Protein 
Accession 
Number 


: 

Protein/Organism/Length 


NOV 84a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Portion 


Expect 
Value 


BAB74785 


GLUCONOKINASE - Anabaena 
sp. (strain PCC 7120), 160 aa. 


10..183 
9.. 160 


72/174 (41%) 
101/174(57%) 


le-30 


Q9RT56 


THERMORESI ST ANT 
GLUCONOKINASE - 
Deinococcus radiodurans, 172 aa. 


10.. 183 
4..159 


66/174 (37%) 
101/174 (57%) 


le-29 


CAC93415 


PUTATIVE Gl 1 C< >NOKlN \M 
(EC 2.7.1 .12) - Yersinia pestis, 
167 aa. 


10.. 174 
12. .159 


68/166 (40%) 
95/166 (56%) 


2e-29 


Q9CMM6 


GLK - Pasteurella multocida, 172 
aa. 


10..182 
15..169 


68 174(39%) 
99/174 (56%) 


2e-29 


AAK86014 


I AGRC 329P - Agrobacterium 
tumefaciens str. C58 (Cereon), 
163 aa. 


10..182 
5..159 


74/173 (42%) 
98/173 (55%) 


6e-29 



PFam analysis predicts that the NOV84a protein contains the domains shown in the 
Table 84E. 
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Table 84E. Domain Analysis of NOV84a 


Pfam Domain 


NOV84a Match Region 


Identities/ 
Similarities 
for the Matched Region 


Expect Value 


SKI: domain 1 of 1 


9..182 


37/206(18%) 
114/206 (55%) 


1.1 



Example 85. 

The NOV85 clone was analyzed, and the nucleotide and predicted polypeptide 
sequences are shown in Table 85A. 



Table 85A. NOV85 Sequence Analysis 




SEQ ID NO: 239 '4332 bp 


NOV85a, 

CG59704-01 DNA Sequence 


V T " ^ T MG ~ L T r [ 


AATGACGAGATCGGCCAGGGTGCCTTCAGCACTATTTACAAGGGCCGCTATCGCACCA 
CCACCGAGTTCTACGCGATTGCTTCCATCGACAAGAAGCGACGGGAGCGCGTCGTGAA 
CTGCGTTCAGCTGTTACGCTCCATGCACCACTCAAACGTCATAGAGTTCCACAACTGG 
TATGAGACCAACAATCACTTGTGGATCATTACGGAGTACTGCACCGGCGGAGACATGA 
GCACGATCCTCCGCTCGAACATTAATCTCACCACTCAGGCGGTCCAGGCGTTCGGCCG 
TGATGTGGCGATGGGCCTCATGTACATCCACAGTAAGGGTGTCGTGTATAACGACTTG 
CAGACTCGCAATCTGCTGATGGACTCCGCAGCAATGCTGCGCTTCCACGACTTTAGCT 
TGGCCTGTCTCTTCCAAGACGCGGCGACGCGGCCACTGGTGGGGACGCCACTGTACAT 
GGCCCCCGAGTTGTTCATGGCGGATCGCCCGCTGTACTCGATGGCATCAGACCTGTGG 
TCCTTCGGTTGTGTGCTGCACGAGCTGGCGACAGGCAAGCCGCCCTTTGCCGCATCCG 
ACCTCGAGACGCTGCTGGGCGACATACTGACGAGTCCGACGCCAGCGGTGCCTGGTGC 
GCCGGAGTCCTTTCAAACGCTCCTGTGCGGCCTGCTGGAAAAGGACCCGTTGAAGCGC 

GCAACGGCTTTCCATCTCAGGTGGCGTGGGAGGACTACAAGCGTTCGCGTTCTGGACG 
CGGTGCGAGTCAGTATAATTGGACGGACTCCGATGTGCGTGTGGCAGTGGCTCACGCC 
GTGGGGGCAGCGAAATCAAACGCTTCTACGCACAACGTGGAGGAGAGGGAGCGAGCGG 
CTGCGACGTTGAACGTCGCGAAGGAGCTGGACTTCACTGCAAGCGCGGCGATGTTGCT 
GGAACGGTTACCGGAGCGGACACAGGAGCGTGCTGCGCACGCAACAGGCCATGTCGCG 
ACCGCGCACGGCAGCCTGGTGCACGGCTGCCCATCCACGGCCTCAGCGGCGACCTCGC 
CAAGACGTTCAAGGACAAGGCGGCGCTGCTCAAGATTGTGGAAGAGGTCAAAACCGCT 
GTCGAGGGCTTCAAGCCGTGGGTGTCCTTCCACGTCGCTGCGCCACCCGGGCATGAGG 
GAGCGCCACTGGACCGGCTTGTCTCAGAAGCTCGGGATGAAGCTGGTGCCTGGCGACA 
CACTGATGCTTCTGGAGGACTGCGAACCGCTGCTAGCGCACCGCGACACCATTATCAG 
CTACTGCGAGGTGGCCGCGAAGGAGTCGCAGATCGAGATGACGCTCAAGGACATGCGT 
GCCAAGTGGGAGACCAAGTGCTTCATCATCGAGGCATACAAGGAGACAGGCACGTACA 
TCCTCAAGGACACCTCCGAGGTGGTGGAGCTCCTCGACGAGCACCTCAACGTCGTCCA 
GCAGCTGCAGTTCTCTCCATTCAAGGGCTACTTCGAGGAGTCCATCACGGACTGGGAG 
CGCTCCCTCAACCTCATCTCCGACATACTCGAACAATGGCTGGAGTGCCAGCGAGCGT 
GGCGTTATCTGGAGCCGATCCTCAACTCGGAGGACATCGCCATGCAGCTACCGCGACT 
GTCCACGCTGTTCGAGAAGGTGGACCGCACATGGAGACGTGTCATGGGCAACGCGCAC 
GCGCAGCCAAACGCACTCGAGTACTGCATTGGCACAAACAAGCTCTTGGACCACCTGC 
GCGAGGCGAACCGGCTCCTCGAAGTGCTGCAGCACTTGATGGCGCAGAAGGTCAACGT 
TGCCGCTGTTGGTCCGACTGGCACCGGCAAGTCCATCTCACTCGCGCGTCTCGTGCTT 
GGCGGCGGCATGCCGGCCAACTTTCTTGGCCTCAACTTCACCTTCTCGGCGCAGACAA 
AGTGCACAGTGTTGCAGAATTCACTGATGGCCAAGTTCGATAAGCGGCGCTCGCACGT 
CTACGGCGCCCCTGCCGGTAAGCACTTTCTCATCTTCATTGACGACGCGAACCTGCCG 
CAGCCAGAGAAGTACGGCGCGCAGCCCCCGGTGGAGCTTCTGCGGCAGATGCTCGCCC 
AAGGCGGCTTCTACAACTTTACAGGTGGCATCAAGTGGTCCTCCATCATCGACTGCTC 
GCTTGCGCTGGCGATGGGGCCGCCTGGCGGGGGCCGCAGCCGGGTTTCGAACCGCTTT 
ATGCGCTACTTCAATTACCTTGCCTTCCCCGAGATGTCGGACATGTCGAAGCGAACGA 
TCTTGCAGGCCATCCTCGTCGGCGGCCTCGCGCAGAGCGGCCTCGCTGACCGCCTCGC 
GAACGTCGCCTCCGCCGTGGTCGATAGCACGTTGCGGGTGTTTCGCAAGTGCACCCAG 

ACAGACAAGGGTCTGTTCATCGAGTACCTCAATGCCGAGCTGCCGTCCATGGGGGTGG 
ACAAGTCCTACAACGAGGTAGTGAAGGCTGACCGCCTCATCTTTGCCGACGTACTGAG 

GAGCTGCTGGAGGCGTACAATGACGAGAATGAAGTGAAGATGAACCTCGTGCTCTTCC 
TCGACGCCATCGAGCATGTCTGCCGTATCTCGCGCGTGCTGCGACTGCCGAACGGGCA 

r i t i r rG5CTT'r 
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AGGAATGGCACGAGAGCCTCGCGAAGTTGCTGCTCGAGTGTGGCAAGGACGAGAAGAA 
GCGGACGTTTCTCTTCGCCGACACCCAGCTGGCGCATCCGACGTTTCTGGAGGATGTG 
GCGGGCCTGCTCACATCGGGTGATGTGCCGAACCTCTTTGAGGACCAAGATATCGAGC 
TCATCAACGACAAGTTTCGCGGCGTCTGCCTAAGCGAGAACCTGCCAACGACGAAGGT 
GTCGGTGTACGCGCGCTTTGTGAAGGAGGCGCGAGCCAACCTGCACCTTGTGCTCGCC 
TTCTCTCCCATCGGAGAGGCGTTTCGCAGCCGCCTGCGTATGTTCCCATCGCTCATTG 
CGTGCTGCACAATCGACTGGTTTGCTGAGTGGCCATCCGAGGCGCTACTGTCGGTAGC 
CGCAGTGCAGCTGAACGCCGGCGACGTTACTGACGTCATGGGGGCGGCAAGCCATGCC 
GACTTGCCGGGCTGCTTCCAGGCAGTGCACCGCGCGGCGGCGGAGGTGACGGAGCGCT 
TCTTCACGGAAACGCGTCGTCGCTCGTACGTGACGCCGACGTCCTATCTGTCGCTCCT 
CTCCAACTTCAAAGTGATGGCGGCGGCAAAACGCCGCTTCGTTCGCGAGCAGCGCGGC 
CGCCTCGAGAAGGGGCTGGAGAAGCTGCGGCACACCGAGGTGCAAGTGGCGGAGCTGG 
AGGCCCAGCTCAAGGCGCAGCAGCCGGTTCTGGTGCAGAAAAAGGCAGAGATTCAGTC 
GATGATGGAGCGGCTGACGGTGGACCGAAAGGAGGCGGCGGTGAAGGAGGCGGACGCG 
CGCAGGGAGGCCCAGCTTCCCGGTGGCCGTGCTGCATACGGCGGTGAAGATGACGAAT 
GAGCCGCCGATGGGGCTGCGGGCGAACGTGATGCGCTCCTACTACGGCTTCACTCCCG 
AGGACCTCGAGCAGGAGGAGAAGCCCGCCGAGTTCAAAAAGATGTTGATGGCATCCGC 


ATGCCTGGTCCCATACCCGAGCACTGAAGAGCAGGGTCTCTGGAGCCTGGCATCGTGG 


GGTGGCCCTCAGCTTCCCCACTCACTGTGGGAAGTTTCCTTAGTGTCTCTGAGCCTGT 


TTCCTCATCCGTTGCCTGAGGATAAACCTGCTTCAGGATTGTTGGTGAAAAGACTTCC 


CTCACCTAGCTTCTGTAACGCCACTGCATGCCACCACTGCTGAGTACTGTTTGTTTGC 
TAGGTTGGTGTCATTCTCATTTTACCAGAAAGTGAAGCTC 




OPvF Start: ATG at 
41 


ORF Stop: TGA at 3944 




Srn ID NO: 240 


1301 an MW at 14611 5.7kD 


NOV85a, 

CG59704-01 Protein 
Sequence 


MNNYVLNDEIGQGAFSTIYKGRYRTTTEFYAIASIDKKRRERWNCVQLLRSMHHSNV 
IEFHNWYETNNHLWIITEYCTGGDMSTILRSNINLTTQAVQAFGRDVAMGLMYIHSKG 
WYNDLQTRNLLMDSAAMLRFHDFSLACLFQDAATRPLVGTPLYMAPELFMADRPLYS 
MASDLWSFGCVLHELATGKPPFAASDLETLLGDILTSPTPAVPGAPESFQTLLCGLLE 
KDPLKRYAWVDWRSEFWDEPLPLPSNGFPSQVAWEDYKRSRSGRGASQYNWTDSDVR 
VAVAHAVGAAKSNASTHNVEERERAAATLNVAKELDFTASAAMLLERLPERTQERAAH 
ATGHVATAHGSLVHGCPSTASAATSPRRSRTRRRCSRLWKRSKPLSRASSRGCPSTSL 
RHPGMRERHWTGLSQKIjGMKLVPGDTLMLLEDCEPLLAHRDTIISYCEVAAKESQIEM 
TLKDMRAKWETKCFIIEAYKETGTYILKDTSEWELLDEHLNWQQLQFSPFKGYFEE 
SITDWERSLNLISDILEQWLECQRAWRYLEPILNSEDIAMQLPRLSTLFEKVDRTWRR 
VMGNAHAQPNALEYCIGTNKLLDHLREANRLLEVLQHLMAQKVNVAAVGPTGTGKSIS 
LARLVLGGGMPAWFLGLNFTFSAQTKCTVLQNSLMAKFDKRRSHVYGAPAGKHFLIFI 
DDANLPQPEKYGAQPPVELLRQMIAQGGFYNFTGGIKWSSIIDCSLALAMGPPGGGRS 
RVSNRFMRYFNYLAFPEMSDMSKRTILQAILVGGLAQSGLADRLANVASAWDSTLRV 
FRKCTQVFLPTPAHVHYSFNMRDVMRVFPLLYTADKSVLQSEESIVRLWMHEMQRVFY 
DRLVDATDKGLFIEYLNAELPSMGVDKSYNEWKADRLIFADVLSDKGVYEQITDMNA 
LTTRMNELLEAYNDElSrEVKMNLVLFLDAIEHVCRISRVLRLPNGHCLLLGVGGSGRKS 
LTRLACSLIAEMEVFTIELSKNFGVKEWHESLAKLLLECGKDEKKRTFLFADTQLAHP 
TFLEDVAGLLTSGDVPNLFEDQDIELINDKFRGVCLSENLPTTKVSVYARFVKEARAN 
LHLVLAFSPIGEAFRSRLRMFPSLIACCTIDWFAEWPSEALLSVAAVQLNAGDVTDVM 
GAASHADLPGCFQAVHRAAAEVTERFFTETRRRSYVTPTSYLSLLSNFKVMAAAKRRF 
VREQRGRLEKGLEKLRHTEVQVAELEAQLKAQQPVLVQKKAEIQSMMERLTVDRKEAA 
\ P F A 1 ^ -t, ,L£ „ , ■ ri, L 



Further analysis of the NOV85a protein yielded the following properties shown in 
Table 85B. 



Table S5K. Protein Sequence Properties NGV85a 


PSort 
analysis: 


0.8800 probability located in nucleus; 0.3562 probability located in 
microbody (peroxisome); 0.1671 probability located in lysosome (lumen); 
0.1000 probability located in mitochondrial matrix space 


SignalP 
analysis: 


No Known Signal Sequence Predicted 
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A search of the NOV85a protein against the Geneseq database, a proprietary 
database that contains sequences published in patents and patent publication, yielded 
several homologous proteins shown in Table 85C. 



Table 85C. Geneseq Results for NOV85a 


Geneseq 
Identifier 


Protein/Organism/Length 
[Patent #, Date] 


NOV85a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Region 


Expect 
Value 


AAM79863 


Human protein SEQ ID NO 3509 - 
Homo sapiens, 2127 aa. 
[WO200157190-A2, 09-AUG- 
2001] 


602..1287 
168..847 


218/692 (31%) 
347/692 (49%) 


le-89 


AAM79862 


Human protein SEQ ID NO 3508 - 
Homo sapiens, 2127 aa. 
[WO200157190-A2, 09-AUG- 
2001] 


602..1287 
168..847 


218/692 (31%) 
347/692 (49%) 


le-89 


AAM78879 


Human protein SEQ ID NO 1541 - 
Homo sapiens, 2143 aa. 
[WO200157190-A2, 09-AUG- 
2001] 


602..1287 
108..787 


218/692 (31%) 
347/692 (49%) 


le-89 


AAM78878 


Human protein SEQ ID NO 1540 - 
Homo sapiens, 2067 aa. 
[WO200157190-A2, 09-AUG- 
2001] 


602.. 1287 
108..787 


218/692 (31%) 
347/692 (49%) 


le-89 


AAM80293 


Human protein SEQ ID NO 3945 - 
Homo sapiens, 1774 aa. 
[WO200157190-A2, 09-AUG- 
2001] 


910..1293 
33. .405 


153/393 (38%) 
227/393 (56%) 


5e-70 



In a BLAST search of public sequence databases, the NOV85a protein was found to 
have homology to the proteins shown in the BLASTP data in Table 85D. 



346 



Table 85D. Public BLASTP Results for NOV85a 


Protein 
Accession 
Number 


Protein/Organism/Length 


NOV85a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Portion 


Expect 
Value 


AAL37427 


CILIARY DYNEIN HEAVY 
CHAIN 7 - Homo sapiens 
(Human), 4024 aa. 


628.. 1293 
1975..2655 


271/692 (39%) 
395/692 (56%) 


e-132 


Q27812 


DYNE IN III WY CHAIN 
ISOTYPE7B (EC 3.6.1.3) - 
Tripneustes gratilla (Hawaian sea 
urchin), 1314 aa (fragment). 


601. .1247 
654..1310 


264/667 (39%) 
389/667 (57%) 


e-127 


Q9MBF8 


1 BETA DYNEIN HEAVY 
CHAIN - Chlamydomonas 
reinhardtii, 4513 aa. 


611..1293 
2486..3159 


257/693 (37%) 
377/693 (54%) 


e-117 


Q9VJC6 


DHC36C PROTEIN - Drosophila 
melanogaster (Fruit fly), 4010 aa. 


596..1275 
1913..2604 


249/699 (35%) 
383 '699 (54%) 


e-116 


Q9VWZ3 


DHC16F PRO! EIN - Drosophila 
melanogaster (Fruit fly), 408 1 aa. 


618..1301 
2022..2709 


248/704 (35%) 
380/704 (53%) 


e-108 



PFam analysis predicts that the NOV85a protein contains the domains shown in the 
Table 85E. 



Table 85E. Domain Analysis of NOV85a 



Pfam Domain 


NOV85a Match 
Region 


Identities/ 
Similarities 
for the Matched 
Region 


Expect 
Value 


pkinase: domain 1 of 
1 


4..250 


80/286 (28 c ! ) 
190/286(66%) 


6.8e-62 


DEAD: domain 1 of 1 


613..637 


7/25 (28%) 
22/25 (88%) 


0.83 


dNK: domain 1 of 1 


8(>5..1020 


32/179(18%) 
101/179(56%) 


6.8 



Example 86. 



The NOV86 clone was analyzed, and the nucleotide and predicted polypeptide 
sequences are shown in Table 86 A. 
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Table 86A. NOV86 Sequence Analysis 




SEQIDNO:241 [ 1420 bp 


l NOV86aT^" 

CG59628-01 DNA Sequence 


ii i i- r t -, i i ~t 


CGAAAGTCATGTTACCCAACACCGGGAGGCTGGCAGGATGTACAGTTTTTATCACAGG 
TGCAAGCCGTGGCATTGGCAAAGCTATTGCATTGAAAGCAGCAAAGGATGGAGCAAAT 
ATTGTTATTGCTGCAAAGACCGCCCAGCCACATCCAAAACTTCTAGGCACAATCTATA 
CTGCTGCTGAAGAAATTGAAGCAGTTGGAGGAAAGGCCTTGCCATGTATTGTTGATGT 
GAGAGATGAACAGCAGATCAGTGCTGCAGTGGAGAAAGCCATCAAGAAATTTGGAGGA 
ATTGATATTCTGGTAAATAATGCCAGTGCCATTAGTTTGACCAATACATTGGACACAC 
CTACCAAGAGATTGGATCTGATGATGAACGTGAACACCAGAGGCACCTACCTTGCATC 
TAAAGCATGTATTCCTTATTTGAAAAAGAGCAAAGTTGCTCATATCCTCAATATCAGT 
CCACCACTGAACCTAAATCCAGTTTGGTTCAAACAGCACTGTGCTTATACCATTGCTA 
AGTATGGTATGTCTATGTATGTGCTTGGAATGGCAGAAGAATTTAAAGGTGAAATTGC 
AGTCAATGCATTATGGCCTAAAACAGCCATACACACTGCTGCTATGGATATGCTGGGA 
GGACCTGGTATCGAAAGCCAGTGTAGAAAAGTTGATATCATTGCAGATGCAGCATATT 
CCATTTTCCAAAAGCCAAAAAGTTTTACTGGCAACTTTGTCATTGATGAAAATATCTT 
AAAAGAAGAAGGAATAGAAAATTTTGACGTTTATGCAATTAAACCAGGTCATCCTTTG 
CAACCAGATTTCTTCTTAGATGAATACCCAGAAGCAGTTAGCAAGAAAGTGGAATCAA 




TGGAGCTGTGGAAGAAACATTTAGAATTGTTAAGGACTCTCTCAGTGATGATGTTGTT 
AAAGCCACTCAAGCAATCTATCTGTTTGAACTCTCCGGTGAAGATGGTGGCACGTGGT 
TTCTTGATCTGAAAAGCAAGGGTGGGAATGTCGGATATGGAGAGCCTTCTGATCAGGC 
AGATGTG GTGATGAGTATGACT ACTGATGAC TTTG TAAAAATGTTTT CAGGTAAACTA 
AAACCAACAATGGCATTCATGTCAGGGAAATTGAAGATTAAAGGTAACATGGCCCTAG 
CAATCAAATTGGAGAAGCTAATGAATCAGATGAATGCCAGACTGTGAAGGAAAATATA 
AAAAAAAAGTCGACTGCTATGCTCAAAAAGTAAAAAAAGCTCAACAGTTAAAATCTAA 




ORF Start: ATG at 67 ORF Stop: TGA at 1 32 1 


jSEQIDNO: 242 41 8 aa |MW at 45394.2kD 


NOV86a, 

CG59628-01 Protein Sequence 


MLPNTGRLAGCTVFITGASRGIGKAIALKAAKDGANIVIAAKTAQPHPKLLGTIYTAA 
EE I EAVGGKALPC I VDVRDEQQI SAAVEKAI KKFGG ID I LVNNASAI SLTNTLDTPTK 
RLDLMMNVNTRGTYLASKACIPYLKKSKVAHILNISPPLNLNPVWFKQHCAYTIAKYG 
MSMYVLGMAEEFKGEIAVNALWPKTAIHTAAMDMLGGPGIESQCRKVDIIADAAYSIF 
QKPKSFTGNFVIDENILKEEGIENFDVYAIKPGHPLQPDFFLDEYPEAVSKKVESTGA 
VPEFKEEKLQLQPKPRSGAVEETFRIVKDSLSDDWKATQAIYLFELSGEDGGTWFLD 
LKSKGGNVGYGEPSDQADWMSMTTDDFVKMFSGKLKPTMAFMSGKLKIKGNMALAIK 
LEKLMNQMNARL 



Further analysis of the NOV86a protein yielded the following properties shown in 
Table 86B. 



Table 86B. Protein Sequence Properties NOV86a 


['Sort 

analysis: 


0.5500 probability located in endoplasmic reticulum (membrane); 0.5000 
probability located in microbody (peroxisome); 0.1900 probability located in 
lysosome (lumen); 0.1000 probability located in endoplasmic reticulum 
(lumen) 


SignalP 
analysis: 


No Known Signal Sequence Predicted 



A search of the NOV86a protein against the Geneseq database, a proprietary 
database that contains sequences published in patents and patent publication, yielded 
5 several homologous proteins shown in Table 86C. 
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Table 86C. Geneseq Rest 


ItsforNOV86a 




Geneseq 
Identifier 


Protein/Organism/Length 
[Patent #, Date] 


NOV86a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Region 


Expect 
Value 


AAG81260 ! 


Human AFP protein sequence SEQ 
ID NO: 3 8 - Homo sapiens, 418 aa. 
[WO200129221-A2, 26-APR- 
2001] 


1..418 
1-418 


418/418(100%) 
418/418(100%) 


0.0 


AAB84367 


Amino acid sequence of human 
alcohol dehydrogenase 21612 - 
Homo sapiens, 418 aa. 
[WO2001 44446- A2, 21-JUN-2001] 


1-418 
1..418 


418/418(100%) 
418/418(100%) 


0.0 


AAG81258 


Human AFP protein sequence SEQ 
ID NO:34 - Homo sapiens, 383 aa. 
[WO200129221-A2, 26-APR- 
2001] 


1..382 
1..382 


382/382 (100%) 
382/382 (100%) 


0.0 


ABB 10251 


Human cDNA SEQ ID NO: 559 - 
Homo sapiens, 278 aa. 
[WO200154474-A2, 02-AUG- 
2001] 


141-418 
1..278 


271/278 (97%) 
274/278 (98%) 


e-156 


AAU23020 


Novel human enzyme polypeptide 
#106 - Homo sapiens, 278 aa. 
[WO2001 55301 -A2, 02-AUG- 
2001] 


141..418 
1..278 


271/278 (97%) 
274/278 (98%) 


e-156 



In a BLAST search of public sequence databases, the NOV 86a protein was found to 
have homology to the proteins shown in the BLASTP data in Table 86D. 



Table 86D. Public BLASTP Results for NOV86a 


Protein 
Accession 
Number 


Protein/Organism/Length 


NOV86a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Portion 


Expect 
Value 


CAC38510 


SEQUENCE 37 FROM 
PATENT WOO 129221 - Homo 
sapiens (Human), 418 aa. 


4^ 4^ 

OO OO 


418/418(100%) 
418/418(100%) 


0.0 


CAC38508 


SEQUENCE 33 FROM 
PATENT WOO 129221 - Homo 
sapiens (Human), 383 aa. 


1..382 
1..382 




382/382 (100%) 
382/382 (100%) 




0.0 
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Q99LV2 


HYPOTHETICAL 54.9 KDA 
PROTEIN - Mus musculus 
(Mouse), 496 aa. 


1..418 
1..496 


355/496 (71%) 
390/496 (78%) 


0.0 


Q9BT58 


SIMILAR TO RIKEN CDNA 
2610207116 GENE - Homo 
sapiens (Human), 345 aa. 


163..418 
90..345 


253/256 (98%) 
254/256 (98%) 


e-143 


Q9VB10 


CG5590 PROTEIN (GH01709P) 
- Drosophila melanogaster (Fruit 
fly), 412 aa. 


4..418 
3..412 


238/422 (56%) 
300/422 (70%) 


e-128 



PFam analysis predicts that the NOV86a protein contains the domains shown in the 
Table 86E. 



Table 86E. Domain Analysis of NOV86a 


Pfam Domain 


NOV86a Match 
Region 


Identities/ 
Similarities 
for the Matched 
Region 


Expect 
Value 


beta-lactamase: domain 1 
of 1 


222.-236 


4/15 (27%) 
14/15(93%) 


6.5 


adh short: domain 1 of 1 


9..321 


74/339 (22%) 
211/339(62%) 


2.4e-29 


SCP2: domain 1 of 1 


306..415 


41/114(36%) 
87/114(76%) 


1.5e-25 



Example 87. 

The NOV87 clone was analyzed, and the nucleotide and predicted polypeptide 
sequences are shown in Table 87A. 



Table 87 A. NOV87 Sequence Analysis 




SEQIDNO:243 1 888 bp 


NOV87a, 

CG5 95 16-01 DNA Sequence 


TTCAACAAGGGCCCCTCCTACAGGCTCTTGGCGGACGTCCAGAACAGGCTTCTGTTCA 
AATATGACTCCCAGAAGGAGGCAGAGCTCCGCAGCTGGATCAAGGGATTCACTGGCCT 
CTCCATCCGCCCCGACTTCCAGAAGGGCCTGAAGGACGGGATTATTTTATGCACACTC 
GTGAACAAACTGCAGCCGGGCTCAGTCCCCAAGATCAACGGCTTCCGTGTAGAACTGG 
CACCAGCTAGAAAACCTCTCCAACATCCTCAAGGCAATGGTCAGCTACGGCATGATCC 
CGTGGACCTATTTGAGGCCAACGACCTGTTTGAGAGTGGGAACAATATGCAGGTGCGG 
GTGTCTCTTCTCGCCCTGGCAGGGAAGGCCAAGACTAAGGGGCTGCAGAGCGGGGTGG 
ACATCCGTGACAAGTACTCAGAGAAGCAGAACTTCAACGACACCACCATGAAGGCCAG 
GCTGTGCGTCATCCGGCTGCAGATTACCAACAAATGTGCCAGCCAGTCAGGCATGACC 
GCATACGTCACGAGGAGGCATCTCTACGACCCCAAGAACCGCATCCTGCCCCCCATGG 
ACAACTCGACCATCAGCCTCCGGATGGGTACAAACAAGTGCGCCAGCCAGGTGGGCAT 
GACGGCTCCCGGGAACCAGTGGCACATCTATGACACCAAGTTGGGAATCGACAAGTGT 
GAGAACTCCTCCATGTCCCTGAAGATGGGCTACACGCAGGTCGCCAATCACAGCAGAC 
AGGTCTTTGGCCTAGGCCGGCAAATATATGAACCCAAGTACCAGCCGGGTGGCCCAGT 
GGCCCACGGGGCTCCCTCCGCCGGCAACTGCCCAGGGCCAGGGGAGGCCCCTTAGTAC 
CAGGAGGAGACCAGCTAC 


jORF Start: TTC at 1 (ORF Stop: TAG at 865 
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SEQIDNO:244 (288 aa |MW at 31831.2kD 


NOV87a, 

CG59516-01 Protein Sequence 


FNKGPSYRLLADVQNRLLFKYDSQKEAELRSWIKGFTGLSIRPDFQKGLKDGIILCTL 
VNKLQPGSVPKINGFRVELAPARKPLQHPQGNGQLRHDPVDLFSftNDLFESGNNMQVR 
VSLLALAGKAKTKGLQSGVDIRDKYSEKQNFNDTTMKARLCVIRLQITNKCASQSGMT 
AYVTRRHLYDPKNRILPPMDNSTISLRMGTNKCASQVGMTAPGNQWHIYDTKLGIDKC 
ENSSMSLKMGYTQVANHSRQVFGLGRQIYEPKYQPGGPVAHGAPSAGNCPGPGEAP 




SEQIDNO:245 1 888 bp 


NOV87b, 

CG595 16-02 DNA Sequence 


TTCAACAAGGGCCCCTCCTACAGGCTCTTGGCGGACGTCCAGAACAGGCTTCTGTTCA 
AATATGACTCCCAGAAGGAGGCAGAGCTCCGCAGCTGGATCAAGGGATTCACTGGCCT 
CTCCATCCGCCCCGACTTCCAGAAGGGCCTGAAGGACGGGATTATTTTATGCACACTC 
GTGAACAAACTGCAGCCGGGCTCAGTCCCCAAGATCAACGGCTTCCGTGTAGAACTGG 
CACCAGCTAGAAAACCTCTCCAACATCCTCAAGGCAATGGTCAGCTACGGCATGATCC 
CGTGGACCTATTTGAGGCCAACGACCTGTTTGAGAGTGGGAACAATATGCAGGTGCGG 
GTGTCTCTTCTCGCCCTGGCAGGGAAGGCCAAGACTAAGGGGCTGCAGAGCGGGGTGG 
ACATCCGTGACAAGTACTCAGAGAAGCAGAACTTCAACGACACCACCATGAAGGCCAG 
GCTGTGCGTCATCCGGCTGCAGATTACCAACAAATGTGCCAGCCAGTCAGGCATGACC 
GCATACGTCACGAGGAGGCATCTCTACGACCCCAAGAACCGCATCCTGCCCCCCATGG 
ACAACTCGACCATCAGCCTCCGGATGGGTACAAACAAGTGCGCCAGCCAGGTGGGCAT 
GACGGCTCCCGGGAACCAGTGGCACATCTATGACACCAAGTTGGGAATCGACAAGTGT 
GAGAACTCCTCCATGTCCCTGAAGATGGGCTACACGCAGGTCGCCAATCACAGCAGAC 
AGGTCTTTGGCCTAGGCCGGCAAATATATGAACCCAAGTACCAGCCGGGTGGCCCAGT 
GGCCCACGGGGCTCCCTCCGCCGGCAACTGCCCAGGGCCAGGGGAGGCCCCTTAGTAC 
CAGGAGGAGACCAGCTAC 




ORF Start: TTC at 1 


ORF Stop: TAG at 865 




SEQ ID NO: 246 


288 aa |MW at 31831.2kD 


NOV87b, 

CG595 16-02 Protein Sequence 


FNKGPSYRLLADVQNRLL.FKYDSQKEAELRSWIKGFTGLSIRPDFQKGLKDGIILCTL 
VNKLQPGSVPKINGFRVELAPARKPLQHPQGNGQLRHDPVDLFEANDLFESGNMMQVR 
VSLLALAGKAKTKGLQSGVDIRDKYSEKQNFNDTTMKARLCVIRLQITNKCASQSGMT 
AYVTRRHLYDPKNRILPPMDNSTISLRMGTNKCASQVGMTAPGNQWHIYDTKLGIDKC 
t i r HSF'.iVFGI EI c i :pg; 3E = 



Sequence comparison of the above protein sequences yields the following sequence 
O relationships shown in Table 87B. 

ru 



Table 87B. Comparison of NOV87a against NOV87b. 


Protein Sequence 


NOV87a Residues/ 
Match Residues 


Identities/ 
Similarities for the Matched Region 


NOV87b 


1..288 
1..288 


288/288 (100%) 
288/288 ( 100%) 



Further analysis of the NOV87a protein yielded the following properties shown in 
Table 87C. 



Table 87C. Protein Sequence Properties NOV87a 


PSort 
analysis: 


0.4500 probability located in cytoplasm; 0.3000 probability located in 
microbody (peroxisome); 0.2110 probability located in lysosome (lumen); 
0.1000 probability located in mitochondrial matrix space 


SignalP 
analysis: 


No Known Signal Sequence Predicted 
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A search of the NOV87a protein against the Geneseq database, a proprietary 
database that contains sequences published in patents and patent publication, yielded 
several homologous proteins shown in Table 87D. 



Table 87D. Geneseq Results for NOV87a 


Geneseq 
Identifier 


Protein/Organism/Length [Patent 
#, Date] 


NOV87a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Region 


Expect 
Value 


AAR94888 


Carponin - Homo sapiens, 297 aa. 
[JP08073380-A, 19-MAR-1996] 


1..265 
6..272 


136/269 (50%) 
176/269 (64%) 


7e-63 


AAR72588 


Carponin protein - Homo sapiens, 
297 aa. [WO9509010-A, 06-APR- 
1995] 


I ..265 
6..272 


136/269 (50%) 
176/269 (64%) 


7e-63 


AAB43807 


Human cancer associated protein 
sequence SEQ ID NO: 1252 - Homo 
sapiens, 163 aa. [WO200055350- 
Al,21-SEP-2000] 


1 64..273 
4..116 


67/113 (59%) 
82/113 (72%) 


6e-30 


! AAM73074 


Human bone marrow expressed 
probe encoded protein SEQ ID NO: 
33380 - Homo sapiens, 71 aa. 
[WO200157276-A2, 09-AUG-2001] 


157..225 
2..71 


49/70 (70%) 
55/70 (78%) 


4e-21 


AAM60434 


Human brain expressed single exon 
probe encoded protein SEQ ID NO: 
32539 - Homo sapiens, 71 aa. 
[WO200157275-A2, 09-AUG-2001] 


157..225 
2..71 


49/70 (70%) 
55/70 (78%) 


4e-21 



In a BLAST search of public sequence databases, the NOV87a protein was found to 
have homology to the proteins shown in the BLASTP data in Table 87E. 



Table 87E. Public BLASTP Results for NOV87a 


Protein 
Accession 
Number 


Protein/Organism/Length 


NOV87a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Portion 


Expect 
Value 


Q08094 


Calponin H2, smooth muscle - Sus 
scrofa (Pig), 296 aa (fragment). 


1..287 
6..296 


219/291 (75%) 
237/291 (81%) 


e-116 


Q99439 


Calponin H2, smooth muscle 
(Neutral calponin) - Homo sapiens 
(Human), 309 aa. 


1..288 
6..297 

L_ 


218/292 (74%) 
235/292 (79%) 


e-115 
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Q08093 


Calponin H2, smooth muscle - 
Mus musculus (Mouse), 305 aa. 


1..288 
6..293 


214/291 (73%) 
231/291 (78%) 


e-112 


093547 


CALPONIN H3 - Xenopus laevis 
(African clawed frog), 295 aa. 


1..269 
5..276 


179/273 (65%) 
208/273 (75%) 


6e-91 


Q922F8 


UNKNOWN (PROTEIN FOR 
MGC:8135) - Mus musculus 
(Mouse), 242 aa. 


59..2S8 
1..230 


166/233 (71%) 
179/233 (76%) 


8e-83 



PFam analysis predicts that the NOV87a protein contains the domains shown in the 
Table 87F. 



Table 87F. Domain Analysis of NOV87a 


Pfam Domain 


NOV87a Match 
Region 


Identities/ 
Similarities 
for the Matched 
Region 


Expect 
Value 


CH: domain 1 of 1 


23..123 


27/124(22%) 
65/124 (52%) 


0.068 


calponin: domain 1 of 

2 


159..183 


17/26 (65%) 
21/26 (81%) 


3.8e-07 


calponin: domain 2 of 
2 

~_ — - ■ 


198..223 


15/26 (58%) 
19/26(73%) 


3e-08 



Example 88. 



The NOV88 clone was analyzed, and the nucleotide and predicted polypeptide 
5 sequences are shown in Table 88A. 



Table 88A. NOV88 Sequence Analysis 




SEQIDNO:247 12213 bp 


NOV88a, 

CG59671-02 DNA Sequence 


CGTGAGGCACCCACTCTGGGAGCACAGAGAGCTCAGGTAGCCTGCCTAGATGGCGGCG 
CGCACCCTGGGCCGCGGCGTCGGGAGGCTGCTGGGCAGCCTGCGAGGGCTCTCGGGGC 
AGCCCGCGCGGCCGCCGTGCGGGGTGAGCGCGCCGCGCAGGGCGGCCTCGGGACCCTC 
GGGCAGCGCTCCCGCAGTTGCAGCAGCAGCAGCACAGCCAGGCTCGTATCCCGCGCTG 
AGTGCACAGGCAGCCCGGGAGCCGGCCGCCTTCTGGGGGCCTCTGGCGCGGGACACTC 
TCGTGTGGGACACCCCCTACCACACCGTCTGGGACTGCGACTTCAGCACTGGCAAGAT 
CGGCTGGTTCCTGGGAGGCCAGTTAAATGTCTCTGTCAACTGCTTGGACCAGCATGTT 
CGGAAGTCCCCCGAGAGCGTTGCTTTGATCTGGGAGCGCGATGAGCCTGGAACGGAAG 
TGAGGATCACCTACAGGGAACTACTGGAGACCACGTGCCGCCTGGCCAACACGCTGAA 
GAGGCATGGAGTCCACCGTGGGGACCGTGTTGCCATCTACATGCCCGTGTCCCCATTG 
GCTGTGGCAGCAATGCTGGCCTGTGCCAGGATCGGAGCTGTCCACACAGTCATCTTTG 
CTGGCTTCAGTGCAGAGTCCTTGGCTGGGAGGATCAATGATGCCAAGTGCAAGGTGGT 
TATCACCTTCAACCAAGGACTCCGGGGTGGGCGCGTGGTGGAGCTGAAGAAAATAGTG 
GATGAGGCTGTGAAGCACTGCCCCACCGTGCAGCATGTCCTGGTGGCTCACAGGACAG 
ACAACAAGGTCCACATGGGGGATCTGGACGTCCCGCTGGAGCAGGAAATGGCCAAGGA 
GGACCCTGTTTGCGCCCCAGAGAGCATGGGCAGTGAGGACATGCTCTTCATGCTGTAC 
ACCTCAGGGAGCACCGGAATGCCCAAGGGCATCGTCCATACCCAGGCAGGCTACCTGC 
TCTATGCCGCCCTGACTCACAAGCTTGTGTTTGACCACCAGCCAGGTGACATCTTTGG 






TGCAATGGTGCCACCAGCGTCCTTTTTGAGAGCACCCCAGTTTATCCCAATGCTGGTC 
GGTACTGGGAGACAGTAGAGAGGTTGAAGATCAATCAGTTCTATGGCGCCCCAACGGC 
T< T C(,3 TrTTGCTGAAATACGGTGATGCCTGGGTGAAGAAGTATGATCGCTCCTCC 
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CTGCGGACCCTGGGGTCAGTGGGAGAGCCCATCAACTGTGAGGCCTGGGAGTGGCTTC 
ACAGGGTGGTGGGGGACAGCAGGTGCACGCTGGTGGACACCTGGTGGCAGACAGAAAC 
AGGTGGCATCTGCATCGCACCACGGCCCTCGGAAGAAGGGGCGGAAATCCTCCCTGCC 




TCGTGGAGGGCAGCAACGTCTCre 

GGCCAGGACCATCTATGGCGACCACCAGCGATTTGTGGACGCCTACTTCAAGGCCTAC 
CCAGGCTATTACTTCACTGGAGACGGGGCTTACCGAACTGAGGGCGGCTATTACCAGA 
TCACAGGGCGGATGGATGATGTCATCAACATCAGTGGCCACCGGCTGGGGACCGCAGA 
GATTGAGGACGCCATCGCCGACCACCCTGCAGTACCAGAAAGTGCTGTCATTGGCTAC 
CCCCACGACATCAAAGGAGAAGCTGCCTTTGCCTTCATTGTGGTGAAAGATAGTGCGG 
GTGACTCAGATGTGGTGGTGCAGGAGCTCAAGTCCATGGTGGCCACCAAGATCGCCAA 
ATATGCTGTGCCTGATGAGATCCTGGTGGTGAAACGTCTTCCAAAAACCAGGTCTGGG 
AAGGTCATGCGGCGGCTCCTGAGGAAGATCATCACTAGTGAGGCCCAGGAGCTGGGAG 
ACACTACCACCTTGGAGGACCCCAGCATCATCGCAGAGATCCTGAGTGTCTACCAGAA 
GTGCAAGGACAAGCAGGCTGCTGCTAAGTGAGCTGGCACCTTGTGGGGCTCTTGGGAT 
GGGCGGGCACCCAAGCCCTGGCTTGTCCTTCCCAGAAGGTACCCCTGAGGTTGGCGTC 






jORF Start: ATG at 50|ORF Stop: TGA at 21 17 


SEQ ID NO: 248 689 aa M\V at 74S55.9kD 


NOV88a, 

CG59671-02 Protein Sequence 


MAARTLGRGVGRLLGSLRGLSGQPARPPCGVSAPRRAASGPSGSAPAVAAAAAQPGSY 
PALSAQAAREPAAFWGPLARDTLVWDTPYHTVWDCDFSTGKIGWFLGGQLNVSVNCLD 
QHVRKSPESVALIWERDEPGTEVRITYRELLETTCRLANTLKRHGVHRGDRVAIYMPV 
S PLAVAAMLACAR I GAVHTVI FAGFSAESLAGRI NDAKCKWI TFNQGLRGGRWELK 
KIVDEAVKHCPTVQHVLVAHRTDNKVHMGDLDVPLEQEMAKEDPVCAPESMGSEDMLF 
MLYTSGSTGMPKGIVHTQAGYLLYAALTHKLVFDHQPGDIFGCVADIGWITGHSYWY 
GPLCNGATSVLFESTPVYPNAGRYWETVERLKINQFYGAPTAVRLLLKYGDAWVKKYD 
RSSLRTLGSVGEPINCEAWEWLHRWGDSRCTLVDTWWQTETGGICIAPRPSEEGAEI 
LPAMAMRPFFGIVPVLMDEKGSWEGSNVSGALCISQAWPGMARTIYGDHQRFVDAYF 
KAYPGYYFTGDGAYRTEGGYYQITGRMDDVINISGHRLGTAEIEDAIADHPAVPESAV 
IGYPHDIKGEAAFAFIWKDSAGDSDVWQELKSMVATKIAKYAVPDEILWKRLPKT 
r 1 T I 2A;'3L 3DTTTLEDF I TJ J i Z ? 



Further analysis of the NOV88a protein yielded the following properties shown in 
Table 88B. 



Table SSIi. Protein Sequence Properties l\OVSN;i 


PSort 
analysis: 


0.6500 probability located in plasma membrane; 0.6000 probability located in 
nucleus; 0.4340 probability located in mitochondrial inner membrane; 0.3000 
probability located in Golgi body 


SignalP 
analysis: 


Likely cleavage site between residues 23 and 24 



A search of the NOV88a protein against the Geneseq database, a proprietary 
database that contains sequences published in patents and patent publication, yielded 
several homologous proteins shown in Table 88C. 



354 



Table 88C. Geneseq Results for NOV88a 


Geneseq 
Identifier 


Protein/Organism/Length 
[Patent #, Date] 


NOV88a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Region 


Expect 
Value 


AAU23058 


Novel human enzyme polypeptide 
#144 - Homo sapiens, 664 aa. 
[WO200155301-A2, 02-AUG- 
2001] 


26..689 
1..664 


663/664 (99%) 
663/664(99%) 


0.0 


AAB34712 


Human secreted protein encoded 
by DNA clone vo9 1 - Homo 
sapiens, 518 aa. [WO200055375- 
Al,21-SEP-2000] 


172..689 
1..518 


518/518(100%) 
518/518(100%) 


0.0 


AAU23050 


Novel human enzyme polypeptide 
#136 - Homo sapiens, 479 aa. 
[WO200155301-A2, 02-AUG- 
2001] 


224..689 
18..479 


459/466 (98%) 
461/466 (98%) 


0.0 


ABB 12253 


Human acetate-coA ligase 
homologue, SEQ ID NO:2623 - 
Homo sapiens, 446 aa. 
[WO200157188-A2, 09-AUG- 
2001] 


1 ..446 
1..446 


446/446(100%) 
446/446(100%) 


0.0 


AAR23968 


facA gene product - Penicillium 
chrysogenum, 669 aa. 
[WO9207079-A, 30-APR-1992] 


58..684 
45. .669 


305/629 (48%) 
407/629 (64%) 


e-175 



In a BLAST search of public sequence databases, the NOV88a protein was found to 
have homology to the proteins shown in the BLASTP data in Table 88D. 



Table 88D. Public BLASTP Results for NOV88a 


Protein 
Accession 
Number 


Protein/Organism/Length 


NOV88a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Portion 


Expect 
Value 

.. _ 


Q99NB1 


ACETYL-COA SYNTHETASE 2 - 
Mus musculus (Mouse), 682 aa. 


1..687 
1..680 


599/687 (87%) 
638/687 (92%) 


0.0 


Q9BEA3 


ACETYL-COA SYNTHETASE 2 - 
Bos taurus (Bovine), 675 aa. 


I. .689 
1 ..675 


575/689 (83%) 
625/689 (90%) 


0.0 


Q9NUB1 


l)J5f>8C'l H (NO\ f I . AMP- 
BINDING ENZYME SIMILAR TO 
ACETYL-COENZYME A 


212..689 
1..478 


478/478(100%) 
478/478(100%) 


0.0 
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LIGASE)) - Homo sapiens (Human), 
478 aa (fragment). 








Q96JI1 


KIAA1846 PROTEIN - Homo 
sapiens (Human), 354 aa (fragment). 


336.-689 
1..354 


354/354 (100%) 
354/354 (100%) 


0.0 


Q9HV66 


ACETYL-COENZYME A 
SYNTHETASE - Pseudomonas 
aeruginosa, 645 aa. 


58..675 
24..639 


326/619(52%) 
433/619 (69%) 


0.0 



PFam analysis predicts that the NOV88a protein contains the domains shown in the 
Table 88E. 



Table 88E. Domain Analysis of NOV88a 


Pfam Domain 


NOV88a Match 
Region 


Identities/ 
Similarities 
for the Matched 
Region 


Expect 
Value 


AMP-binding: domain 1 
of 1 


142..580 


121/441 (27%) 
341/441 (77%) 


7.1e-117 



Example 89. 



The NOV89 clone was analyzed, and the nucleotide and predicted polypeptide 
sequences are shown in Table 89A. 



Table 89A. NOV89 Sequence Analysis 




SEQIDNO:249 1 1268 bp 


NOV89a, 

CG56870-01 DNA Sequence 


ACTTCTTTCTTTTCTGTTTCAGAGTTACTGATTTATTCTTGAGATTCCTCTACTCTCG 


TTATCTGACCTCATGGATGAACTTCAGGATGTTCAGCTCACAGAGATCAAACCACTTC 

AGAAACAACTCATGGTGTGGTCCACGTCACTATAAGAGGCTTACCCAAAGGAAACAGA 
CCAGTTATACTAACATATCATGACATTGGCCTCAACCGTAAATCCTGTTTCAATGCAT 
TCTTTAACTTTGAGGATATGCAAGAGATCACCCAGCACTTTGCTGTCTGTCATGTGGA 
TGCCCCAGGCCAGCAGGAAGGTGCACCCTCTTTCCCAACAGGGTATCAGTACCCCACA 
ATGGATGAGCTGGCTGAAATGCTGCCTCCTGTTCTTACCCACCTAAGCCTGAAAAGCA 
TCATTGGAATTGGAGTTGGAGCTGGAGCTTACATCCTCAGCAGATTTGCACTCAACCA 

ATTGACTGGGCAGCTTCCAAACTCTCTGGCCTGACAACCAATGTTGTGGACATTATTT 
TGGCTCATCACTTTGGGCAGGAAGAGTTACAGGCCAACCTGGACCTGATCCAAACCTA 
CAGAATGCATATTGCCCAAGACATCAACCAAGACAACCTGCAGCTCTTCTTGAATTCC 
TACAATGGGCGCAGAGACCTGGAGATCGAAAGACCCATACTGGGCCAAAATGATAACA 
AATCAAAAACATTAAAGTGTTCTACTTTACTGGTGGTAGGGGACAATTCGCCTGCAGT 
TGAGGCTGTGGTCGAATGCAATTCCCGCCTGAACCCTATAAATACAACTTTGCTAAAG 
ATGGCGGACTGTGGGGGACTGCCCCAGGTAGTTCAGCCTGGGAAGCTCACCGAGGCCT 
TCAAGTACTTTTTGCAGGGAATGGGCTACGTCCCGTCTGCCAGCATGACTCGGCTCGC 
CCGATCACGAACCCACTCAACCTCGAGTAGCCTCGGCTCTGGAGAAAGTCCCTTCAGC 
CGGTCTGTCACCAGCAATCAGTCAGATGGAACTCAAGAATCCTGTGAGTCCCCTGATG 
TCCTGGACAGACACCAGACCATGGAGGTGTCCTGCTAAGCAGATGCTCCTCCCCTGGA 
CCATTGCAAGTCCATCCTTCAAATGACCACTCCATAATATAACATTTCAT 




ORE Start: ATG at 71 jORF Stop: TAA at 1 196 




SEQIDNO:250 |375 aa TmW at41413.3kD 


NOV89a, 

CG56870-01 Protein Sequence 


MDELQDVQLTEIKPLLNDKNGTRNFQDFDCQEHDIETTHGWHVTIRGLPKGNRPVIL 
TYHDIGLNRKSCFNAFFNFEDMQEITQHFAVCHVDAPGQQEGAPSFPTGYQYPTMDEL 
AEML ??VLTHLSLKS I IG I GVGAGAY I LSRFAL^H PELVSGLVL I NVDP CAKGWI DWA 
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NOV89b, 

CG5 6870-02 DNA Sequence 



TTCTAAATGATAAGAATGGTACAAGAAACTTCCAGGACTTTGACTGTCAGGAACATGA 
TATAGAAACAACTCATGGTGTGGTCCACGTCACTATAAGAGGCTTACCCAAAGGAAAC 
AGACCAGTTATACTAACATATCATGACATTGGCCTCAACCATAAATCCTGTTTCAATG 
CATTCTTTAACTTTGAGGATATGCAAGAGATCACCCAGCACTTTGCTGTCTGTCATGT 
GGATGCCCCAGGCCAGCAGGAAGGTGCACCCTCTTTCCCAACAGGGTATCAGTACCCC 
ACAATGGATGAGCTGGCTGAAGTGCTGCCTCCTGTTCTTACCCACCTAAGCCTGAAAA 
GCATCATTGGAATTGGAGTTGGAGCTGGAGCTTACATCCTCAGCAGATTTGCACTCAA 
CCATCCAGAGCTTGTGGAAGGCCTTGTGCTCATTAATGTTGACCCTTGCGCTAAAGGC 
TGGATTGACTGGGCAGCTTCCAAACTCTCTGGCCTGACAACCAATGTTGTGGACATTA 
TTTTGGCTCATCACTTTGGGCAGGAAGAGTTACAGGCCAACCTGGACCTGATCCAAAC 
CTACAGAATGCATATTGCCCAAGACATCAACCAAGACAACCTGCAGCTCTTCTTGAAT 
TCCTACAATGGACGCAGAGACCTGGAGATCGAAAGACCCATACTGGGCCAAAATGATA 
ACAAATCAAAAACATTAAAGTGTTCTACTTTACTGGTGGTAGGGGACAATTCGCCTGC 
AGTTGAGGCTGTGGTCGAATGCAATTCCCGCCTGAACCCTATAAATACAACTTTGCTA 
AAGATGGCGGACTGTGGGGGACTGCCCCAGGTAGTTCAGCCTGGGAAGCTCACCGAGG 
CCTTCAAGTACTTTTTGCAGGGAATGGGCTACATACCATCTGCCAGCATGACTCGGCT 
CGCCCGATCACGAACCCACTCAACCTCGAGTAGCCTCGGCTCTGGAGAAAGTCCCTTC 
AGCCGGTCTGTCACCAGCAATCAGTCAGATGGAACTCAAGAATCCTGTGAGTCCCCTG 

GGACCATTGCAAGTC 



NOV89b, 
CG56870-02 Protein Sequence 



NOV89c, 

CG56870-03 DNA Sequence 



ASKLSGLTTNWDI ILAHHFGQEELQANLDLIQTYRMHIAQDINQDNLQLFLNSYNGR 
RDLEIERPILGQNDNKSKTLKCSTLLWGDNSPAVEAWECNSRLNPINTTLLKMADC 
GGLPQWQPGKLTEAFKYFLQGMGYVPSASMTRLARSRTHSTSSSLGSGESPFSRSVT 
SNQSDGTQESCESPDVLDRHQTMEVSC 



TCGTTATCTGACCTCA TGGATGAACTTCAGGATGTTCAGCTCACAGAGATCAAACCAC 



SEQ ID NO: 252 



ORF Start: ATG at 16 ORF Stop: TAA at 1 141 



375 aa MW at 41376.2kD 



MDELQDVQLTEIKPLLNDKNGTRNFQDFDCQEHDIETTHGWHVTIRGLPKGNRPVIL 
TYHD I GLNHKSCFNAFFNFEDMQE I TQHFAVCHVDAPGQQEGAPS FPTG YQYPTMDEL 
AEVLPPVLTHLSLKSI IGIGVGAGAYILSRFALNHPELVEGLVLINVDPCAKGWIDWA 
ASKLSGLTTNWDI ILAHHFGQEELQANLDLIQTYRMHIAQDINQDNLQLFLNSYNGR 
RDLEIERPILGQNDNKSKTLKCSTLLWGDNSPAVEAWECNSRLNPINTTLLKMADC 
GGLPQWQPGKLTEAFKYFLQGMGYIPSASMTRLARSRTHSTSSSLGSGESPFSRSVT 
l T ESI R 13 



SEQ ID NO: 253 



; 1232 bp 



ACTTCTTTGTTTTCTGTTTCAGAGTTACTGATTTATTCTTGAGATTCCTCTACTCTCG 



TTATCTGACCTC ATGGATGAACTTCAGGATGTTCAGCTCACAGAGATCAAACCACTTC 
TAAATGATAAGGAACATGATATAGAAACAACTCATGGTGTGGTCCACGTCACTATAAG 
AGGCTTACCCAAAGGAAACAGACCAGTTATACTAACATATCATGACATTGGCCTCAAC 
CATAAATCCTGTTTCAATGCATTCTTTAACTTTGAGGATATGCAAGAGATCACCCAGC 
ACTTTGCTGTCTGTCATGTGGATGCCCCAGGCCAGCAGGAAGGTGCACCCTCTTTCCC 
AACAGGGTATCAGTACCCCACAATGGATGAGCTGGCTGAAATGCTGCCTCCTGTTCTT 
ACCCACCTAAGCCTGAAAAGCATCATTGGAATTGGAGTTGGAGCTGGAGCTTACATCC 
TCAGCAGATTTGCACTCAACCATCCAGAGCTTGTGGAAGGCCTTGTGCTCATTAATGT 
TGACCCTTGCGCTAAAGGCTGGATTGACTGGGCAGCTTCCAAACTCTCTGGCCTGACA 
ACCAATGTTGTGGACATTATTTTGGCTCATCACTTTGGGCAGGAAGAGTTACAGGCCA 
ACCTGGACCTGATCCAAACCTACAGAATGCATATTGCCCAAGACATCAACCAAGACAA 
CCTGCAGCTCTTCTTGAATTCCTACAATGGGCGCAGAGACCTGGAGATCGAAAGACCC 
ATACTGGGCCAAAATGATAACAAATCAAAAACATTAAAGTGTTCTACTTTACTGGTGG 
TAGGGGACAATTCGCCTGCAGTTGAGGCTGTGGTCGAATGCAATTCCCGCCTGAACCC 
TATAAATACAACTTTGCTAAAGATGGCGGACTGTGGGGGACTGCCCCAGGTAGTTCAG 
CCTGGGAAGCTCACCGAGGCCTTCAAGTACTTTTTGCAGGGAATGGGCTACGTCCCGT 
CTGCCAGCATGACTCGGCTCGCCCGATCACGAACCCACTCAACCTCGAGTAGCCTCGG 
CTCTGGAGAAAGTCCCTTCAGCCGGTCTGTCACCAGCAATCAGTCAGATGGAACTCAA 
GAATCCTGTGAGTCCCCTGATGTCCTGGACAGACACCAGACCATGGAGGTGTCCTGCT 
A &GCAGATGCTCCTCCCCTGGACCATTGCAAGTCCATCCTTCAAATGACCACTCCATA 
ATATAACATTTCAT 



ORF Start: ATG at 71 ORF Stop: TAA at 1 160 



SEQ ID NO: 254 



363 aa MW at 39967.8kD 



NOV89c, 
CG56870-03 Protein Sequence lu 



DIILAHHFGQEELQANLDLIQTYRMHIAQDINQDNLQLFLNSYNGRRDLEIERPILGQ 
NDNKSKTLKCSTLLWGDNSPAVEAWECNSRLNPINTTLLKKADCGGLPQWQPGKL 
TEAFKYFLQGMGYVPSASMTRLARSRTHSTSSSLGSGESPFSRSVTSNQSDGTQESCE 
S PDVLDRHQTMEVS C 



SEQ ID NO: 255 j 1220 bp 
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NOV89d, 

CG56870-04 DNA Sequence 


ACTTCTTTCTTTTCTGTTTCAGAGTTACTGATTTATTCTTGAGATTCCTCTACTCTCG 


TTATCTGACCTCATGGATGAACTTCAGGATGTTCAGCTCACAGAGATCAAACCACTTC 
TAAATGATAAGAATGGTACAAGAAACTTCCAGGACTTTGACTGTCAGGAACATGATAT 
AGAAACAACTCATGGTGTGGTCCACGTCACTATAAGAGGCTTACCCAAAGGAAACAGA 
CCAGTTATACTAACATATCATGACATTGGCCTCAACCGTAAATCCTGTTTCAATGCAT 
TCTTTAACTTTGAGGATATGCAAGAGATCACCCAGCACTTTGCTGTCTGTCATGTGGA 
TGCCCCAGGCCAGCAGGAAGGTGCACCCTCTTTCCCAACAGGGTATCAGTACCCCACA 
ATGGATGAGCTGGCTGAAATGCTGCCTCCTGTTCTTACCCACCTAAGCCTGAAAAGCA 
TCATTGGAATTGGAGTTGGAGCTGGAGCTTACATCCTCAGCAGATTTGCACTCAACCA 
TCCAGAGCTTGTGGAAGGCCTTGTGCTCATTAATGTTGACCCTTGCGCTAAAGGCTGG 
ATTGACTGGGCAGCTTCCAAACTCTCTGGCCTGACAACCAATGTTGTGGACATTATTT 
TGGCTCATCACTTTGGGCAGGAAGAGTTACAGGCCAACCTGGACCTGATCCAAACCTA 
CAGAATGCATATTGCCCAAGACATCAACCAAGACAACCTGCAGCTCTTCTTGAATTCC 
TACAATGGACGCAGAGACCTGGAGATCGAAAGACCCATACTGGGCCAAAATGATAACA 
AATCAAAAACATTAAAGTGTTCTACTTTACTGGTGGTAGGGGACAATTCGCCTGCAGT 
TGAGGCTGTGATGGCGGACTGTGGGGGACTGCCCCAGGTAGTTCAGCCTGGGAAGTTC 
ACCGAGGCCTTCAAGTACTTTTTGCAGGGAATGGGCTACACACCATCTGCCAGCATGA 
CTCGGCTCGCCCGATCACGAACCCACTCAACCTCGAGTAGCCTCGGCTCTGGAGAAAG 
TCCCTTCAGCCGGTCTGTCACCAGCAATCAGTCAGATGGAACTCAAGAATCCTGTGAG 
TCCCCTGATGTCCTGGACAGACACCAGACCATGGAGGTGTCCTGCTAAGCAGATGCTC 
CTCCCCTGGACCATTGCAAGTCCATCCTTCAAATGACCACTCCATAATATAACATTTC 




AT 




ORF Start: ATG at 71 


ORF Stop: TAA at 1148 




SEQ ID NO: 256 


359 aa !MW at 39652.2RD 


NOV89d, 

CG56870-04 Protein Sequence 


MDELQDVQLTEIKPLLNDKNGTRNFQDFDCQEHDIETTHGWHVTIRGLPKGNRPVIL 
TYHDIGLNRKSCFNAFFNFEDMQEITQHFAVCHVDAPGQQEGAPSFPTGYQYPTMDEL 
AEMLPPVLTHLSLKSIIGIGVGAGAYILSRFALNHPELVEGLVLINVDPCAKGWIDWA 
ASKLSGLTTNWDIILAHHFGQEELQANLDLIQTYRMHIAQDINQDNLQLFLNSYNGR 
RDLEIERPILGQNDNKSKTLKCSTLLWGDNSPAVEAVMADCGGLPQWQPGKFTEAF 
KYFLQGMGYTPSASMTRIARSRTHSTSSSLGSGESPFSRSVTSNQSDGTQESCESPDV 
LDRHQTMEVSC 




SEQ ID NO: 257 970 bp 


NOV89e, 

CG56870-05 DNA Sequence 


ATGGATGAACTTCA J-JA7 iTT :A'J : -'A' -A JA7'.\AAA _VA ■ ~T .'TAAATGATAAGA 
ATGGTACAAGAAACTTCCAGGACTTTGACTGTCAGTATCAGTACCCCACAATGGATGA 


ATTGGAGTTGGAGCTGGAGCTTACATCCTCAGCAGATTTGCACTCAACCATCCAGAGC 
TTGTGGAAGGCCTTGTGCTCATTAATGTTGACCCTTGCGCTAAAGGCTGGATTGACTG 
GGCAGCTTCCAAACTCTCTGGCCTGACAACCAATGTTGTGGACATTATTTTGGCTCAT 
CACTTTGGGCAGGAAGAGTTACAGGCCAACCTGGACCTGATCCAAACCTACAGAATGC 
ATATTGCCCAAGACATCAACCAAGACAACCTGCAGCTCTTCTTGAATTCCTACAATGG 
GCGCAGAGACCTGGAGATCGAAAGACCCATACTGGGCCAAAATGATAACAAATCAAAA 
ACATTAAAGTGTTCTACTTTACTGGTGGTAGGGGACAATTCGCCTGCAGTTGAGGCTG 
TGGTCGAATGCAATTCCCGCCTGAACCCTATAAATACAACTTTGCTAAAGATGGCGGA 
CTGTGGGGGACTGCCCCAGGTAGTTCAGCCTGGGAAGCTCACCGAGGCCTTCAAGTAC 
TTTTTGCAGGGAATGGGCTACGTCCCGTCTGCCAGCATGACTCGGCTCGCCCGATCAC 

CACCAGCAATCAGTCAGATGGAACTCAAGAATCCTGTGAGTCCCCTGATGTCCTGGAC 
AGACACCAGACCATGGAGGTGTCCTGCTAAGCAGATGCTCCTCCCCTGGACCATTGCA 
AGTCCATCCTTCAAATGACCACTCCATAATATAACATTTCAT 




ORF Start: ATG at 1 


ORF Stop: TAA at 898 




SEQ ID NO: 258 


299 aa |MW at 32956.9kD 


NOV89e, 

CG56870-05 Protein Sequence 


MDELQDVQLTE I KPLLNDKNGTRNFQDFDCQYQYPTMDELAEMLPPVLTHLSLKS I IG 
IGVGAGAYILSRFALNHPELVEGLiVLINVDPCAKGWIDWAASKLSGLTTNWDIILAH 
HFGQE ELQANLDL I QTYRMHI AQDINQDNLQLFLNS YNGRRDLE IERPI LGQNDNKSK 
TLKCSTLLWGDNSPAVEAWECNSRLNPINTTLLKMADCGGLPQWQPGKLTEAFKY 
FLQGMGYVPSASMTRLARSRTHSTSSSLGSGESPFSRSVTSNQSDGTQESCESPDVLD 
RHQTMEVSC 



Sequence comparison of the above protein sequences yields the following sequence 
relationships shown in Table 89B. 
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Table SOB. Comparison of NOV89a against NOV89b through N( >V89e. 


Protein Sequence 


NOV89a Residues/ 
Match Residues 


Identities/ 
Similarities for the Matched Region 


NOV89b 


1..375 
1..375 


336/375 (89%) 
338/375 (89%) 


NOV89c 


1..375 
1..363 


326/375 (86%) 
326/375 (86%) 


NOV89d 


1..375 
1..359 


321/375 (85%) 
321/375 (85%) 


NOV89e 


104..375 
28..299 


233/272 (85%) 
233/272 (85%) 



Further analysis of the NOV89a protein yielded the following properties shown in 



Table 89C. Protein Sequence Properties NOV89a 



PSort 
analysis: 


0.4500 probability located in cytoplasm; 0.3000 probability located in 
microbody (peroxisome); 0.1685 probability located in lysosome (lumen); 
0.1000 probability located in mitochondrial matrix space 


SignalP 
analysis: 


No Known Signal Sequence Predicted 



A search of the NOV89a protein against the Geneseq database, a proprietary 
database that contains sequences published in patents and patent publication, yielded 
several homologous proteins shown in Table 89D. 



Table 89D. Geneseq Results for NOV89a 


Geneseq 
Identifier 


Protein/Organism/Length [Patent 
#, Date] 


NOV89a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Region 


Expect 
Value 


AAM94019 


Human stomach cancer expressed 
polypeptide SEQ ID NO 108 - 
Homo sapiens, 363 aa. 
[WO200109317-A1, 08-FEB-2001] 


1..375 
1..363 


360/375 (96%) 
361/375 (96%) 


0.0 


AAG64392 


Human reducing agent and 
tunicamycin-responsive protein 40 - 
Homo sapiens, 363 aa. 
[WO200155375-A1, 02-AUG- 
2001] 


1..375 
1..363 


360/375 (96%) 
361/375 (96%) 


0.0 
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AAB94494 


Human protein sequence SEQ ID 
NO:l 51 86 - Homo sapiens, 363 aa. 
[EP1 07461 7-A2, 07-FEB-2001] 


1..375 
1..363 


360/375 (96%) 
361/375 (96%) 


0.0 


AAU31598 


Novel human secreted protein 
#2089 - Homo sapiens, 395 aa. 
LWO200179449-A2, 25-OCT-2001] 


68..374 
1..307 


282/323 (87%) 
286/323 (88%) 


e-154 


AAB95462 


Human protein sequence SEQ ID 
NO: 17944 - Homo sapiens, 286 aa. 
[EP1074617-A2, 07-FEB-2001] 


133..375 
44..286 


240/243 (98%) 
242/243 (98%) 


e-138 



In a BLAST search of public sequence databases, the NOV89a protein was found to 
have homology to the proteins shown in the BLASTP data in Table 89E. 



Table 89E. Public BLASTP Results for NOV89a 



Protein 
Accession 
Number 


Protein/Organism/Length 


NOV89a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Portion 


Expect 
Value 


Q9UGV2 


NDRG3 protein - Homo sapiens 
(Human), 375 aa. 


1..375 
1..375 


373/375 (99%) 
374/375 (99%) 


0.0 


Q96PL8 


NDR1 -RELATED 
DEVELOPMENT PROTEIN NDR3 
- Homo sapiens (Human), 375 aa. 


1..375 
1..375 


372/375 (99%) 
373/375 (99%) 


0.0 


Q9QYF9 


NDRG3 protein (Ndr3 protein) - 
Mus musculus (Mouse), 375 aa. 


., 

1..375 
1..375 


358/375 (95%) 
368/375 (97%) 


0.0 


AAH18504 


SIMILAR TO N-MYC 
DOWNSTREAM REGULATED 3 - 
Mus musculus (Mouse), 388 aa. 


1..375 
1..388 


359/388 (92%) 
368/388 (94%) 


~~oT~~ 


Q96SM2 


CDNA FLJ14759 FIS, CLONE 
NT2RP3003290, MODERATELY 
SIMILAR TO MUS MUSCULUS 
NDR1 RELATED PROTEIN NDR3 
- Homo sapiens (Human), 363 aa. 


1..375 
1..363 


360/375 (96%) 
361/375 (96%) | 


o.o I 



PFam analysis predicts that the NOV89a protein contains the domains shown in the 
Table 89F. 
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Table 89F. Domain Analysis of NOV89a 


Pfam Domain 


NOV89a Match 
Region 


Identities/ 
Similarities 
for the Matched 
Region 


Expect 
Value 


Orn Arg deC N: domain 1 
ofl 


62..89 


7/33(21%) 
24/33 (73%) 


1.9 


abhydrolase: domain 1 of 1 


87..310 


48/239 (20%) 
142/239 (59%) 


0.0066 


Ndr: domain 1 of 1 


22..346 


210/340 (62%) 
311/340 (91%) 


3.7e-211 



Example 90. 

The NOV90 clone was analyzed, and the nucleotide and predicted polypeptide 
sequences are shown in Table 90A. 



Table 90A. NOV90 Sequence Analysis 




SEQIDNO:259 1 632 bp 


NOV90a, 

CG59764-01 DNA Sequence 


„ "j i A " r "TTTA? I " TG t TT ~ TTT i 


TCGCCGGTGCGACGGTACCACCATCCCAGCTGTGAGGCTGCCATCAACACCCACATCA 
GCCTGGAGCTCCACGCATCCTATGTGTACCTGTCCATGGCCTTCTACTTCGACCAGGA 
CGACGCGGCCCTGGAGCACTTTGACCGCTACTTCCTGCGCCAGTCGCAGGAGAAAAGG 
GAGCACGCCCAGGAGCTGATGAGCCTGCAGAACCTGCGCGGTGGCCGCATCTGCCTTC 
ATGACATCAGGAAGCCAGAGGGCCAAGGCTGGGAGAGCGGGCTCAAGGCCATGGAGTG 
CACCTTCCACCTGGAGAAGAACATCAACCAGAGCCTCCTGGAGCTGCACCAGCTGGCC 
AGGGAGAACGGCGACCCCCAGCTCTGCGACTTCCTGGAGAACGACTTCCTGAACCAGC 

CCCGGAAGCAGGCCTGGCAGAGTACCTCTTTAACAAGCTCACCCTGGGCCGCAGCGAA 
CCACTTCCTTGAACCAGCAGGCCAAGACCATCAAAGAGATTGGTGGCTACCT 




ORF Start: ATG at 41 


ORE Stop: TGA at 590 




SEQ ID NO: 260 


183 aa |MW at 21 159.6k!) 


NOV90a, 

CG59764-01 Protein Sequence 


MHFFDPSPVRRYHHPSCEAAINTHISLELHASYVYLSMAFYFDQDDAALEHFDRYFLR 
QSQEKREHAQELMSLQNLRGGRICLHDIRKPEGQGWESGLKAMECTFHLEKNINQSLL 
ELHQLARENGDPQLCDFLENDFLNQQAKTIKELGGYLSNLHKMGAPEAGLAEYLFNKL 
TLGRSEPLP 



Further analysis of the NOV90a protein yielded the following properties shown in 
Table 90B. 



Table 90B. Protein Sequence Properties NOV90a 


PSort 
analysis: 


0.4500 probability located in cytoplasm; 0.1400 probability located in 
microbody (peroxisome); 0.1000 probability located in mitochondrial matrix 
space; 0.1000 probability located in lysosome (lumen) 


SignalP 
analysis: 


No Known Signal Sequence Predicted 
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A search of the NOV90a protein against the Geneseq database, a proprietary 
database that contains sequences published in patents and patent publication, yielded 
several homologous proteins shown in Table 90C. 



Table 90C. Geneseq Results for NOV90a ] 


Geneseq 
Identifier 


Protein/Organism/Length [Patent 
#, Date] 


NOV90a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for 
the IVIatched 
Region 


Expect 
Value 


AAU07889 


Polypeptide sequence for human 
hspG34a - Homo sapiens, 221 aa. 
[WO200166752-A2, 13-SEP-2001] 


7..180 
45..218 


159/174 (91%) 
164/174 (93%) 




4e-91 


AAU07890 


Polypeptide sequence for human 
hspG34b - Homo sapiens, 1 83 aa. 
[WO200166752-A2, 13-SEP-2001] 


6..177 
6..177 


125/172 (72%) 
149/172 (85%) 


6e-70 


AAB90804 


Human shear stress-response 
protein SEQ ID NO: 108 - Homo 
sapiens, 183 aa. [WO200 125427- 
Al, 12-APR-2001] 


7.. 180 
7..180 


114/174 (65%) 
141/174 (80%) 




6e-64 


AAR71567 


Human monocyte growth factor - 
Homo sapiens, 1 83 aa. 
[JP07031482-A, 03-FEB-1995] 


7..180 
7..180 


114/174 (65%) 
141/174 (80%) 


6e-64 

j 


AAU27741 


Mouse full-length polypeptide 
sequence #66 - Mus musculus, 182 
aa. [WO200164834-A2, 07-SEP- 
2001] 


6..180 
6..180 


112/175 (64%) 
141/175 (80%) 


5e-63 



In a BLAST search of public sequence databases, the NOV90a protein was found to 
have homology to the proteins shown in the BLASTP data in Table 90D. 





Table 90D. Public HLASTP Results for NOV90a 




Protein 
Accession 
Number 


Protein/Organism/Length 


NOV90a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Portion 


Expect 
Value 


Q9BXU8 


Ferritin heavy polypeptide-like 17 
- Homo sapiens (Human), 1 83 aa. 


6..177 
6..177 


125/172 (72%) 
149/172 (85%) 


2e-69 


P29389 


Ferritin heavy chain (Ferritin H 
subunit) - Cricetulus griseus 
(Chinese hamster), 185 aa. 


6 180 
10..184 


115/175 (65%) 
142/175 (80%) 


6e-64 
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A26886 


ferritin heavy chain - chicken, 1 80 
aa. 


6..180 
5..179 


112/175 (64%) 
142/175 (81%) 


le-63 


P08267 


Ferritin heavy chain (Ferritin H 
subunit) - Gallus gallus (Chicken), 
179 aa. 


6..180 
4..178 


112/175(64%) 
142/175 (81%) 


le-63 


Q95MP7 


FERRITIN - Canis familiaris 
(Dog), 183 aa. 


6..180 
6..180 


112 175 (64%) 
143/175 (81%) 


2e-63 



PFam analysis predicts that the NOV90a protein contains the domains shown in the 
Table 90E. 



Table 90E. Domain Analysis of NOV90a 


Pfam Domain 


NOV90a Match 
Region 


Identities/ 
Similarities 
for the Matched 
Region 


Expect 
Value 


Bacteriofer: domain 1 of 
1 


14..159 


35/172(20%) 
76/172(44%) 


6.7 


ferritin: domain 1 of 1 


17..173 


92/161 (57%) 
138/161 (86%) 


4.7e-87 



Example 91. 

The NOV91 clone was analyzed, and the nucleotide and predicted polypeptide 
sequences are shown in Table 91 A. 



Table 91 A. NOV91 Sequence Analysis 




SEQ ID NO: 261 


487 bp 


NOV91a, 

CG59710-01 DNA Sequence 


TGCTGTGCGTTGTCTTTCCTTCTCACTCAAGCCTGTGAAATCTCTCTTTCAGGTTGAC 


AGACTAATGGAGTTGCATTTTAAATATCTGGGTGCAATGCAGGTGGCGGACAAGAAGA 
TTGAAGGGGAAAAACACGACATGGTCCGGCGAGGAGAGATCATCGACAATGACACCGA 
GGAGGAGTTCTACCTCCGGCGCCTGGATGCGGGGCTCTTTGTTCTCCAGCACATCTGC 
TACATCATGGCCGAGATCTGCAATGCCAATGTCCCCCAGATTCGCCAGAGGGTTCACC 
AGATCCTAAACATGCGAGGAAGCTCCATCAAAATTGTCAGGCATATCATCAAGGAGTA 
TGCAGAGAACATCGGGGACGGCCGGAGCCCGGAGTTCCGGGAGAACGAGCAAAAGCGC 
ATCCTGGGCTTGCTGGAGAACTTCTAGAGGCACCTTGGCCCTGCGCATCATGGACTCT 
CTCAGCTTCCCTCCCAGGATCAG 




ORF Start: ATG at 65 


ORF Stop: TAG at 431 




SEQ ID NO: 262 


122 aa MW at !4385.4kD 


NOV91a, 

CG59710-01 Protein Sequence 


MELHFKYLGAMQVADKKIEGEKHDMVRRGEIIDNDTEEEFYLRRLDAGLFVLQHICYI 
MAE I CNANVPQIRQRVHQI LNMRGSS I K I VRH I I KEYAENIGDGRS PEFRENEQKR I L 
GLLENF 



Further analysis of the NOV91a protein yielded the following properties shown in 
Table 91B. 
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Table 91B. Protein Sequence Properties NOV91a 


PSort 
analysis: 


0.6500 probability located in cytoplasm; 0.1000 probability located in 
mitochondrial matrix space; 0.1000 probability located in lysosome (lumen); 
0.0000 probability located in endoplasmic reticulum (membrane) 


SignalP 
analysis: 


No Known Signal Sequence Predicted 



A search of the NOV91a protein against the Geneseq database, a proprietary 
database that contains sequences published in patents and patent publication, yielded 
several homologous proteins shown in Table 91C. 



Table 91C. Geneseq Results for NOV91a 



Geneseq 
Identifier 


Protein/Organism/Length 
[Patent #, Date] 


NOV91a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Region 


Expect 
Value 


AAU28058 


Novel human secretory protein, 
Seq ID No 227 - Homo sapiens, 
518 aa. [WO200166689-A2, 13- 
SEP-2001] 


1..122 
397..518 


122/122(100%) 
122/122(100%) 


— 

le-66 

I 


AAM93729 


Human polypeptide, SEQ ID NO: 
3689 - Homo sapiens, 563 aa. 
[EP1130094-A2, 05-SEP-2001] 


1..122 
442..563 


122/122 (100%) 
122/122(100%) 


le-66 | 


AAB63116 


Human secreted protein sequence 
encoded by gene 39 SEQ ID 
NO: 126 - Homo sapiens, 401 aa. 
[WO200061748-A1, 19-OCT- 
2000] 


1-119 
283. .401 


- 

119/119(100%) 
119/119(100%) 


le-64 


AAU28246 


Novel human secretory protein, 
Seq ID No 603 - Homo sapiens, 
360 aa. [WO200166689-A2, 13- 
SEP-2001] 


1..118 
197..316 




104/120 (86%) 
106/120 (87%) 


2e-51 1 


ABB21673 


Protein #3672 encoded by probe 
for measuring heart cell gene 
expression - Homo sapiens, 32 aa. 
[WO200157274-A2, 09-AUG- 
2001] 


24..55 
1..32 


32/32 (100%) ! 
32/32(100%) 

1 

1 


Ml | 



In a BLAST search of public sequence databases, the NOV91 a protein was found to 
have homology to the proteins shown in the BLASTP data in Table 91 D. 
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Table 91D. Public BLASTP Results lor NOV91a 


| Protein 
| Accession 
i Number 


Protein/Organism/Length 


NOV91a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for 
the JMatched 
Portion 


Expect 


Q96KD2 


TESTES DEVELOPMENT- 
RELATED NYD-SP19 - Homo 
sapiens (Human), 376 aa. 


1..122 

255.376 


122/122 (100%) 
122/122(100%) 


5e-66 


Q9H7A5 


CDNA: FLJ21 108 FIS, CLONE 
CAS05257 - Homo sapiens 
(Human), 225 aa. 


1..122 
104..225 


121/122 (99%) 
121/122 (99%) 


5e-65 


062703 


PI 4 - Bos taurus (Bovine), 122 aa. 


1..122 
1..122 


116/122 (95%) 
119/122 (97%) 


2e-62 


Q9CWL8 


5730471 K09RIK PROTEIN - Mus 
musculus (Mouse), 563 aa. 


1..I22 
442..S63 


115/122 (94%) 
118/122 (96%) 


3e-62 I 


Q9Y3M7 


DJ633O20.1 (P14L, SIMILAR TO 
BOS TAURUS P14) - Homo 
sapiens (Human), 284 aa 
(fragment). 


1..93 
192..284 


93/93 (100%) 
93/93 (100%) 




3e-48 | 



PFam analysis predicts that the NOV91a protein contains the domains shown in the 
Table 9 IE. 



Table 91E. Domain Analysis of NOV91a 


Identities/ 

Pfam Domain ] NOV91a Match Region ) Similarities 


Expect Value 


j for the Matched Region 


No Significant Matches Found 




Example 92. 



The NOV92 clone was analyzed, and the nucleotide and predicted polypeptide 
sequences are shown in Table 92A. 



1 Table 92A. NOV92 Sequence Analysis 1 




SEQIDNO:263 ]6527 bp 


jNOV92a, 

jCG59754-02 DNA Sequence 


CCACAGAGGGGAAATGCCAGCTTCCCTCTCCCTGGGGCTCCGTGCCC r T 1 r 


GCCCTTCGAATTCCCACCCGCCTCCATCGGCCAGCTGCTCTACATTCCCTGTGTGGTG 


TCCTCGGGGGACATGCCCATCCGTATCACCTGGAGGAAGGACGGACAGGTGATGATCT 
CAGGCTCGGGCGTGACCATCGAGAGCAAGGAATTCATGAGCTCCCTGCAGATCTCTAG 
CGTCTCCCTCAAGCACAACGGCAACTATACATGCATCGCCAGCAACGCAGCCGCCACC 
GTGAGCATTGTGTCTCCAGAACACAGGTTTTTTATTACCTACCACGGCGGGCTGTACA 
TCTCTGACGTACAGAAGGAGGACGCCCTCTCCACCTATCGCTGCATCACCAAGCACAA 
GTATAGCGGGGAGACCCGGCAGAGCAATGGGGCACGCCTCTCTGTGACAGACCCTGCT 
r(G'T ■ [ 1 ~i -7 i f- T r _ 
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1 3 AG CI CCGCCTCGGGCTACCCTATCCCCGCCATCCGCTGGCTCAA 
GGATGGCCGGCCCCTCCCGGCTGACAGCCGCTGGACCAAGCGCATCACAGGGCTGACC 
ATCAGCGACTTGCGGACCGAGGACAGCGGCACCTACATTTGTGAGGTCACCAACACCT 

: TCGGTTCGGCAGAGGCCACAGGCATCCTCATGGTCATTGATCCCCTTCATGTGACCCT 
GACACCAAAGAAGCTGAAGACCGGCATTGGCAGCACGGTCATCCTCTCCTGTGCCCTG 
ACGGGCTCCCCAGAGTTCACCATCCGCTGGTATCGCAACACGGAGCTGGTGCTGCCTG 
ACGAGGCCATCTCCATCCGCGGGCTCAGCAACGAGACGCTGCTCATCACCTCGGCCCA 
GAAGAGCCATTCCGGGGCCTACCAGTGCTTCGCTACCCGCAAGGCCCAGACCGCCCAG 
GACTTTGCCATCATTGCACTTGAGGATGGCACGCCCCGCATCGTCTCGTCCTTCAGCG 
AGAAGGTGGTCAACCCCGGGGAGCAGTTCTCACTGATGTGTGCGGCCAAGGGCGCCCC 
GCCCCCCACGGTCACCTGGGCCCTCGACGATGAGCCCATCGTGCGGGATGGCAGCCAC 

\ CGCACCAACCAGTACACCATGTCGGACGGCACCACCATCAGCCACATGAACGTCACAG 
GCCCCCAGATCCGCGACGGGGGCGTGTACCGGTGCACAGCGCGGAACTTGGTGGGCAG 
TGCTGAATATCAGGCGCGAATAAACGTAAGAGGCCCACCCAGCATCCGGGCTATGCGG 
AACATCACAGCAGTCGCCGGGCGGGACACCCTTATCAACTGCAGGGTCATCGGCTATC 
CCTACTACTCCATCAAGTGGTACAAGGATGCCTTGCTGCTGCCAGACAACCACCGCCA 
GGTGGTGTTTGAGAATGGGACCCTCAAGCTGACTGACGTGCAGAAGGGCATGGATGAG 
GGGGAGTACCTGTGCAGTGTCCTCATCCAGCCCCAGCTCTCCATCAGCCAGAGCGTTC 
ACGTAGCCGTCAAAGTGCCCCCTCTGATCCAGCCCTTCGAATTCCCACCCGCCTCCAT 
CGGCCAGCTGCTCTACATTCCCTGTGTGGTGTCCTCGGGGGACATGCCCATCCGTATC 
ACCTGGAGGAAGGACGGACAGGTGATCATCTCAGGCTCGGGCGTGACCATCGAGAGCA 
AGGAATTCATGAGCTCCCTGCAGATCTCTAGCGTCTCCCTCAAGCACAACGGCAACTA 
TACATGCATCGCCAGCAACGCAGCCGCCACCGTGAGCCGGGAGCGTCAGCTCATCGTG 
CGTGTGCCCCCTCGATTTGTGGTGCAACCCAACAACCAGGATGGCATCTACGGCAAAG 
CTGGTGTGCTCAACTGCTCGGTGGACGGCTACCCCCCACCCAAGGTCATGTGGAAGCA 
TGCCAAGGGGAGCGGGAACCCCCAGCAGTACCACCCTGTGCCCCTCACTGGCCGCATC 
CAGATCCTGCCCAACAGCTCGCTGCTGATCCGCCACGTCCTAGAAGAGGACATCGGCT 
ACTACCTCTGCCAGGCCAGCAACGGCGTAGGCACCGACATCAGCAAGTCCATGTTCCT 
CACAGTCAAGATCCCGGCCATGATCACTTCCCACCCCAACACCACCATCGCCATCAAG 

; GGCCATGCGAAGGAGCTAAACTGCACGGCACGGGGTGAGCGGCCCATCATCATCCGCT 

I GGGAGAAGGGGGACACAGTCATCGACCCTGACCGCGTCATGCGGTATGCCATCGCCAC 
CAAGGACAACGGCGACGAGGTCGTCTCCACACTGAAGCTCAAGCCCGCTGACCGTGGG 
GACTCTGTGTTCTTCAGCTGCCATGCCATCAACTCGTATGGGGAGGACCGGGGCTTGA 
TCCAACTCACTGTGCAAGAGCCCCCCGACCCCCCAGAGCTGGAGATCCGGGAGGTGAA 

i GGCCCGGAGCATGAACCTGCGCTGGACCCAGCGATTCGACGGGAACAGCATCATCACG 
GGCTTCGACATTGAATACAAGAACAAATCAGATTCCTGGGACTTCAAGCAGTCCACAC 
GCAACATCTCCCCCACCATCAACCAGGCCAACATTGTGGACTTGCACCCGGCATCTGT 
GTACAGCATCCGCATGTACTCTTTCAACAAGATTGGCCGCAGTGAACCAAGCAAGGAG 
CTCACCATCAGCACTGAGGAGGCCGCTCCCGATGGGCCCCCCATGGATGTTACCTTGC 

; AGCCAGTGACCTCACAGAGCATCCAGGTGACCTGGAAGGCACCCAAGAAGGAGCTGCA 

j GAACGGTGTCATCCGGGGCTACCAGATTGGCTACAGAGAGAACAGCCCCGGCAGCAAC 
GGGCAGTACAGCATCGTGGAGATGAAGGCCACGGGGGACAGCGAGGTCTACACCCTGG 
ACAACCTCAAGAAGTTCGCCCAGTATGGGGTGGTGGTCCAAGCCTTCAATCGGGCTGG 
CACGGGGCCCTCTTCCAGCGAGATCAATGCCACCACTCTGGAGGATGTGCCCAGCCAG 
CCCCCTGAGAACGTCCGGGCCCTGTCCATCACTTCTGACGTGGCCGTCATCTCCTGGT 
CAGAGCCCCCGCGCAGCACCCTCAATGGCGTCCTCAAAGGCTATCGGGTCATCTTCTG 
GTCCCTCTATGTTGATGGGGAGTGGGGCGAGATGCAGAACATCACCACCACGCGGGAG 
CGGGTGGAGCTGCGGGGCATGGAGAAGTTCACCAACTACAGCGTCCAGGTGCTGGCCT 
ACACCCAGGCTGGGGACGGCGTACGCAGCAGTGTGCTCTACATCCAGACCAAGGAGGA 

GTGTCTTGGCTCCCCCCTACCAAGCCCAACGGGGTGATCCGCAAGTACACCATCTTCT 
GTTCCAGCCCCGGGTCTGGCCAGCCGGCTCCCAGCGAGTACGAGACGAGTCCAGAGCA 

i GCTCTTCTACCGGATCGCCCACCTAAACCGCGGTCAGCAGTATCTGCTGTGGGTGGCC 
GCCGTCACCTCTGCCGGCCGGGGCAACAGCAGCGAGAAGGTGACCATCGAGCCTGCTG 
GCAAGGCCCCAGCAAAGATCATCTCCTTTGGGGGCACCGTGACAACACCTTGGATGAA 
AGATGTTCGGCTGCCTTGCAATTCAGTGGGAGATCCAGCCCCTGCTGTGAAGTGGACC 
AAGGACAGTGAAGACTCGGCCATTCCAGTGTCCATGGATGGGCACCGGCTCATCCACA 
CCAATGGCACACTGCTGCTGCGTGCAGTGAAGGCTGAGGACTCTGGCTACTACACGTG 
CACGGCCACCAACACTGGTGGCTTTGACACCATCATCGTCAACCTTCTGGTGCAAGTT 
CCCCCGGACCAGCCCCGCCTCACTGTCTCCAAAACCTCAGCTTCGTCCATCACCCTGA 
CCTGGATTCCAGGTGACAATGGGGGCAGCTCCATCCGAGGCTTCGTGCTACAGTACTC 
GGTGGACAACAGCGAGGAGTGGAAGGATGTGTTCATCAGCTCCAGCGAGCGCTCCTTC 
AAGCTGGACAGCCTCAAGTGTGGCACGTGGTACAAGGTGAAGCTGGCAGCCAAGAACA 
GCGTGGGCTCTGGGCGCATCAGCGAGATCATCGAGGCCAAGACCCACGGGCGGGAGCC 
CTCCTTCAGCAAAGACCAACACCTCTTCACCCACATCAACTCCACGCATGCTCGGCTT 
AACCTGCAGGGCTGGAACAATGGGGGCTGCCCTATCACAGCCATCGTTCTGGAGTACC 

j GGCCCAAGGGGACCTGGGCCTGGCAGGGCCTCCGGGCCAACAGCTCCGGGGAGGTGTT 
TCTGACGGAACTGCGAGAGGCCACGTGGTACGAGCTGCGCATGAGGGCTTGCAACAGT 
GCGGGCTGCGGCAATGAAACAGCCCAGTTCGCCACCCTGGACTACGATGGCAGCACCA 
TTCCACCCATCAAGTCTGCTCAAGGTGAAGGGGATGATGTGAAGAAGCTGTTCACCAT 
CGGCTGCCCTGTCATCCTGGCCACACTGGGGGTGGCACTGCTCTTCATCGTACGCAAG 
AAGAGGAAGGAGAAACGGCTGAAGCGACTCCGAGATGCAAAGAGTTTGGCAGAAATGT 
TGATAAGCAAGAACAATAGAAGCTTTGACACCCCTGTGAAAGGGCCACCCCAGGGCCC 
ACGGCTACACATTGACATCCCCAGGGTCCAGCTGCTCATCGAGGACAAAGAAGGCATC 
AAGCAACTGGGAGATGACAAGGCCACCATCCCTGTGACAGATGCTGAGTTCAGCCAAG 

jCTGTCAACCCACAGAGCTTCTGTACTGGCGTCTCCTTGCACCACCCAACCCTCATCCA 

|GAGCACAGGACCCCTCATCGACATGTCTGACATCCGGCCAGGAACCAATCCAGTGTCC 
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TGACCAAGTGCCAGGCCTCCACACCTGCCCGCACCCTCACCTCCGACTGGCGCACCGT 
GGGCTCCCAGCATGGTGTCACGGTCACTGAGAGTGACAGCTACAGTGCCAGCCTGTCC 
CAGGACACAGACAAAGGAAGGAACAGCATGGTGTCCACTGAGAGTGCCTCTTCCACCT 
ACGAGGAGCTGGCCCGGGCCTATGAGCATGCCAAGCTGGAGGAGCAGCTGCAGCACGC 
CAAGTTTGAGATCACCGAGTGCTTCATCTCTGACAGTTCCTCTGACCAGATGACCACA 
GGCACCAACGAGAACGCCGACAGCATGACATCCATGAGCACACCCTCAGAGCCTGGCA 
TCTGCCGCTTTACCGCCTCACCACCCAAGCCCCAGGATGCGGACCGGGGCAAAAACGT 
GGCTGTGCCCATCCCTCACCGGGCCAACAAGAGTGACTACTGCAACCTGCCCCTGTAT 
GCCAAGTCAGAGGCCTTCTTTCGAAAGGCAGATGGACGTGAGCCCTGCCCCGTGGTCC 
CACCCCGTGAGGCCTCCATCCGGAACCTGGCTCGAACCTACCACACCCAGGCTCGCCA 
CCTGACCCTGGACCCTGCCAGCAAGTCCTTGGGCCTTCCCCACCCAGGGGCCCCCGCT 
GCCGCCTCCACAGCCACCTTACCTCAGAGGACTCTGGCCATGCCAGCCCCCCCAGCCG 
GCACAGCCCCCCCAGCCCCCGGCCCCACCCCTGCTGAGCCACCCACCGCCCCCAGCGC 
TGCCCCTCCGGCCCCCAGCACCGAGCCTCCACGAGCCGGGGGCCCACACACCAAAATG 
GGGGGCTCCAGGGACTCGCTTCTCGAGATGAGCACATCGGGGGTAGGGAGGTCTCAGA 
AGCAGGGGGCCGGGGCCTACTCCAAATCCTACACCCTGGTGTAGGGCCGGCAGGAAGA 


TGCATTATTTATATTCAACTGACAGACAAAAAC AAC V CG? AA :t iAAACCCC 


CAATCATGAACGCCTGTACATAGAACTCTTTTGTACAAATGAAACTATTTTCTTCTTC 


TCCATGAAGCCAGGGCACAAAGAATTTGACAGTACAAGTCAAATCCCCCACCCCACAA 


AATATGTGTGGAGATATATATACATATATAGACAGACAGGAACGCCTCCACGAGCTAT 


ATATCTATATATTTCTCTCACCCTATTTTGAGACAGAGGCACAAAGACTCAGCAATTT 


T T lAAATCC 


CAACTCAGAC/ n , ' JATTTTACTGTTCCAAGAAAACCAGGAGTTGCTTCAA 




TTTGCAGATGCTTATGTGTTAATACCTTTTTCTATGAAAAAAGACCCAGCGCCGTGTG 
CAATAAAGGTT^ 1 TT2 CAAAAAAAAGCTT 


lORF Start: ATG at 
129 


ORF Stop: TAG at 5958 


! jSEQIDNO:264 


1 943 aa MW at 2 1 1 904.3kD 


NOV92a, 

CG59754-02 Protein 
Sequence 


IT 7? ITWRKDGQVI I SGSGVT I ES KEFKSSLQI SSV3^KH>SGNYTC IASNAAATVS IV 
SPEHRFFITYHGGLYISDVQKEDALSTYRCITKHKYSGETRQSNGARLSVTDPAESIP 
T I LDGPHSQEVWAGHTVELPCTASGYP I PAI RWLKDGRPLPADSRWTKR I TGLT I SDL 
RTEDSGTYICEVTNTFGSAEATGILMVIDPLHVTLTPKKLKTGIGSTVILSCALTGSP 
EFTIRWYRNTELVLPDEAISIRGLSNETLLITSAQKSHSGAYQCFATRKAQTAQDFAI 
IALEDGTPRIVSSFSEKWNPGEQFSLMCAAKGAPPPTVTWALDDEPIVRDGSHRTNQ 
YTMSDGTTISHMNVTGPQIRDGGVYRCTARNLVGSAEYQARINVRGPPSIRAMRNITA 
VAGRDTLINCRVIGYPYYSIKWYKDALLLPDNHRQWFENGTLKLTDVQKGMDEGEYL 
CSVLIQPQLSISQSVHVAVKVPPLIQPFEFPPASIGQLLYIPCWSSGDMPIRITWRK 
DGQVIISGSGVTIESKEFMSSLQISSVSLKHNGKYTCIASNAAATVSRERQLIVRVPP 
RFWQPNNQDGIYGKAGVLNCSVDGYPPPKVMWKHAKGSGNPQQYHPVPLTGRIQILP 
NSSLLIRHVLEEDIGYYLCQASNGVGTDISKSMFLTVKIPAMITSHPNTTIAIKGHAK 
ELNCTARGERPI I IRWEKGDTVIDPDRVMRYAIATKDNGDEWSTLKLKPADRGDSVF 
FSCHAINSYGEDRGLI QLTVQEPPDPPELE I REVKARSMNLRWTQRFDGNS I ITGFDI 
EYKNKSDSWDFKQSTRNISPTINQANIVDLHPASVYSIRMYSFNKIGRSEPSKELTIS 
TEEAAPDGPPMDVTLQPVTSQSIQVTWKAPKKELQNGVIRGYQIGYRENSPGSNGQYS 
IVEMKATGDSEVYTLDNLKKFAQYGVWQAFNRAGTGPSSSEINATTLEDVPSQPPEN 
VRALS I TSDVAVI SWSEPPRSTLNGVLKGYRVI FWSLYVDGEWGEMQNI TTTRERVEL 
RGMEKFTNYSVQVLAYTQAGDGVRSSVLYIQTKEDVPGPPAGIKAVPSSASSVWSWL 
PPTKPNGVIRKYTIFCSSPGSGQPAPSEYETSPEQLFYRIAHLNRGQQYLLWVAAVTS 
AGRGNSSEKVTIEPAGKAPAKIISFGGTVTTPWMKDVRLPCNSVGDPAPAVKWTKDSE 
DSAIPVSMDGHRLIHTNGTLLLRAVKAEDSGYYTCTATNTGGFDTIIVNLLVQVPPDQ 
PRLTVSKTSASSITLTWIPGDNGGSSIRGFVLQYSVDNSEEWKDVFISSSERSFKLDS 
LKCGTWYKVKLAAKNSVGSGRISEIIEAKTHGREPSFSKDQHLFTHINSTHARLNLQG 
WNNGGCPITAIVLEYRPKGTWAWQGLRANSSGEVFLTELREATWYELRMRACNSAGCG 
NETAQFATLDYDGSTIPPIKSAQGEGDDVKKLFTIGCPVILATLGVALLFIVRKKRKE 
KRLKRLRDAKSLAEMLISKNNRSFDTPVKGPPQGPRLHIDIPRVQLLIEDKEGIKQLG 
DDKATIPVTDAEFSQAVNPQSFCTGVSLHHPTLIQSTGPLIDMSDIRPGTNPVSRKNV 
KSAHSTRNRYSSQWTLTKCQASTPARTLTSDWRTVGSQHGVTVTESDSYSASLSQDTD 
KGRNSMVSTESASSTYEELARAYEHAKLEEQLQHAKFEITECFISDSSSDQMTTGTNE 
NADSMTSMSTPSEPGICRFTASPPKPQDADRGKNVAVPIPHRANKSDYCNLPLYAKSE 
AFFRKADGREPCPWPPREASIRNLARTYHTQARHLTLDPASKSLGLPHPGAPAAAST 
ATLPQRTLAMPAPPAGTAPPAPGPTPAEPPTAPSAAPPAPSTEPPRAGGPHTKMGGSR 
DSLLEMSTSGVGRSQKQGAGAYSKSYTLV 




SEQIDNO:265 16049 bp 


NOV92b, 

CG5 9754-01 DNA Sequence 


GCCCTTCGAATTCCCACCCGCCTCCATCGGCCAGCTGCTCTACATTCCCTGTGTGGTG 


TCCTCGGGGGACATGCCCATCCGTATCACCTGGAGGAAGGACGGACAGGTGATGATrT 
CAGGCTCGGGCGTGACCATCGAGAGCAAGGAATTCATGAGCTCCCTGCAGATCTCTAG 
CGTCTCCCTCAAGCACAACGGCAACTATACATGCATCGCCAGCAACGCAGCCGCCACC 
GTGAGCATTGTGTCTCCAGAACACAGGTTTTTTATTACCTACCACGGCGGGCTGTACA 
TCTCTGACGTACAGAAGGAGGACGCCCTCTCCACCTATCGCTGCATCACCAAGCACAA 



367 



GTATAGCGGGGAGACCCGGCAGAGCAATGGGGCACGCCTCTCTGTGACAGACCCTGCT 
GAGTCGATCCCCACCATCCTGGATGGCTTCCACTCCCAGGAAGTGTGGGCCGGCCACA 
CCGTGGAGCTGCCCTGCACCGCCTCGGGCTACCCTATCCCCGCCATCCGCTGGCTCAA 
GGATGGCCGGCCCCTCCCGGCTGACAGCCGCTGGACCAAGCGCATCACAGGGCTGACC 
ATCAGCGACTTGCGGACCGAGGACAGCGGCACCTACATTTGTGAGGTCACCAACACCT 
TCGGTTCGGCAGAGGCCACAGGCATCCTCATGGTCATTGATCCCCTTCATGTGACCCT 
GACACCAAAGAAGCTGAAGACCGGCATTGGCAGCACGGTCATCCTCTCCTGTGCCCTG 
ACGGGCTCCCCAGAGTTCACCATCCGCTGGTATCGCAACACGGAGCTGGTGCTGCCTG 
ACGAGGCCATCTCCATCCGCGGGCTCAGCAACGAGACGCTGCTCATCACCTCGGCCCA 
GAAGAGCCATTCCGGGGCCTACCAGTGCTTCGCTACCCGCAAGGCCCAGACCGCCCAG 
GACTTTGCCATCATTGCACTTGAGGATGGCACGCCCCGCATCGTCTCGTCCTTCAGCG 
AGAAGGTGGTCAACCCCGGGGAGCAGTTCTCACTGATGTGTGCGGCCAAGGGCGCCCC 
GCCCCCCACAGTCACCTGGGCCCTCGACGATGAGCCCATCGTGCGGGATGGCAGCCAC 
CGCACCAACCAGTACACCATGTCGGACGGCACCACCATCAGCCACATGAACGTCACAG 
GCCCCCAGATCCGCGACGGGGGCGTGTACCGGTGCACAGCGCGGAACTTGGTGGGCAG 
| TGCTGAATATCAGGCGCGAATAAACGTAAGAGGCCCACCCAGCATCCGGGCTATGCGG 
j AACATCACAGCAGTCGCCGGGCGGGACACCCTTATCAACTGCAGGGTCATCGGCTATC 
CCTACTACTCCATCAAGTGGTACAAGGATGCCTTGCTGCTGCCAGACAACCACCGCCA 
GGTGGTGTTTGAGAATGGGACCCTCAAGCTGACTGACGTGCAGAAGGGCATGGATGAG 
GGGGAGTACCTGTGCAGTGTCCTCATCCAGCCCCAGCTCTCCATCAGCCAGAGCGTTC 
ACGTAGCCGTCAAAGTGCCCCCTCTGATCCAGCCCTTCGAATTCCCACCCGCCTCCAT 
CGGCCAGCTGCTCTACATTCCCTGTGTGGTGTCCTCGGGGGACATGCCCATCCGTATC 
ACCTGGAGGAAGGACGGACAGGTGATCATCTCAGGCTCGGGCGTGACCATCGAGAGCA 
AGGAATTCATGAGCTCCCTGCAGATCTCTAGCGTCTCCCTCAAGCACAACGGCAACTA 
TACATGCATCGCCAGCAACGCAGCCGCCACCGTGAGCCGGGAGCGTCAGCTCATCGTG 
CGTGTGCCCCCTCGATTTGTGGTGCAACCCAACAACCAGGATGGCATCTACGGCAAAG 
CTGGTGTGCTCAACTGCTCGGTGGACGGCTACCCCCCACCCAAGGTCATGTGGAAGCA 
TGCCAAGGGTAGCGGGAACCCCCAGCAGTACCACCCTGTGCCCCTCACTGGCCGCATC 
CAGATCCTGCCCAACAGCTCGCTGCTGATCCGCCACGTCCTAGAAGAGGACATCGGCT 
ACTACCTCTGCCAGGCCAGCAACGGCGTAGGCACCGACATCAGCAAGTCCATGTTCCT 
CACAGTCAAGATCCCCACCATCCTGGATGGCTTCCACTCCCAGGAAGTGTGGGCCGGC 
CACACCGTGGAGCTGCCCTGCACCGCCTCGGGCTACCCTATCCCCGCCATCCGCTGGC 
TCAAGGATGGCCGGCCCCTCCCGGCTGACAGCCGCTGGACCAAGCGCATCACAGGGCT 
GACCATCAGCGACTTGCGGACCGAGGACAGCGGCACCTACATTTGTGAGGrCACCAAC 
ACCTTCGGTGAGGCCACAGGCATCCTCATGGTCATTGGTGAGGAGCCCCCCGACCCCC 
CAGAGCTGGAGATCCGGGAGGTGAAGGCCCGGAGCATGAACCTGCGCTGGACCCAGCG 
ATTCGACGGGAACAGCATCATCACGGGCTTCGACATTGAATACAAGAACAAATCAGAT 
TCCTGGGACTTCAAGCAGTCCACACGCAACATCTCCCCCACCATCAACCAGGCCAACA 
TTGTGGACTTGCACCCGGCATCTGTGTACAGCATCCGCATGTACTCTTTCAACAAGAT 
TGGCCGCAGTGAACCAAGCAAGGAGCTCACCATCAGCACTGAGGAGGCCTCAGCTCCC 
GATGGGCCCCCCATGGATGTTACCTTGCAGCCAGTGACCTCACAGAGCATCCAGGTGA 
CCTGGAAGCAGGCACCCAAGAAGGAGCTGCAGAACGGTGTCATCCGGGGCTACCAGAT 
TGGCTACAGAGAGAACAGCCCCGGCAGCAACGGGCAGTACAGCATCGTGGAGATGAAG 
GCCACGGGGGACAGCGAGGTCTACACCCTGGACAACCTCAAGAAGTTCGCCCAGTATG 
GGGTGGTGGTCCAGGCCTTCAATCGGGCTGGCACGGGGCCCTCTTCCAGCGAGATCAA 
TGCCACCACTCTGGAGGATGTGCCCAGCCAGCCCCCTGAGAACGTCCGGGCCCTGTCC 
ATCACTTCTGACGTGGCCGTCATCTCCTGGTCAGAGCCCCCGCGCAGCACCCTCAATG 
GCGTCCTCAAAGGCTATCGGGTCATCTTCTGGTCCCTCTATGTTGATGGGGAGTGGGG 
CGAGATGCAGAACATCACCACCACGCGGGAGCGGGTGGAGCTGCGGGGCATGGAGAAG 
TTCACCAACTACAGCGTCCAGGTGCTGGCCTACACCCAGGCTGGGGACGGCGTACGCA 
GCAGTGTGCTCTACATCCAGACCAAGGAGGACGTTCCAGGTCCCCCTGCTGGCATCAA 




GCGAGTACGAGACGAGTCCAGAGCAGCTCTTCTACCGGATCGCCCACCTAAACCGCGG 
TCAGCAGTATCTGCTGTGGGTGGCCGCCGTCACCTCTGCCGGCCGGGGCAACAGCAGC 
GAGAAGGTGACCATCGAGCCTGCTGGCAAGGCCCCAGCAAAGATCATCTCCTTTGGGG 
GCACCGTGACAACACCTTGGATGAAAGATGTTCGGCTGCCTTGCAATTCAGTGGGAGA 
TCCAGCCCCTGCTGTGAAGTGGACCAAGGACAGTGAAGACTCGGCCATTCCAGTGTCC 
ATGGATGGGCACCGGCTCATCCACACCAATGGCACACTGCTGCTGCGTGCAGTGAAGG 
CTGAGGACTCTGGCTACTACACGTGCACGGCCACCAACACTGGTGGCTTTGACACCAT 
CATCGTCAACCTTCTGGTGCAAGTTCCCCCGGACCAGCCCCGCCTCACTGTCTCCAAA 
ACCTCAGCTTCGTCCATCACCCTGACCTGGATTCCAGGTGACAATGGGGGCAGCTCCA 
TCCGAGGTTTTGTGCTACAGTACTCGGTGGACAACAGCGAGGAGTGGAAGGATGTGTT 
CATCAGCTCCAGCGAGCGCTCCTTCAAGCTGGACAGCCTCAAGTGTGGCACGTGGTAC 
AAGGTGAAGCTGGCAGCCAAGAACAGCGTGGGCTCTGGGCGCATCAGCGAGATCATCG 
AGGCCAAGACCCACGGGCGGGAGCCCTCCTTCAGCAAAGACCAACACCTCTTCACCCA 
CATCAACTCCACGCATGCTCGGCTTAACCTGCAGGGCTGGAACAATGGGGGCTGCCCT 
ATCACAGCCATCGTTCTGGAGTACCGGCCCAAGGGGACCTGGGCCTGGCAGGGCCTCC 
GGGCCAACAGCTCCGGGGAGGTGTTTCTGACGGAACTGCGAGAGGCCACGTGGTACGA 
GCTGCGCATGAGGGCTTGCAACAGTGCGGGCTGCGGCAATGAAACAGCCCAGTTCGCC 
ACCCTGGACTACGATGGCAGTACCATTCCACCCATCAAGTCTGCTCAAGGTGAAGGGG 
ATGATGTGAAGAAGCTGTTCACCATCGGCTGCCCTGTCATCCTGGCCACACTGGGGGT 
GGCACTGCTCTTCATCGTACGCAAGAAGAGGAAGGAGAAACGGCTGAAGCGACTCCGA 
GATGCAAAGAGTTTGGCAGAAATGTTGATAAGCAAGAACAATAGAAGCTTTGACACCC 
CTGTGAAAGGGCCACCCCAGGGCCCACGGCTACACATTGACATCCCCAGGGTCCAGCT 
GCTCATCGAGGACAAAGAAGGCATCAAGCAACTGGGTGAGGACAAGGCCACCATCCCT 
GTGACAGATGCTGAGTTCAGCCAAGCTGTCAACCCACAGAGCTTCTGTACTGGCGTCT 
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CCTTGCACCACCCAACCCTCATCCAGAGCACAGGACCCCTCATCGACATGTCTGACAT 
CCGGCCAGGAACCGATCCAGTGTCCAGGAAGAATGTGAAGTCAGCCCACAGCACCCGG 
AACCGGTACTCAAGCCAGTGGACCCTGACCAAGTGCCAGGCCTCCACACCTGCCCGCA 
CCCTCACCTCCGACTGGCGCACCGTGGGCTCCCAGCATGGTGTCACGGTCACTGAGAG 
TGACAGCTACAGTGCCAGCCTGTCCCAGGACACAGACAAAGGAAGGAACAGCATGGTG 
TCCACTGAGAGTGCCTCTTCCACCTACGAGGAGCTGGCCCGGGCCTATGAGCATGCCA 
AGCTGGAGGAGCAGCTGCAGCACGCCAAGTTTGAGATCACCGAGTGCTTCATCTCTGA 
CAGTTCCTCTGACCAGATGACCACAGGCACCAACGAGAACGCCGACAGCATGACATCC 
ATGAGCACACCCTCAGAGCCTGGCATCTGCCGCTTTACCGCCTCACCACCCAAGCCCC 
AGGATGCGGACCGGCTGCTGATGCTGGTCCCAGGTGCCCACCTCCCTCCTCAGTCCAT 
CCATGTTGTAGCATATGTCAGAATTTCCTTCTTACTGAACAAGGGTGGGGGAGACCTG 
GCTTCTGATCTTAGCTCCGGCAGAGCTTGCAGTGAGCCGAGATCACGCGGCACCCGGC 
CACCAACACTGGTGGCTTTGACACCATCATCGTCAACCTGTGAGGCAGGTGACCCCAG 
GTGGGGACAGGGATGGAGAAAGGGTAGGGATTCCATCATGCGAGAGGGTCATCGAATG 
GAAGAAGCCAAACCAAGGGAGAGACAGACCTCTGGAGAAACAGAGGTGCACATGGAAG 
GGGAGGCAGGAGAGCTGGGGAGTGGGAGTGGGAGTGAGGGTGTGGGAGAGCCAGCACC 
TTCCCGTCACGGGGGGACTCCCCACACCCCATCACAGGGTCCGCCCTTGTGCTAAGGG 
GTGGTGGCTTTCCCCTCACAGTTCCCCCGGACCAGCCCCGCCTCACTGTCTCCAAAAC 


CTCAGCTTCGTCCATCACCCTGACCTGGATTCCAGGTGACAATGGGGGCAGCTCCATC 


i r "1 rTCTGGGCC GGG 


GCAGGGAAGGGGCTT - 




OPvF Start: ATG at 
129 


ORF Stop: TAA at 5853 




SEQ ID NO: 266 


1908 aa |MW at 208575 .3kD 


NOV92b, 

CG5 9754-01 Protein 
Sequence 


MPI R ITWRKDGQVI I SGSGVTIESKEFMSSLQI SSVSLKHNGNYTCI ASNAAATVS I V 
SPEHRFFITYHGGLYISDVQKEDALSTYRCITKHKYSGETRQSNGARLSVTDPAESIP 
TILDGFHSQEVWAGHTVELPCTASGYPIPAIRWLKDGRPLPADSRWTKRITGLTISDL 
RrEDSGTYICEVTNTFGSAEATGILMVIDPLHVTLTPKKLKTGIGSTVILSCALTGSP 
EFTIRWYRNTELVLPDEAISIRGLSNETLLITSAQKSHSGAYQCFATRKAQTAQDFAI 
IALEDGTPRIVSSFSEKWNPGEQFSLMCAAKGAPPPTVTWALDDEPIVRDGSHRTNQ 
YTMSDGTTISHMNVTGPQIRDGGVYRCTARNLVGSAEYQARINVRGPPSIRAMRNITA 
VAGRDTLINCRVIGYPYYSIKWYKDALLLPDNHRQWFENGTLKLTDVQKGMDEGEYL 
CSVLIQPQLSISQSVHVAVKVPPLIQPFEFPPASIGQLLYIPCWSSGDMPIRITWRK 
DGQVIISGSGVTIESKEFMSSLQISSVSLKHNGNYTCIASNAAATVSRERQLIVRVPP 
RFWQPNWQDGIYGKAGVLNCSVDGYPPPKVMWKHAKGSGNPQQYHPVPLTGRIQILP 
NSSLLIRHVLEEDIGYYLCQASNGVGTDISKSMFLTVKIPTILDGFHSQEVWAGHTVE 
LPCTASGYPIPAIRWLKDGRPLPADSRWTKRITGLTISDLRTEDSGTYICEVTNTFGE 
ATGILMVIGEEPPDPPELEIREVKARSMNLRWTQRFDGNSIITGFDIEYKNKSDSWDF 
KQSTRNISPTINQANIVDLHPASVYSIRMYSFNKIGRSEPSKELTISTEEASAPDGPP 
MDVTLQPVTSQSIQVTWKQAPKKELQNGVIRGYQIGYRENSPGSNGQYSIVEMKATGD 
SEVYTLDNLKKFAQYGVWQAFNRAGTGPSSSEINATTLEDVPSQPPENVRALSITSD 
VAVISWSEPPRSTLNGVLKGYRVIFWSLYVDGEWGEMQNITTTRERVELRGMEKFTNY 
SVQVLAYTQAGDGVRSSVLYIQTKEDVPGPPAGIKAVPSSASSVWSWLPPTKPNGVI 
RKYTIFCSSPAPQAPSEYETSPEQLFYRIAHLNRGQQYLLWVAAVTSAGRGNSSEKVT 
IEPAGKAPAKIISFGGTVTTPWMKDVRLPCNSVGDPAPAVKWTKDSEDSAIPVSMDGH 
RLIHTNGTLLLRAVKABDSGYYTCTATNTGGFDT I IVNLLVQVPPDQPRLTVSKTSAS 
SITLTWIPGDKGGSSIRGFVLQYSVDNSEEWKDVFISSSERSFKLDSLKCGTWYKVKL 
AAKNSVGSGRISE I I EAKTHGREPSFSKDQHLFTHINSTHARLNLQGWNNGGCP I TAI 
VLEYRPKGTWAWQGLRANSSGEVFLTELRHATWYELRMRACNSAGCGNETAQFATLDY 
DGSTIPPI KSAQGEGDDVKKLFTI GCPVI LATLGVALLF I VRKKRKEKRLKRLRDAKS 
LAEMLISKNNRSFDTPVKGPPQGPRLHIDIPRVQLLIEDKEGIKQLGEDKATIPVTDA 
EFSQAVNPQSFCTGVSLHHPTLIQSTGPLIDMSDIRPGTDPVSRKNVKSAHSTRNRYS 
SQWTLTKCQASTPARTLTSDWRTVGSQHGVTVTESDSYSASLSQDTDKGRNSMVSTES 
ASSTYEELARAYEHAKLEEQLQHAKFEITECFISDSSSDQMTTGTNENADSMTSMSTP 
SEPGICRFTASPPKPQDADRLLMLVPGAHLPPQSIHWAYVRISFLLNKGGGDLASDL 
SSGRACSEPRSRGTRPPTLVALTPSSSTCEAGDPRWGQGWRKGRDSIMREGHRMEEAK 
PRERQTSGETEVHMEGEAGE IGG 1 



Sequence comparison of the above protein sequences yields the following sequence 
relationships shown in Table 92B. 
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Table 92B. Comparison of NOV92a against NOV92b. 


Protein Sequence 


NOV92a Residues/ 
Match Residues 


Identities/ 
Similarities for the Matched Region 


NOV92b 


1..1771 
1..1760 


1663/1773 (93%) 
1681/1773 (94%) 



Further analysis of the NOV92a protein yielded the following properties shown in 
Table 92C. 



Table 92C. Protein Sequence Properties NOV92a 


PSort 
analysis: 


0.7000 probability located in plasma membrane; 0.3000 probability located in 
microbody (peroxisome); 0.3000 probability located in nucleus; 0.2000 
probability located in endoplasmic reticulum (membrane) 


SignalP 
analysis: 


No Known Signal Sequence Predicted 



A search of the NOV92a protein against the Geneseq database, a proprietary 
database that contains sequences published in patents and patent publication, yielded 
several homologous proteins shown in Table 92D. 



Table 92D. Geneseq Results for N(>\ «>2.i 


Geneseq 
Identifier 


Protein/Organism/Length 
[Patent #, Date] 


NOV92a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for the 
Matched Region 


I 

Expect 
Value 


AAU28091 


Novel human secretory protein, 
Seq ID No 260 - Homo sapiens, 
1744 aa. [WO200166689-A2, 13- 
SEP-2001] 


200.. 1943 
1..1744 

200.. 1943 
1..1744 


1744/1744(100%) 
1744/1744(100%) 


0.0 


' AAM78713 


Human protein SEQ ID NO 1375 
- Homo sapiens, 1744 aa. 
[WO200157190-A2, 09-AUG- 
2001] 


1744/1744(100%) 
1744/1744(100%) 


0.0 


! AAM39040 


Human polypeptide SEQ ID NO 
2185 - Homo sapiens, 1744 aa. 
[WO200153312-A1, 26-JUL- 
2001] 


200.. 1943 
1..1744 


1744/1744(100%) 
1744/1744(100%) 


0.0 


AAW42086 


Human Down syndrome-cell 
adhesion molecule DS-CAM1 - 


44.. 1778 
154.. 1890 


1085/1745 (62%) 
1357/1745 (77%) 

— 


o.o 1 
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[ 


| [W09817795-A1, 30-APR-1998] 








I AAW42087 


Human Down syndrome-cell 
adhesion molecule DS-CAM2 - 
Homo sapiens, 1571 aa. 
[W'(mi77 ( )5-Al,30-APR-1998] 


44..1457 
154..1564 


890/1416(62%) 
1109/1416(77%) 


: 0.0 


In a BLAST search of public sequence databases, the NOV92a protein was found to 
have homology to the proteins shown in the BLASTP data in Table 92E. 


Table 92E. Public BLASTP Results for NOV92a 1 


Protein 
Accession 
Number 


Protein/Organism/Length 


NOV92a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for the 
Matched Portion 


Value 


AAL57166 


DOWN SYNDROME CELL 
ADHESION MOLECULE 
DSCAML1 - Homo sapiens 
(Human), 2053 aa. 


44.. 104? 
155..2053 


1889/1900 (99%) 
1892/1900 (99%) 


0.0 


Q9ULT7 


KIAA1 132 PROTEIN - Homo 
sapiens (Human), 1 822 aa 
(fragment). 


122.. 1943 
1..1822 


1822/1822(100%) 
1822/1822(100%) 


0.0 


060469 


Down syndrome cell adhesion 
molecule precursor (CHD2) - 
Homo sapiens (Human), 2012 aa. 


44..1 943 
154..2012 


1123/1920 (58%) 
1410/1920 (72%) 


0.0 1 


Q9ERC8 


DOWN SYNDROME CELL 
ADHESION MOLECULE - Mus 
musculus (Mouse), 2013 aa. 


44.. 1943 
154..2013 


1119/1921 (58%) 
1405/1921 (72%) 


0.0 


AAL57167 


DOWN SYNDROME CELL 
ADHESION MOLECULE 
DSCAM - Rattus norvegicus 
(Rat), 2013aa. 


44..1943 
I54..2013 


1119/1921 (58%) 
1405/1921 (72%) 


0.0 



5 PFam analysis predicts that the NOV92a protein contains the domains shown in the 

Table 92F. 
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Table 92F. Domain Analysis of NOV92a 


Pfam Domain 


NOV92a Match Region 


Identities/ 
Similarities 
for the Matched Region 


Expect Value 


ig: domain 1 of 10 


1..48 


12/49 (24%) 
38/49 (78%) 


2.7e-05 


ig: domain 2 of 10 


72..90 


8/19(42%) 
14/19 (74%) 


85 


ig: domain 3 of 10 


130..186 


22/60 (37%) 
46/60 (77%) 


2.1e-14 


ig: domain 4 of 10 


219..278 


16/63 (25%) 
44/63 (70%) 


4.9e-09 


ig: domain 5 of 10 


312..377 


14/69 (20%) 
50/69 (72%) 


1.5e-07 


ig: domain 6 of 10 


409..467 


12/61 (20%) 
41/61 (67%) 


4.8e-05 


ig: domain 7 of 10 


500..561 


17/64 (27%) 
49/64 (77%) 


3.2e-ll 


ig: domain 8 of 10 


594..659 


19/69(28%) 
47/69 (68%) 


9.4e-07 


ig: domain 9 of 10 


693..759 


9/70(13%) 
47/70 (67%) 


7.9e-06 


fn3 : domain 1 of 6 


777..864 


22/89 (25%) 
65/89 (73%) 


3e-16 


fn3: domain 2 of 6 


876..968 


33/93 (35%) 
68/93 (73%) 


3.1e-16 


fn3: domain 3 of 6 


980-1069 


26/93 (28%) 
69/93 (74%) 


2.9e-16 


fn3: domain 4 of 6 


I081..1167 


24/88 (27%) 
64/88 (73%) 


3.7e-17 


ig: domain 10 of 10 


1194-1255 


1 7/65 (26%) 
46/65 (71%) 


4.3e-09 


fn3 : domain 5 of 6 


1274..1357 


30 86 (35%) 
67/86 (78%) 


1.2e-18 


fn3: domain 6 of 6 


1371-1453 


27/86 (31%) 
53/86 (62%) 


0.045 



Example 93. 



The NOV93 clone was analyzed, and the nucleotide and predicted polypeptide 
sequences are shown in Table 93A. 



Table 93A. NOV93 Sequence Analysis 




SEQIDNO:267 1 1272 bp 


NOV93a, 

CG59800-01 DNA Sequence 


agagcctggtatgcaggaggtgctctgtaaatacctgccccatacatacccgccccat 
acatacccaccccatacatacccaccccatacatacctgccctgtccatacctgcccc 
ctacatacctgccccgtccatacctgccccctacatacctgccccgtccatacctgcc 
ccctacatacctgctctgtctatacctgtggctaggactgtggccttgcttcttagcc 
gctcagagcctgcctcctcctctgcagagtggcggtggtagcagggcttcccgcgcgc 
cgatgctgctcgtggccctggtgctcggcgcctactgcctctgcgccctccccggccg 
ctgcccgccggccgcccgcgcccccgcgccggcccccgcgccctccgagccgtccagc 
tccgtccaccgcccgggagcacccggcctgcctttggccagcggtcccggccgccggc 
gcttcccgcaagcgctcatcgttggcgtgaagaagggcggcacgcgcgccctgctgga 
gtttctgcggctgcaccccgacctccgcgcgctgggctctgagccccacttcttcgac 
aggtgccccgaccgcggcctcgcctggtcccggagtctgatgccccgaaccctggatg 
ggcagatcaccatggagacgaccccgggctacttcgtgacgcgagaggccccccgccg 
catccacgccatgtccccggacacgaagctgatcgtggtggtgcggaaccccgtgacc 
cgggccatctccgactaggcccagacgctctccaagaccccgggcctgcccagcttcc 
gcgccctggccttccgccacggcctgggccccgtggacacagcctggagcgccgtccg 
catcggcctgtacgcccagcacctggaccactggctgcgctacttccccctgtcccac 

tgcaggacttcctgggcctgaaacgggtcgtcacggacaagcacttctacttcaacgc 
caccaagggcttcccctgcctcaagaaggcccagggcggcagccgtccccgctgcctg 
ggcaagtccaagggccggccacacccacgcgtgccccaggccgtggtccggcgcctgc 
aggagttctaccggcccttcaaccgcaggttctaccagatgacgggccaggacttcgg 
ctggggctgagcggcaccctggggatgctcagcaccttgattgacacccgctcg 




ORF Start: GAG at 2 


( >RF Stop: (KiC at 1217 




SEQ ID NO: 268 


405 aa I MW at 43994.8kD 


NOV93a, 

CG59800-01 Protein Sequence 


mqevlckylphtypphtypphtypphtylpcpylpptylprpylpptylprpylppty! 

LLCLYLWLGLWPCFLAAQSLPPPLQSGGGSRASRAPMLLVALVLGAYCLCALPGRCPP 1 
AARAPAPAPAPSEPSSSVHRPGAPGLPLASGPGRRRFPQALIVGVKKGGTRALLEFLR 
LHPDXRALGSEXHFFDRCXXXGLXWXRSLMPRTLDGQITMEXTPXYFVTREAPRRIHA 
MSPDTKLIWVRNPVTRAISDXXQTLSKTPGLPSFRALAFRHGLGPVDTAWSAVRIGL 
YAQHLDHWLRYFPLSHFLFVSGERLVSDPAGEVGRVQDFLGLKRWTDKHFYFNATKG 
E f < ^ ? F r 



Further analysis of the NOV93a protein yielded the following properties shown in 
Table 93B. 



Table 93B. Protein Sequence Properties NOV93a 


PSort 
analysis: 


0.6000 probability located in plasma membrane; 0.4000 probability located in 
Golgi body; 0.3000 probability located in endoplasmic reticulum (membrane); 
0.1000 probability located in mitochondrial inner membrane 


SignalP 
analysis: 


Likely cleavage site between residues 7 and 8 



A search of the NOV93a protein against the Geneseq database, a proprietary 
database that contains sequences published in patents and patent publication, yielded 
several homologous proteins shown in Table 93C. 
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Table 93C. Geneseq Results for NOV93a 


Geneseq 
Identifier 


Protein/O rganism/Len gth 
[Patent #, Date] 


NOV93a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Region 


Expect 
Value 


AAB95507 


Human protein sequence SEQ ID 
NO: 18067 - Homo sapiens, 390 aa. 
[EP1074617-A2, 07-FEB-2001] 


31. .253 
11. .237 


121/229 (52%) 
146/229 (62%) 


4e-55 


AAY17066 


Human 3-OST-3B protein - Homo 
sapiens, 390 aa. [WO9922005-A2, 
06-MAY-1999] 


31. .253 
11. .237 


121/229 (52%) 
146/229 (62%) 


4e-55 


AAB70115 


Human 3-OST-3B - Homo sapiens, 
jyi aa. [WU2UU1 1J91U-A2, 01- 
MAR-2001] 


31. .253 
1 1..238 


121/230 (52%) 
146/230 (62%) 


9e-54 


AAB70114 


Murine 3-OST-3B - Mus sp, 391 
aa. [WO200113910-A2, 01-MAR- 
2001] 


31. .253 
11..238 


119/231 (51%) 
147/231 (63%) 


2e-51 


AAU 12275 




Human PRO5004 polypeptide 
sequence - Homo sapiens, 367 aa. 
[WO200140466-A2, 07-JUN- 
2001] 


86..253 
45..214 




102/170 (60%) 
117/170 (68%) 


9e-48 



In a BLAST search of public sequence databases, the NOV93a protein was found to 
have homology to the proteins shown in the BLASTP data in Table 93D. 



Table 93D. Public BLASTP Results for NOV93a 


Protein 
Accession 
Number 


Protein/Organism/Length 


NOV93a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Portion 


Expect 
Value 


Q96QI5 


C439A6.1 (NOVEL PROTEIN 
SIMILAR TO HEPARAN 
SULFATE (GLUCOSAMINE) 3-0- 
SULFOTRANSFERASES) - Homo 
sapiens (Human), 381 aa (fragment). 


85..253 
61. .229 


160/169 (94%) 
162/169 (95%) 


2e-89 


Q96RX7 


HEPARAN SULPHATE D- 
GLUCOSAMINYL 3-0- 
SULFOTRANSFERASE-3B LIKE - 
Homo sapiens (Human), 311 aa. 


95..253. 
1..159 


153/159(96%) 
155/159 (97%) 


le-85 


Q9Y662 


HEPARAN SULFATE D- 
GLUCOSAMINYL 3-0- 


31..253 
11. .237 


121/229 (52%) 
146/229 (62%) 


le-54 
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2.8.2.23) - Homo sapiens (Human), 
390 aa. 








Q9QZS6 


D-GLYCOSAMINYL 3-0- 
SULFOTRANSFERASE-3B - Mus 
musculus (Mouse), 390 aa. 


31. .253 
11. .237 


119/230 (51%) 
147/230 (63%) 


3e-52 


Q9Y278 


HEPARAN SI II FATE D- 
GLUCOSAMINYL 3-0- 
SULFOTRANSFERASE-2 (EC 
2.8.2.23) - Homo sapiens (Human), 
367 aa. 


86..253 
45..214 


J 02 170 (60 1 o) 
117/170 (68%) 


3e-47 



PFam analysis predicts that the NOV93a protein contains the domains shown in the 
Table 93E. 



Table 93E. Domain Analysis of NOV93a 






Identities/ 


Pfam Domain 


NOV93a Match Region 


Similarities 1 Expect Value 






for the Matched Region 1 


No Significant Matches Found 



Example 94. 



The NOV94 clone was analyzed, and the nucleotide and predicted polypeptide 
sequences are shown in Table 94A. 



Table 94 A. NOV94 Sequence Analysis 




SEQ ID NO: 269 (2949 bp 


NOV94a, 

CG59761-01 DNA Sequence 


GTCCGCCTC'GGG 3 ^AGCCGCCGAGATGGGGGCCGCCCCGGGCCGCGC 


CCCCGCCGGGTCCCGCCCGCCGCGCTGCCGCTGAGCGCATGGGCCCGGACCGCGCCGC 


GCCGCTCCGGGAGCCGGGCCCGGGGTCCCGCCACCACCGCGCGCGGGACAGATTGATT 
CACTTTGGAGCTGTAAGTACTGATGTATTAGGGTGCAGCGCTCATTGTTCATTGACGC 
AGAGTCCCAAAATGAATATCCAAGAGCAGGGTTTCCCCTTGGACCTCGGAGCAAGTTT 
CACCGAAGATGCTCCCCGACCCCCAGTGCCTGGTGAGGAGGGAGAACTGGTGTCCACA 
GACCCGAGGCCCGCCAGCTACAGTTTCTGCTCCGGGAAAGGTGTTGGCATTAAAGGTG 
AGACTTCGACGGCCACTCCGAGGCGCTCGGATCTGGACCTGGGGTATGAGCCTGAGGG 
CAGTGCCTCCCCCACCCCACCATACTTGAAGTGGGCTGAGTCACTGCATTCCCTGCTG 
GATGACCAAGATGGGATAAGCCTGTTCAGGACTTTCCTGAAGCAGGAGGGCTGTGCCG 
ACTTGCTGGACTTCTGGTTTGCCTGCACTGGCTTCAGGAAGCTGGAGCCCTGTGACTC 
GAACGAGGAGAAGAGGCTGAAGCTGGCGAGAGCCATCTACCGAAAGTACATTCTTGAT 
AACAATGGCATCGTGTCCCGGCAGACCAAGCCAGCCACCAAGAGCTTCATAAAGGGCT 
GCATCATGAAGCAGCTGATCGATCCTGCCATGTTTGACCAGGCCCAGACCGAAATCCA 
GGCCACTATGGAGGAAAACACCTATCCCTCCTTCCTTAAGTCTGATATTTATTTGGAA 
TATACGAGGACAGGCTCGGAGAGCCCCAAAGTCTGTAGTGACCAGAGCTCTGGGTCAG 
GGACAGGGAAGGGCATATCTGGATACCTGCCGACCTTAAATGAAGATGAGGAATGGAA 
GTGTGACCAGGACATGGATGAGGACGATGGCAGAGACGCTGCTCCCCCCGGAAGACTC 
CCTCAGAAGCTGCTCCTGGAGACAGCTGCCCCGAGGGTCTCCTCCAGTAGACGGTACA 
GCGAAGGCAGAGAGTTCAGGTATGGATCCTGGCGGGAGCCAGTCAACCCCTATTATGT 
CAATGCCGGCTATGCCCTGGCCCCAGCCACCAGTGCCAACGACAGCGAGCAGCAGAGC 
CTGTCCAGCGATGCAGACACCCTGTCCCTCACGGACAGCAGCGTGGATGGGATCCCCC 
CATACAGGATCCGTAAGCAGCACCGCAGGGAGATGCAGGAGAGCGTGCAGGTCAATGG 
GCGGGTGCCCCTACCTCACATTCCCCGCACGTACCGGGTGCCGAAGGAGGTCCGCGTG 
GAGCCTCAGAAGTTCGCGGAGGAGCTCATCCACCGCCTGGAGGCTGTGCAGCGCACGC 
GGGAGGCCGAGGAGAAGCTGGAGGAGCGGCTGAAGCGCGTGCGCATGGAGGAGGAAGG 
TGAGGACGGCGATCCATCATCAGGGCCCCCAGGGCCGTGTCACAAGCTGCCTCCCGCC 
CCCGCTTGGCACCACTTCCCGCCCCGCCTGTGTTGGACATGGGCTTGTGCCGGGCTCC 
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GGGATGCACACGAGGAGAACCCTGAGAGCATCCTGGACGAGCACGTACAGCGTGTGCT 
GAGGACACCTGGCCGCCAGTCGCCTGGGCCTGGCCATCGCTCCCCGGACAGTGGGCAC 
GTGGCCAAGATGCCAGTGGCACTGGGGGGTGCCGCCTCGGGGCACGGGAAGCACGTAC 
CCAAGTCAGGGGCGAAGCTGGACGCGGCCGGCCTGCACCACCACCGACACGTCCACCA 
CCACGTCCACCACAGCACAGCCCGGCCCAAGGAGCAGGTGGAGGCCGAGGCCACCCGC 
AGGGCCCAGAGCAGCTTCGCCTGGGGCCTGGAACCACACAGCCATGGGGCAAGGTCCC 
GAGGCTACTCAGAGAGTGTTGGCGCTGCCCCCAACGCCAGTGATGGCCTCGCCCACAG 
TGGGAAGGTGGGCGTGGCGTGCAAAAGAAATGCCAAGAAGGCCGAGTCGGGGAAGAGC 
GCCAGCACCGAGGTGCCAGGTGCCTCGGAGGATGCGGAGAAGAACCAGAAAATCATGC 
AGTGGATCATTGAGGGGGAAAAGGAGATCAGCAGGCACCGCAGGACCGGCCACGGGTC 
TTCGGGGACGAGGAAGCCACAGCCCCATGAGAACTCCAGACCCTTGTCCCTTGAGCAC 
CCCTGGGCCGGCCCTCAGCTCCGGACCTCCGTGCAGCCCTCCCACCTCTTCATCCAAG 

CCGACGTCTGGAGGAGGAAGAAAAGAGAGCCAGCCGAGCACCCTCCAAGCAGAGGTAT 

TGCACGTGGTACCAGCCGTGTCGGACATGGAGCTCTCCGAGACAGAGACAAGATCGCA 
GAGGAAGGTGGGCGGCGGGAGTGCCCAGCCGTGTGACAGCATCGTTGTGGCGTACTAC 
TTCTGCGGGGAACCCATCCCCTACCGCACCCTGGTGAGGGGCCGCGCTGTCACCCTGG 
GCCAGTTCAAGGAGCTGCTGACCAAAAAGGGCAGCTACAGATACTACTTCAAGAAAGT 
GAGCGACGAGTTTGACTGTGGGGTGGTGTTTGAGGAGGTTCGAGAGGACGAGGCCGTC 
CTGCCCGTCTTTGAGGAGAAGATCATCGGCAAAGTGGAGAAGGTGGACTGATAGGCTG 
GTGGGCTGGCCGCTGTGCCAGGCGAGGCCCTTGGCGGGCACGGGTGTCACGGCCAGGC 


t < , 3 i l r — i 




ORF Start: ATG at 97jORF Stop: TGA at 2833 




SEQ ID NO: 270 [912 aa |mW at 101 1 18.1kD 


NOV94a, 

CG59761-01 Protein Sequence 


MGPDRAAPLREPGPGSRHHRARDRLIHFGAVSTDVLGCSAHCSLTQSPKMNIQEQGFP 
LDLGASFTEDAPRPPVPGEEGELVSTDPRPASYSFCSGKGVGIKGETSTATPRRSDLD 
LGYEPEGSASPTPPYLKWAESLHSLLDDQDGISLFRTFLKQEGCADLLDFWFACTGFR 

QAQTEIQATMEENTYPSFLKSDIYLEYTRTGSESPKVCSDQSSGSGTGKGISGYLPTL 
NEDEEWKCDQDMDEDDGRDAAPPGRLPQKLLLETAAPRVSSSRRYSEGREFRYGSWRE 
PVNPYYVNAGYALAPATSANDSEQQSLSSDADTLSLTDSSVDGIPPYRIRKQHRREMQ 
ESVQVNGRVPLPHIPRTYRVPKEVRVEPQKFAEELIHRLEAVQRTREAEEKLEERLKR 
VRMEEEGEDGDPSSGPPGPCHKLPPAPAWHHFPPRLCWTWACAGLRDAHEENPESILD 
EHVQRVLRTPGRQSPGPGHRSPDSGHVAKMPVALGGAASGHGKHVPKSGAKLDAAGLH 
HHRHVHHHVHHSTARPKEQVEAEATRRAQSSFAWGLEPHSHGARSRGYSESVGAAPNA 
SDGLAHSGKVGVACKRNAKKAESGKSASTEVPGASEDAEKNQKIMQWIIEGEKEISRH 
RRTGHGSSGTRKPQPHENSRPLSLEHPWAGPQLRTSVQPSHLFIQDPTMPPHPAPNPL 
TQLEEARRRLEEEEKRASRAPSKQRYVQEVMRRGRACVRPACAPVLHWPAVSDMELS 
ETETRSQRKVGGGSAQPCDSIWAYYFCGEPIPYRTLVRGRAVTLGQFKELLTKKGSY 



Further analysis of the NOV94a protein yielded the following properties shown in 
Table 94B. 



Table 94B. Protein Sequence Properties NOV94a 


PSort 
analysis: 


0.6000 probability located in nucleus; 0.3000 probability located in 
microbody (peroxisome); 0.1000 probability located in mitochondrial matrix 
space; 0.1000 probability located in lysosome (lumen) 


SignalP 
analysis: 


No Known Signal Sequence Predicted 



A search of the NOV94a protein against the Geneseq database, a proprietary 
database that contains sequences published in patents and patent publication, yielded 
several homologous proteins shown in Table 94C. 
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Table 94C. Ceneseq Results for NOV94a 


Geneseq 
Identifier 


Protein/Organism/Length 
[Patent #, Date] 


NOV94a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Region 


Expect 
Value 


AAG68175 


Wnt signaling protein SEQ ID 
NO:91 - Homo sapiens, 900 aa. 
[WO200177327-A1, 18-OCT- 
2001] 


13..912 
1..900 


898/900 (99%) 
898/900 (99%) 


0.0 


AAW96264 


Human axin - Homo sapiens, 900 
aa. [WO9902179-A1, 21-JAN- 
1999] 


13. .912 
1..900 


898/900 (99%) 
898/900 (99%) 


0.0 


AAWyozoj 


Murine axin - Mus musculus, 992 
aa. [WO9902179-A1, 21-JAN- 
1999] 


6. .912 
84..992 


781/914 (85%) 
820/914 (89%) 


0.0 


AAW93569 


Human conductin protein - Homo 
sapiens, 840 aa. [WO9911780-A2, 1 
11 -MAR- 1999] 


60..912 
12..840 


378/892 (42%) 
506/892 (56%) 


e-171 


AAW93570 


Human conductin protein - Homo 
sapiens, 840 aa. [W0991 1780-A2, 
11 -MAR- 1999] 


60..912 
12..840 


378/892(42%) | 
506/892 (56%) 




e-171 | 



In a BLAST search of public sequence databases, the NOV94a protein was found to 
have homology to the proteins shown in the BLASTP data in Table 94D. 



Table 94D. Public BLASTP Results for NOV94a 



Protein 
Accession 
Number 


Protein/Organism/Length 


NOV94a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Portion 


Expect 
Value 


015169 


Axin 1 (Axis inhibition protein 1) 
(hAxin) - Homo sapiens (Human), 
900 aa (fragment). 


13..912 
1..900 


898/900 (99%) 
898/900 (99%) 


0.0 


Q96S28 


AXIN - Homo sapiens (Human), 
862 aa. 


50..912 
1..862 


858/863 (99%) 
858/863 (99%) 


0.0 


035625 


Axin 1 (Axis inhibition protein 1 ) 
(Fused protein) - Mus musculus 
(Mouse), 992 aa (fragment). 


6..912 
84.. 992 


781/914(85%) 
820/914(89%) 


0.0 


070239 


Axin 1 protein (Axis inhibition 
protein 1) (rAxin) - Rattus 
norvegicus (Rat), 893 aa 
(fragment). 


6..912 
21. .893 


756/914 (82%) 
793/914(86%) 


0.0 
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T08422 


negative regualtor axin [imported] 


46..912 


726/872(83%) j 






- rat, 832 aa. 


2..832 


760/872(86%) j 


o.o | 



PFam analysis predicts that the NOV94a protein contains the domains shown in the 
Table 94E. 



Table 94E. Domain Analysis of NOV94a 


Pfam Domain 


NOV94a Match Region 


Identities/ 
Similarities 
for the Matched Region 


Expect Value 


RGS: domain 1 of 2 


137..198 


23/75 (31%) 
44/75 (59%) 


5.6e-06 


RGS: domain 2 of 2 


231. .260 


13/30 (43%) 
21/30 (70%) 


0.12 


TP2: domain 1 of 1 


585..709 


33/147(22%) 
52/147 (35%) 


9.6 


DIX: domain 1 of 1 


830..912 


40/86 (47%) 
83/86 (97%) 


5.6e-44 



Example 95. 

The NOV95 clone was analyzed, and the nucleotide and predicted polypeptide 
sequences are shown in Table 95A. 



Table 95A. NOV95 Sequence Analysis 




SEQ ID NO: 271 |2223 bp 


NOV95a, 

CG59756-01 DNA Sequence 


TTGCAGGCATCA" , 1 1CTGGAGGACGGGGAGGTTGTCAGGG 


GCTATGATGAGATGAGTGGGGGCCGCTTCGACTTTGATGATGGAGGGGCGTACTGCGG 
GGGCTGGGAGGGGGGAAAGGCCCATGGGCATGGACTGTGCACAGGCCCCAAGGGCCAG 
GGCGAATACTCTGGCTCCTGGAACTTTGGCTTTGAGGTGGCAGGTGTCTACACCTGGC 
CCAGCGGAAACACCTTTGAGGGATACTGGAGCCAGGGCAAACGGCATGGGCTGGGCAT 
AGAGACCAAGGGGCGCTGGCTCTACAAGGGCGAGTGGACACATGGCTTCAAGGGACGC 
TACGGAATCCGGCAGAGCTCAAGCAGCGGTGCCAAGTATGAGGGCACCTGGAACAATG 
GCCTGCAAGACGGCTATGGCACCGAGACCTATGCTGATGGAGGGACGTACCAAGGCCA 
GTTCACCAACGGCATGCGCCATGGCTACGGAGTACGCCAGAGCGTGCCCTACGGGATG 
GCCGTGGTGGTGCGCTCGCCGCTGCGCACGTCGCTGTCGTCCCTGCGCAGCGAGCACA 




GCCCTCGCCCGCCATCCCGCGTGGCGGCTTCGCGCTCAGCCTCCTGGCCAATGCCGAG 




AGCTGCGGCGCGCAGAGTCGCGCACGTCCGTGGGTAGCCAGCGCAGCCGTGTCAGCTT 
CCTTAAGAGCGACCTCAGCTCGGGCGCCAGCGACGCCGCGTCCACCGCCAGCCTGGGA 
GAGGCCGCCGAGGGCGCCGACGAGGCCGCACCCTTCGAGGCCGATATCGACGCCACCA 
CCACCGAGACCTACATGGGCGAGTGGAAGAACGACAAACGCTCGGGCTTCGGCGTGAG 
CGAACGCTCCAGTGGCCTCCGCTACGAGGGCGAGTGGCTGGACAACCTGCGCCACGGC 
TATGGCTGCACCACGCTGCCCGACGGCCACCGCGAGGAGGGCAAGTACCGCCACAACG 
TGCTGGTCAAGGACACCAAGCGCCGCATGCTGCAGCTCAAGAGCAACAAGGTCCGCCA 
GAAAGTGGAGCACAGTGTGGAGGGTGCCCAGCGCGCCGCTGCTATCGCGCGCCAGAAG 
GCCGAGATTGCCGCCTCCAGGACAAGCCACGCCAAGGCCAAAGCTGAGGCAGCGGAAC 
AGGCCGCCCTGGCTGCCAACCAGGAGTCCAACATTGCTCGCACTTTGGCCAGGGAGCT 
GGCTCCGGACTTCTACCAGCCAGGTCCGGAATATCAGAAGCGCCGGCTGCTGCAGGAG 
ATCCTGGAGAACTCGGAGAGCCTGCTGGAGCCCCCCGACCGGGGCGCCGGCGCAGCGG 
GCCTCCCACAGCCGCCCCGCGAGAGCCCGCAGCTGCACGAGCGTGAGACCCCTCGGCC 
CGAGGGTGGCTCCCCGTCACCGGCCGGGACGCCCCCGCAGCCCAAGCGGCCCAGGCCC 
GGGGTGTCCAAGGACGGCCTGCTGAGCCCAGGCGCCTGGAACGGCGAGCCCAGCGGTG 
AGGGCAGCCGGTCAGTCACTCCGTCCGAGGGCGCGGGCCGCCGCAGCCCCGCGCGTCC 
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AGCCACCGAGCGCATGGCCATCGAGGCTCTGCAGGCACCGCCTGCGCCGTCGCGGGAG 
CCGGAGGTGGCGCTTTACCAGGGCTACCACAGCTATGCTGTGCGCACCACGCCGCCCG 
AGCCCCCACCCTTTGAGGACCAGCCCGAGCCCGAGGTCTCCGGGTCCGAGTCCGCGCC 
CTCGTCCCCGGCCACCGCCCCGCTGCAGGCCCCCACGCTCCGAGGCCCCGAGCCTGCA 
CGCGAGACCCCCGCCAAGCTGGAGCCCAAGCCCATCATCCCCAAAGCCGAGCCCAGGG 
CCAAGGCCCGCAAGACTGAGGCTCGAGGGCTGACCAAGGCGGGGGCCAAGAAGAAGGC 
GCGGAAGGAGGCCGCACTGGCGGCAGAGGCGGAGGTGGAGGTGGAAGAGGTCCCCAAC 
ACCATCCTCATCTGCATGGTGATCCTGCTGAACATCGGCCTGGCCATCCTCTTTGTTC 
ACCTCCTGACCTGACCGTCGCTTACCAGGTGCAGCCAGCTGGCTGGAGGAGGGGTTGG 






ORF Start: ATG at 70 


ORF Stop: TGA at 2158 




SEQIDNO: 272 


696 aa ImW at 74220.7kD 


NOV95a, 

CG59756-01 Protein Sequence 


MSGGRFDFDDGGAYCGGWEGGKAHGHGLCTGPKGQGEYSGSWNFGFEVAGVYTWPSGN 
TFEGYWSQGKRHGLGIETKGRWLYKGEWTHGFKGRYGIRQSSSSGAKYEGTWNNGLQD 
GYGTETYADGGTYQGQFTNGMRHGYGVRQSVPYGMAVWRSPLRTSLSSLRSEHSNGT 
VAPDSPASPASDGPALPSPAIPRGGFALSLLANAEAAARAPKGGGLFQRGALLGKLRR 
AESRTSVGSQRSRVSFLKSDLSSGASDAASTASLGEAAEGADEAAPFEADIDATTTET 
YMGEWKNDKRSGFGVSERSSGLRYEGEWLDNLRHGYGCTTLPDGHREEGKYRHNVLVK 
DTKRRMLQLKSNKVRQKVEHSVEGAQRAAAIARQKAEIAASRTSHAKAKAEAAEQAAL 
AANQESNIARTLARELAPDFYQPGPEYQKRRLLQEILENSESLLEPPDRGAGAAGLPQ 
PPRESPQLHERETPRPEGGSPSPAGTPPQPKRPRPGVSKDGLLSPGAWNGEPSGEGSR 
SVTPSEGAGRRSPARPATERMAIEALQAPPAPSREPEVALYQGYHSYAVRTTPPEPPP 
FEDQPEPEVSGSESAPSSPATAPLQAPTLRGPEPARETPAKLEPKPIIPKAEPRAKAR 
, 1 i.i i i j 1 LC l - T ! 



Further analysis of the NOV95a protein yielded the following properties shown in 
Table 95B. 



Table 95B. Protein Sequence Properties NOV95a 


PSort 
analysis: 


0.8000 probability located in nucleus; 0.7000 probability located in plasma 
membrane; 0.3133 probability located in microbody (peroxisome); 0.2000 
probability located in endoplasmic reticulum (membrane) 


SignalP 
analysis: 


No Known Signal Sequence Predicted 



A search of the NOV95a protein against the Geneseq database, a proprietary 
database that contains sequences published in patents and patent publication, yielded 
several homologous proteins shown in Table 95C. 



Table 95C. Geneseq Results for NOV95a 



Geneseq 
Identifier 


Protein/Organism/Length [Patent 
#, Date] 


NOV95a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Region 


Expect 
Value 


AAM79123 


Human protein SEQ ID NO 1785 - 
Homo sapiens, 628 aa. 
[WO200157190-A2, 09-AUG-2001] 


3..696 
4..628 


293/704 (41%) 
377/704 (52%) 


e-127 


AAM80107 


Human protein SEQ ID NO 3753 - 

Homo sapiens, 378 aa. 

[W< >200157I90- V2. 09-AUG-2 | 


283..696 
24..378 


146/421 (34%) 
194/421 (45%) 




2e-43 
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Protein #3682 encoded by probe for 
measuring heart cell gene expression 
- Homo sapiens, 135 aa. 
[WO200157274-A2, 09-AUG-2001] 


257.389 
6..135 


78/133 (58%) 
104/133 (77%) 


7e-42 


AAM57089 


probe encoded protein SEQ ID NO: 
29194 - Homo sapiens, 135 aa. 
[WO200157275-A2, 09-AUG-2001] 


257 389 
6..135 


78/1 11 f 

/ O/ 1 J J ^JO /o) 

104/133 (77%) 




AAM 17323 


Peptide #3757 encoded by probe for 
measuring cervical gene expression 
- Homo sapiens, 135 aa. 
[WO200157278-A2, 09-AUG-2001] 


257..389 
6..135 


78/133(58%) 
104/133 (77%) 

,,„.,„ 


7e-42 



In a BLAST search of public sequence databases, the NOV95a protein was found to 
have homology to the proteins shown in the BLASTP data in Table 95D. 



Table 95D. Public BLASTP Results for NOV95a 


Protein 
Accession 
Number 


Protein/Organism/Length 


NOV95a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities 
for the 
Matched 
Portion 


Expect 
Value 


Q9GKY7 


JUNCTOPHILIN TYPE 2 - 
Oryctolagus cuniculus (Rabbit), 694 aa. 


1..696 
1..694 


644/701 (91%) 
662/701 (93%) 


0.0 


Q9ET79 


JUNCTOPHILIN TYPE 2 - Mus 
musculus (Mouse), 696 aa. 


1..696 
1..696 


608/706 (86%) 
644/706 (91%) 


0.0 


Q9BR39 


DJ1108D11.1 (NOVEL PROTEIN 
SIMILAR TO C. ELEGANS T22C1 .7 ) 
- Homo sapiens (Human), 552 aa 
(fragment). 


128..672 
1..545 


544/545 (99%) 
544/545 (99%) 


0.0 


Q9GKY8 


MITSUGUMIN72/JUNCTOPHILIN 
TYPE1 - Oryctolagus cuniculus 
(Rabbit), 662 aa. 


1..696 
1..662 


364/704(51%) 
468/704 (65%) 


0.0 | 


Q9L 1 80 


JUNCTOPHILIN TYPE 1 - Mus 
musculus (Mouse), 660 aa. 


1..696 
1..660 


371/707 (52%) 
469/707 (65%) 




0.0 



PFam analysis predicts that the NOV95a protein contains the domains shown in the 
Table 95E. 
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Table 95E. Domain Analysis of NOV95a 


Pfam Domain 


NOV95a Match 
Region 


Identities/ 
Similarities 
for the Matched 
Region 


1 

Expect 
Value 


MORN: domain 1 of 
7 


14..36 


10/23 (43%) 
13/23 (57%) 


1.1 


MORN: domain 2 of 
7 


38..59 


9/23 (39%) 
15/23 (65%) 


0.31 


MORN: domain 3 of 


60..77 


8/23 (35%) 
15/23 (65%) 


3 


MORN: domain 4 of 
7 


106..128 


1 1/23 (48% ) 
20/23 (87%) 


3.7e-06 


MORN: domain 5 of 
7 


129..151 


8/23 (35%) 
15/23 (65%) 


0.027 


MORN: domain 6 of 
7 


291..313 


12/23 (52%) 
19/23 (83%) 


0.00056 


MORN: domain 7 of 
7 


314..336 


1 1/23 (48%) 
19/23 (83%) 


0.00022 



Example 96. 



The NOV96 clone was analyzed, and the nucleotide and predicted polypeptide 
sequences are shown in Table 96A. 



Table 96A. NOV96 Sequence Analysis 




SEQIDNO:273 '3257 bp 


NOV96a, 

CG59708-01 DNA Sequence 


CGTAGGCGCTTCGGCCATGACTGCGGAGCTGCAGCAGGACGACGCGGCCGGCGCGGCA 
GACGGCCACGGCTCGAGCTGCCAAATGCTGTTAAATCAACTGAGAGAAATCACAGGCA 
TTCAGGACCCTTCCTTTCTCCATGAAGCTCTGAAGGCCAGTAATGGTGACATTACTCA 
GGCAGTCAGCCTTCTCACTGATGAGAGAGTTAAGGAGCCCAGTCAAGACACTGTTGCT 
ACAGAACCATCTGAAGTAGAGGGGAGTGCTGCCAACAAGGAAGTATTAGCAAAAGTTA 
TAGACCTTACTCATGATAACAAAGATGATCTTCAGGCTGCCATTGCTTTGAGTCTACT 
GGAGTCTCCCAAAATTCAAGCTGATGGAAGAGATCTTAACAGGATGCATGAAGCAACC 
TCTGCAGAAACTAAACGCTCAAAGAGAAAACGCTGTGAAGTCTGGGGAGAAAACCCCA 
ATCCCAATGACTGGAGGAGAGTTGATGGTTGGCCAGTTGGGCTGAAAAATGTTGGCAA 




CTTGTTCTCAGTTATAGTCTGCCACAAAATGTACTTGAAAATTGTCGAAGTCATACAG 
AAAAGAGAAATATCATGTTTATGCAAGAGCTTCAGTATTTGTTTGCTCTAATGATGGG 
ATCAAATAGAAAATTTGTAGACCCGTCTGCAGCCCTGGATCTATTAAAGGGAGCATTC 
CGATCATCTGAGGAACAGCAGCAAGATGTGAGTGAATTCACACACAAGCTCCTGGATT 
GGCTAGAGGACGCATTCCAGCTAGCTGTTAATGTTAACAGTCCCAGGAACAAATCTGA 
AAATCCAATGGTGCAGCTGTTCTATGGTACTTTCCTGACTGAAGGGGTTCGTGAAGGA 
AAACCCTTTTGTAACAATGAGACCTTCGGCCAGTATCCTCTTCAGGTAAACGGTTATC 
GCAACTTAGACGAGTGTTTGGAAGGGGCCATGGTGGAGGGTGATGTTGAGCTTCTTCC 
CTCCGATCACTCGGTGAAGTATGGACAAGAGCGTTGGTTTACAAAGCTACCTCCAGTG 
TTGACCTTTGAACTCTCAAGATTTGAGTTTAATCAGTCCCTTGGGCAGCCAGAGAAAA 
TTCACAATAAGCTGGAATTTCCTCAGATTATTTATATGGACAGGTACATGTACAGGAG 
CAAGGAGCTTATTCGAAATAAGAGAGAGTGTATTCGAAAGTTGAAGGAGGAAATAAAA 
ATTCTGCAGCAAAAATTGGAAAGGTATGTGAAATATGGCTCAGGCCCAGCTCGGTTCC 
CGCTCCCGGACATGCTGAAATATGTTATTGAATTTGCTAGTACAAAACCTGCCTCAGA 
AAGCTGTCCACCTGAAAGTGACACACATATGACATTACCACTTTCTTCAGTGCACTGC 
TCGGTTTCT I i r ^3 - t ^ 
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TTGAAAGTACCTTTTCTTCTCCTGAAGATTCTTTACCCAAGTCTAAACCACTGACATC 
TTCTCGGTCTTCCATGGAAATGCCTTCACAGCCAGCTCCACGAACAGTCACAGATGAG 
GAGATAAATTTTGTTAAGACCTGTCTTCAGAGATGGAGGAGTGAGATTGAACAAGATA 
TACAAGATTTAAAGACTTGTATTGCAAGTACTACTCAGACTATTGAACAGATGTACTG 
CGATCCTCTCCTTCGTCAGGTGCCTTATCGCTTGCATGCAGTTCTTGTTCATGAAGGA 
CAAGCAAATGCTGGACACTATTGGGCCTATATCTATAATCAACCCCGACAGAGCTGGC 
TCAAGTACAATGACATCTCTGTTACTGAATCTTCCTGGGAAGAAGTTGAAAGAGATTC 
CTATGGAGGCCTGAGAAATGTTAGTGCTTACTGTCTGATGTACATTAATGCCAAACTA 
CCCTACTTCAATGCAGAGGCAGCCCCAACTGAATCAGATCAAATGTCAGAAGTGGAAG 
CCCTATCTGTGGAACTCAAGCATTACATTCAGGAGGATAACTGGCGGTTTGAGCAGGA 
AGTAGAGGAGTGGGAAGAAGAGCAGTCTTGCAAAATCCCTCAAATGGAGTCCTCCCCC 
AACTCCTCATCACAGGGCTACTCTACATCACAAGAGCCTTCAGTAGCCTCTTCTCATG 
GGGTTCGCTGCTTGTCATCTGAGCATGCTGTGATTGTAAAGGAGCAAACTGCCCAGGC 
TATTGCAAACACAGCCCGTGCCTATGAGAAGAGCGGTGTAGAAGCGGCACTGAGTGAG 
GCATTCCATGAAGAATACTCCAGGCTCTATCAGCTTGCCAAAGAGACCCCCACCTCTC 
ACAGTGATCCTCGACTTCAGCATGTCCTTGTCTACTTTTTCCAAAATGAAGCACCCAA 
AAGGGTAGTAGAACGAACCCTTCTGGAACAGTTTGCAGATAAAAATCTTAGCTATGAT 
GAAAGATCAATCAGCATTATGAAGGTGGCTCAAGCGAAACTGAAGGAAATTGGTCCAG 
ATGACATGAATATGGAAGAGTACAAGAGGTGGCATGAAGATTATAGTTTGTTCCGAAA 
AGTGTCTGTGTATCTCCTAACAGGCCTAGAACTCTATCAAAAAGGAAAGTACCAAGAG 
GCACTTTCCTACCTGGTATATGCCTACCAGAGCAATGCTGCCCTGCTGATGAAGGGGC 
CCCGCCGGGGGGTCAAAGAATCCGTGATTGCTTTATACCGAAGAAAATGCCTTCTGGA 
GCTGAATGCCAAAGCAGCTTCTCTTTTTGAAACAAATGATGATCACTCCGTAACTGAG 
GGCATTAATGTGATGAATGAACTGATCATCCCCTGCATTCACCTTATCATTAATAATG 
ACATTTCCAAGGATGATCTGGATGCCATTGAGGTCATGAGAAACCATTGGTGCTCTTA 
CCTTGGGCAAGATATTGCAGAAAATCTGCAGCTGTGCCTAGGGGAGTTTCTACCCAGA 
CTTCTAGATCCTTCTGCAGAAATCATCGTCTTGAAAGAGCCTCCAACTATTCGACCCA 
ATTCTCCCTATGACCTATGTAGCCGATTTGCAGCTGTCATGGAGTCAATTCAGGGAGT 
TTCAACTGTGACAGTGAAATAAGCTCCCACATGTTCAAGGCCCATTCTGGTTCCTGGC 
TGCCTGCCTCTTGCACAGAAGTTCGTTGTCATAGTGCTCACCTTGGGAAAAGGATTAG 






ORF Start: ATG at 
17 


ORF Stop: TAA at 3 152 




SEQ ID NO: 274 


1045 aa [MW at 1 19041 .7kD 


NOV96a, 

CG59708-01 Protein 
Sequence 


MTAELQQDDAAGAADGHGSSCQMLLNQLREITGIQDPSFLHEALKASNGDITQAVSLL 
TDERVKEPSQDTVATEPSEVEGSAANKEVLAKVIDLTHDNKDDLQAAIALSLLESPKI 
QADGRDLNRMHEATSAETKRSKRKRCEVWGENPNPNDWRRVDGWPVGLKNVGNTCWFS 
AVIQSLFQLPEFRRLVLSYSLPQKVLENCRSHTEKRNIMFMQELQYLFALMMGSNRKF 
VDPSAALDLLKGAFRSSEEQQQDVSEFTHKLLDWLEDAFQLAVNVNSPRNKSENPiVIVQ 
LFYGTFLTEGVREGKPFCNNETFGQYPLQVNGYRNLDECLEGAMVEGDVELLPSDHSV 
KYGQERWFTKLPPVLTFELSRFEFNQSLGQPEKIHNKLEFPQIIYMDRYMYRSKELIR 
NKRECIRKLKEEIKILQQKLERYVKYGSGPARFPLPDMLKYVIEFASTKPASESCPPE 
SDTHMTLPLSSVHCSVSDQTSKESTSTESSSQDVESTFSSPEDSLPKSKPLTSSRSSM 
EMPSQPAPRTVTDEEINFVKTCLQRWRSEIEQDIQDLKTCIASTTQTIEQMYCDPLLR 
QVPYRLHAVLVHEGQANAGH YWAY I YNQPRQSWLKYNDI S VTES SWEEVERDS YGGLR 
NVSAYCLMYINAKLPYFNAEAAPTESDQMSEVEALSVELKHYIQEDNWRFEQEVEEWE 
EEQSCKIPQMESSPNSSSQGYSTSQEPSVASSHGVRCLSSEHAVIVKEQTAQAIANTA 
RAYEKSGVEAALSEAFHEEYSRLYQLAKETPTSHSDPRLQHVLVYFFQNEAPKRWER 
TLLEQFADKNLSYDERSISIMKVAQAKLKEIGPDDMNMEEYKRWHEDYSLFRKVSVYL 
LTGLELYQKGKYQEALSYLVYAYQSNAALLMKGPRRGVKESVIALYRRKCLLELNAKA 
ASLFETNDDHSVTEGINVMNELI IPCIHLI INNDISKDDLDAIEVMRNHWCSYLGQDI 
AENLQLCLGEFLPRLLDPSAEIIVLKEPPTIRPNSPYDLCSRFAAVMESIQGVSTVTV 
K 


si i,) II) NO: 275 


3044 bp 


NOV96b, 

CG59708-02 DNA Sequence 


T? atga:tg t - - ; :cgg"'JCGGCa 

? i i rCGAGCTGCCAAATGCTGTTAAATCAACTGAGAGAAATCACAGGCA 
TTCAGGACCCTTCCTTTCTCCATGAAGCTCTGAAGGCCAGTAATGGTGACATTACTCA 
GGCAGTCAGCCTTCTCACTGATGAGAGAGTTAAGGAGCCCAGTCAAGACACTGTTGCT 
ACAGAACCATCTGAAGTAGAGGGGAGTGCTGCCAACAAGGAAGTATTAGCAAAAGTTA 
TAGACCTTACTCATGATAACAAAGATGATCTTCAGGCTGCCATTGCTTTGAGTCTACT 
GGAGTCTCCCAAAATTCAAGCTGATGGAAGAGATCTTAACAGGATGCATGAAGCAACC 
TCTGCAGAAACTAAACGCTCAAAGAGAAATATCATGTTTATGCAAGAGCTTCAGTATT 
TGTTTGCTCTAATGATGGGATCAAATAGAAAATTTGTAGACCCGTCTGCAGCCCTGGA 
TCTATTAAAGGGAGCATTCCGATCATCTGAGGAACAGCAGCAAGATGTGAGTGAATTC 
ACACACAAGCTCCTGGATTGGCTAGAGGACGCATTCCAGCTAGCTGTTAATGTTAACA 
GTCCCAGGAACAAATCTGAAAATCCAATGGTGCAGCTGTTCTATGGTACTTTCCTGAC 
TGAAGGGGTTCGTGAAGGAAAACCCTTTTGTAACAATGAGACCTTCGGCCAGTATCCT 
CTTCAGGTAAACGGTTATCGCAACTTAGACGAGTGTTTGGAAGGGGCCATGGTGGAGG 
GTGATGTTGAGCTTCTTCCCTCCGATCACTCGGTGAAGTATGGACAAGAGCGTTGGTT 
TACAAAGCTACCTCCAGTGTTGACCTTTGAACTCTCAAGATTTGAGTTTAATCAGTCC 
CTTGGGCAGCCAGAGAAAATTCACAATAAGCTGGAATTTCCTCAGATTATTTATATGG 
ACAGGTACATGTACAGGAGCAAGGAGCTTATTCGAAATAAGAGAGAGTGTATTCGAAA 
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GTTGAAGGAGGAAATAAAAATTCTGCAGCAAAAATTGGAAAGGTATGTGAAATATGGC 

TCAGGCCCAGCTCGGTTCCCGCTCCCGGACATGCTGAAATATGTTATTGAATTTGCTA 

GTACAAAACCTGCCTCAGAAAGCTGTCCACCTGAAAGTGACACACATATGACATTACG 

ACTTTCTTCAGTGCACTGCTCGGTTTCTGACCAGACATCCAAGGAAAGTACAAGTACA 

GAAAGCTCTTCTCAGGATGTTGAAAGTACCTTTTCTTCTCCTGAAGATTCTTTACCCA 

AGTCTAAACCACTGACATCTTCTCGGTCTTCCATGGAAATGCCTTCACAGCCAGCTCC 

ACGAACAGTCACAGATGAGGAGATAAATTTTGTTAAGACCTGTCTTCAGAGATGGAGG 

AGTGAGATTGAACAAGATATACAAGATTTAAAGACTTGTATTGCAAGTACTACTCAGA 

CTATTGAACAGATGTACTGCGATCCTCTCCTTCGTCAGGTGCCTTATCGCTTGCATGC 

AGTTCTTGTTCATGAAGGACAAGCAAATGCTGGACACTATTGGGCCTATATCTATAAT 

CAACCCCGACAGAGCTGGCTCAAGTACAATGACATCTCTGTTACTGAATCTTCCTGGG 

AAGAAGTTGAAAGAGATTCCTATGGAGGCCTGAGAAATGTTAGTGCTTACTGTCTGAT 

GTACATTAATGCCAAACTACCCTACTTCAATGCAGAGGCAGCCCCAACTGAATCAGAT 

CAAATGTCAGAAGTGGAAGCCCTATCTGTGGAACTCAAGCATTACATTCAGGAGGATA 

ACTGGCGGTTTGAGCAGGAAGTAGAGGAGTGGGAAGAAGAGCAGTCTTGCAAAATCCC 

TCAAATGGAGTCCTCCCCCAACTCCTCATCACAGGGCTACTCTACATCACAAGAGCCT 

TCAGTAGCCTCTTCTCATGGGGTTCGCTGCTTGTCATCTGAGCATGCTGTGATTGTAA 

AGGAGCAAACTGCCCAGGCTATTGCAAACACAGCCCGTGCCTATGAGAAGAGCGGTGT 

AGAAGCGGCACTGAGTGAGGCATTCCATGAAGAATACTCCAGGCTCTATCAGCTTGCC 

AAAGAGACCCCCACCTCTCACAGTGATCCTCGACTTCAGCATGTCCTTGTCTACTTTT 

TCCAAAATGAAGCACCCAAAAGGGTAGTAGAACGAACCCTTCTGGAACAGTTTGCAGA 

TAAAAATCTTAGCTATGATGAAAGATCAATCAGCATTATGAAGGTGGCTCAAGCGAAA 

CTGAAGGAAATTGGTCCAGATGACATGAATATGGAAGAGTACAAGAGGTGGCATGAAG 

ATTATAGTTTGTTCCGAAAAGTGTCTGTGTATCTCCTAACAGGCCTAGAACTCTATCA 

AAAAGGAAAGTACCAAGAGGCACTTTCCTACCTGGTATATGCCTACCAGAGCAATGCT 

GCCCTGCTGATGAAGGGGCCCCGCCGGGGGGTCAAAGAATCCGTGATTGCTTTATACC 

GAAGAAAATGCCTTCTGGAGCTGAATGCCAAAGCAGCTTCTCTTTTTGAAACAAATGA 

TGATCACTCCGTAACTGAGGGCATTAATGTGATGAATGAACTGATCATCCCCTGCATT 

CACCTTATCATTAATAATGACATTTCCAAGGATGATCTGGATGCCATTGAGGTCATGA : 

GAAACCATTGGTGCTCTTACCTTGGGCAAGATATTGCAGAAAATCTGCAGCTGTGCCT 

AGGGGAGTTTCTACCCAGACTTCTAGATCCTTCTGCAGAAATCATCGTCTTGAAAGAG 

CCTCCAACTATTCGACCCAATTCTCCCTATGACCTATGTAGCCGATTTGCAGCTGTCA 

TGGAGTCAATTCAGGGAGTTTCAACTGTGACAGTGAAATAAGCTCCCACATGTTCAAG 

GCCCATTCTGGTTCCTGGCTGCCTGCCTCTTGCACAGAAGTTCGTTGTCATAGTGCTC 


ACCT f 3 AA ,3CCACA 




ORF Start: ATG at 
17 


ORF Stop: TAA at 2939 


SIX.) II) NO: 276 


974 aa al 1 !0687.3kD 


NOV96b, 

CG59708-02 Protein 
Sequence 


MTAELQQDDAAGAADGHGSSCQMLLNQLREITGIQDPSFLHEALKASNGDITQAVSLL 
TDERVKEPSQDTVATEPSEVEGSAANKEVLAKVIDLTHDNKDDLQAAIALSLLESPKI 
QADGRDLNRMHEATSAETKRSKRNIMFMQELQYLFALMMGSNRKFVDPSAALDLLKGA 
FRSSEEQQQDVSEFTHKLLDWLEDAFQLAVNVNSPRNKSENPMVQLFYGTFLTEGVRE 
GKPFCNNETFGQYPLQVNGYRNLDECLEGAMVEGDVELLPSDHSVKYGQERWFTKL.PP 
VLTFELSRFEFNQSLGQPEKIHNKLEFPQIIYMDRYMYRSKELIRNKRECIRKLKEEI 
KILQQKLERYVKYGSGPARFPLPDMLKYVIEFASTKPASESCPPESDTHMTLPLSSVH 
CSVSDQTSKESTST3SSSQDVESTFSSPEDSLPKSKPLTSSRSSMEMPSQPAPRTVTD 
EEINFVKTCLQRWRSEIEQDIQDLKTCIASTTQTIEQMYCDPLLRQVPYRLHAVLVHE 
GQANAGHYWAYIYNQPRQSWLKYNDISVTESSWEEVERDSYGGLRNVSAYCLMYINAK 
LPYFNAEAAPTESDQMSEVEALSVELKHYIQEDNWRFEQEVEEWEEEQSCKIPQMESS 
PNSSSQGYSTSQEPSVASSHGVRCLSSEHAVIVKEQTAQAIANTARAYEKSGVEAALS 
EAFHEEYSRLYQLAKETPTSHSDPRLQHVLVYFFQNEAPKRWERTLLEQFADKNLSY 
DERSISIMKVAQAKLKEIGPDDMNMEEYKRWHEDYSLFRKVSVYLLTGLELYQKGKYQ 
EALSYLVYAYQSNAALLMKGPRRGVKESVIALYRRKCLLELNAKAASLFETNDDHSVT 
EGINVMNELI I PCIHLI INNDISKDDLDAIEVMRNHWCSYLGQDIAENLQLCLGEFLP 
FLLLJ [ } i r TJ 1 1 £ -iAVTESIQGVSTVTVK 




SEQ ID NO: 277 (323 1 bp 


NOV96c, 

CG59708-03 DNA Sequence 


GCGCTTCGGCCATGACTGCGGAGCTGCAGCAGGACGACGCGGCCGGCGCGGCAGACGG s 

CCACGGCTCGAGCTGCCAAATGCTGTTAAATCAACTGAGAGAAATCACAGGCATTCAG 

GACCCTTCCTTTCTCCATGAAGCTCTGAGGGCCAGTAATGGTGACATTACTCAGGCAG 

TCAGCCTTCTCACTGATGAGAGAGTTAAGGAGCCCAGTCAAGACACTGTTGCTACAGA 

ACCATCTGAAGTAGAGGGGAGTGCTGCCAACAAGGAAGTATTAGCAAAAGTTATAGAC 

CTTACTCATGATAACAAAGATGATCTTCAGGCTGCCATTGCTTTGAGTCTACTGGAGT 

CTCCCAAAATTCAAGCTGATGGAAGAGATCTTAACAGGATGCATGAAGCAACCTCTGC 

AGAAACTAAACGCTCAAAGAGAAAACGCTGTGAAGTCTGGGGAGAAAACCCCAATCCC 

AATGACTGGAGGAGAGTTGATGGTTGGCCAGTTGGGCTGAAAAATGTTGGCAATACAT 

GTTGGTTTAGTGCTGTTATTCAGTCTCTCTTTCAATTGCCTGAATTTCGAAGACTTGT 

TCTCAGTTATAGTCTGCCACAAAATGTACTTGAAAATTGTCGAAGTCATACAGAAAAG 

AGAAATATCATGTTTATGCAAGAGCTTCAGTATTTGTTTGCTCTAATGATGGGATCAA 

ATAGAAAATTTGTAGACCCGTCTGCAGCCCTGGATCTATTAAAGGGAGCATTCCGATC 

ATCTGAGGAACAGCAGCAAGATGTGAGTGAATTCACACACAAGCTCCTGGATTGGCTA 

GAGGACGCATTCCAGCTAGCTGTTAATGTTAACAGTCCCAGGAACAAATTTGAAAATC 

CAATGGTGCAGCTGTTCTATGGTACTTTCCTGACTGAAGGGGTTCGTGAAGGAAAACC 
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CTTTTGTAACAATGAGACCTTCGGCCAGTATCCTCTTCAGGTAAACGGTTATCGCAAC 
TTAGACGAGTGTTTGGAAGGGGCCATGGTGGAGGGTGATGTTGAGCTTCTTCCCTCCG 
ATCACTCGGTGAAGTATGGACAAGAGCGTTGGTTTACAAAGCTACCTCCAGTGTTGAC 
CTTTGAACTCTCAAGATTTGAGTTTAATCAGTCCCTTGGGCAGCCAGAGAAAATTCAC 
AATAAGCTGGAATTTCCTCAGATTATTTATATGGACAGGTACATGTACAGGAGCAAGG 
AGCTTATTCGAAATAAGAGAGAGTGTATTCGAAAGTTGAAGGAGGAAATAAAAATTCT 
GCAGCAAAAATTGGAAGGGTATGTGAAATATGGCTCAGGCCCAGCTCGGTTCCCGCTC 
CCGGACATGCTGAAATATGTTATTGAATTTGCTAGTACAAAACCTGCCTCAGAAAGCT 
GTCCACCTGAAAGTGACACACATATGACATTACCACTTTCTTCAGTGCACTGCTCGGT 
TTCTAACCAGACATCCAAGGAAAGTACAAGTACAGAAAGCTCTTCTCAGGATGTTGAA 
AGTACCTTTTCTTCTCCTGAAGATTCTTTACCCAAGTCTAAACCACTGACATCTTCTC 
GGTCTTCCATGGAAATGCCTTCACAGCCAGCTCCACGAACAGTCACAGATGAGGAGAT 
AAATTTTGTTAAGACCTGTCTTCAGAGATGGAGGAGTGAGATTGAACAAGATATACAA 
GATTTAAAGACTTGTATTGCAAGTACTACTCAGACTATTGAACAGATGTACTGCGATC 
CTCTCCTTCGTCAGGTGCCTTATCGCTTGCATGCAGTTCTTGTTCATGAAGGACAAGC 
AAATGCTGGACACTATTGGGCCTATATCTATAATCAACCCCGACAGAGCTGGCTCAAG 
TACAATGACATCTCTGTTACTGAATCTTCCTGGGAAGAAGTTGAAAGAGATTCCTATG 
GAGGCCTGAGAAATGTTAGTGCTTACTGTCTGATGTACATTAACGACAAACTACCCTA 
CTTCAATGCAGAGGCAGCCCCAACTGAATCAGATCAAATGTCAGAAGTGGAAGCCCTA 
TCTGTGGAACTCAAGCATTACATTCAGGAGGATAACTGGCGGTTTGAGCAGGAAGTAG 
AGGAGTGGGAAGAAGAGCAGTCTTGCAAAATCCCTCAAATGGAGTCCTCCACCAACTC 
CTCATCACAGGACTACTCTACATCACAAGAGCCTTCAGTAGCCTCTTCTCATGGGGTT 
CGCTGCTTGTCATCTGAGCATGCTGTGATTGTAAAGGAGCAAACTGCCCAGGCTATTG 
CAAACACAGCCCGTGCCTATGAGAAGAGCGGTGTAGAAGCGGCACTGAGTGAGGCATT 
CCATGAAGAATACTCCAGGCTCTATCAGCTTGCCAAAGAGACCCCCACCTCTCACAGT 
GATCCTCGACTTCAGCATGTCCTTGTCTACTTTTTCCAAAATGAAGCACCCAAAAGGG 
TAGTAGAACGAACCCTTCTGGAACAGTTTGCAGATAAAAATCTTAGCTATGATGAAAG 
ATCAATCAGCATTATGAAGGTGGCTCAAGCGAAACTGAAGGAAATTGGTCCAGATGAC 
ATGAATATGGAAGAGTACAAGAAGTGGCATGAAGATTATAGTTTGTTCCGAAAAGTGT 
CTGTGTATCTCCTAACAGGCCTAGAACTCTATCAAAAAGGAAAGTACCAAGAGGCACT 
TTCCTACCTGGTATATGCCTACCAGAGCAATGCTGCCCTGCTGATGAAGGGGCCCCGC 
CGGGGGGTCAAAGAATCCGTGATTGCTTTATACCGAAGAAAATGCCTTCTGGAGCTGA 
ATGCCAAAGCAGCTTCTCTTTTTGAAACAAATGATGATCACTCCGTAACTGAGGGCAT 
TAATGTGATGAATGAACTGATCATCCCCTGCATTCACCTTATCATTAATAATGACATT 
TCCAAGGATGATCTGGATGCCATTGAGGTCATGAGAAACCATTGGTGCTCTTACCTTG 
GGCAAGATATTGCAGAAAATCTGCAGCTGTGCCTAGGGGAGTTTCTACCCAGACTTCT 
AGATCCTTCTGCAGAAATCATCGTCTTGAAAGAGCCTCCAACTATTCGACCCAATTCT 
CCCTATGACCTATGTAGCCGATTTGCAGCTGTCATGGAGTCAATTCAGGGAGTTTCAA 
CTGTGACAGTGAAATAAGCTCCCACATGTTCAAGGCCCATTCTGGTTCCTGGCTGCCT 
> 1 ' L J i - ■ . . r, 




ORF Start: ATG at |ORF Stop: TAA at 3147 

12 1 




SEQlDNO:278 i1045 aa 'MWat 119107.7kD 


NOV96c, 

CG59708-03 Protein 
Sequence 


MTAELQQDDAAGAADGHGSSCQMLLNQLREITGIQDPSFLHEALRASNGDITQAVSLL 
TDERVKEPSQDTVATEPSEVEGSAANKEVLAKVIDLTHDNKDDLQAAIALSLLESPKI 
QADGRDLNRMHEATSAETKRSKRKRCEVWGENPNPNDWRRVDGWPVGLKNVGNTCWFS 
AVIQSLFQLPEFRRLVLSYSLPQOTLENCRSHTEKRNIMFMQELQYLFALMMGSNRKF 
VDPSAALDLLKGAFRSSEEQQQDVSEFTHKLLDWLEDAFQLAVNVNSPRNKFENPMVQ 
LFYGTFLTEGVREGKPFCNNETFGQYPLQVNGYRNLDECLEGAMVEGDVELLPSDHSV 

NKRECIRKLKEEIKILQQKLEGYVKYGSGPARFPLPDMLKYVIEFASTKPASESCPPE 
SDTHMTLPLSSVHCSVSNQTSKESTSTESSSQDVESTFSSPEDSLPKSKPLTSSRSSM 
EMPSQPAPRTVTDEEINFVKTCLQRWRSEIEQDIQDLKTCIASTTQTIEQMYCDPLLR 
QVPYRLHAVLVHEGQANAGHYWAYIYNQPRQSWLKYNDISVTESSWEEVERDSYGGLR 
NVSAYCLMYINDKLPYFNAEAAPTESDQMSEVEALSVELKHYIQEDNWRFEQEVEEWE 
EEQSCKIPQMESSTNSSSQDYSTSQEPSVASSHGVRCLSSEHAVIVKEQTAQAIANTA 
RAYEKSGVEAALSEAFHEEYSRLYQLAKETPTSHSDPRLQHVLVYFFQNEAPKRWER 
TLLEQFADKNLSYDERSISIMKVAQAKLKEIGPDDMNMEEYKKWHEDYSLFRKVSVYL 
LTGLELYQKGKYQEALSYLVYAYQSNAALLMKGPRRGVKESVIALYRRKCLLELNAKA 
ASLFETNDDHSVTEGINVMNELI I PCIHLI INNDISKDDLDAIEVMRNHWCSYLGQDI 
AENLQLCLGEFLPRLLDPSAEIIVLKEPPTIRPNSPYDLCSRFAAVMESIQGVSTVTV 
K 



Sequence comparison of the above protein sequences yields the following sequence 
relationships shown in Table 96B. 
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Table 96B. Comparison of NOV96a against NOV%b through NOV96c. 


Protein Sequence 


NOV96a Residues/ 
Match Residues 


Identities/ 
Similarities for the Matched Region 


NOV96b 


209..1045 
138..974 


805/837 (96%) 
805/837 (96%) 


NOV96c 


1..1045 
1..1045 


979/1045 (93%) 
981/1045 (93%) 



Further analysis of the NOV96a protein yielded the following properties shown in 
Table 96C. 



Table 96C. Protein Sequence Properties NOV96a 


PSort 
analysis: 


0.8800 probability located in nucleus; 0.3000 probability located in 
microbody (peroxisome); 0.1000 probability located in mitochondrial matrix 
space; 0.1000 probability located in lysosome (lumen) 


SignalP 
analysis: 


No Known Signal Sequence Predicted 



A search of the NOV96a protein against the Geneseq database, a proprietary 
database that contains sequences published in patents and patent publication, yielded 
several homologous proteins shown in Table 96D. 



— - — — : ; — : — — — " — j 

Table 96D. Geneseq Results for NOV96a 


Geneseq 
Identifier 


Protein/Organism/Length 
[Patent #, Date] 


NOV96a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Region 


Expect 
Value 


AAE04874 


Human protease protein- 1 (PRTS- 
1) - Homo sapiens, 1055 aa. 
[WO200146443-A2, 28-JUN-2001] 


22.. 1036 
18.. 1047 


524/1035 (50%) 
713/1035 (68%) 


0.0 


AAB31552 


A human ubiquitin specific 
protease 25 (USP25) - Homo 
sapiens, 1055 aa. [WO200079267- 
A2, 28-DEC-2000] 


22..1036 
18.. 1047 


524/1035 (50%) 
713/1035 (68%) 


0.0 


AAB31546 


A human ubiquitin specific 
protease 25 (USP25) - Homo 
sapiens, 1055 aa. [WO200078934- 
A2, 28-DEC-2000] 


22.. 1036 
18.. 1047 


524/1035 (50%) 
713/1035 (68%) 


0.0 


AAB74491 


Human SYK kinase binding protein 


22..1036 
18.. 1047 


522/1035 (50%) 
710/1035(68%) 


0.0 
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sapiens, 1055 aa. [WO200121654- 
A2, 29-MAR-2001] 








AAB31556 


A human ubiquitin specific 
protease (USP) - Homo sapiens, 
1087 aa. [WO200079267-A2, 28- 
DEC-2000] 


22.. 1036 
18.. 1079 


525/1067 (49%) 
717/1067 (66%) 


0.0 



In a BLAST search of public sequence databases, the NOV96a protein was found to 
have homology to the proteins shown in the BLASTP data in Table 96E. 



Table 96E. Public BLASTP Results for NOV96a 


Protein 
Accession 
Number 


Protein/Organism/Length 


NOV96a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Portion 


Expect 
Value 


Q96RU2 


UBIQUITIN SPECIFIC PROTEASE 
- Homo sapiens (Human), 1077 aa. 


1..1045 
1..1077 


1041/1077 (96%) 
1042/1077 (96%) 


0.0 


Q9P213 


KIAA1 5 1 5 PROTEIN - Homo 
sapiens (Human), 757 aa (fragment). 


304..1045 
16..757 


738/742 (99%) 
739/742 (99%) 


0.0 


P57080 


Ubiquitin carboxyl-terminal 
hydrolase 25 (EC 3.1.2.15) 
(Ubiquitin thiolesterase 25) 
(Ubiquitin-specific processing 
protease 25) (Deubiquitinating 
enzyme 25) (mUSP25) - Mus 
musculus (Mouse), 1055 aa. 


22..1036 
18.. 1047 


527/1033 (51%) 
710/1033 (68%) 


0.0 




Q9UHP3 


Ubiquitin carboxyl-terminal 
hydrolase 25 (EC 3.1.2.15) 
(Ubiquitin thiolesterase 25) 
(Ubiquitin-specific processing 
protease 25) (Deubiquitinating 
enzyme 25) (USP on chromosome 
21) - Homo sapiens (Human), 1087 
aa. 


22..1036 
18..1079 


525/1067(49%) 
717/1067 (66%) 


0.0 


Q9H9W1 


CDNA FLJ12512 FIS, CLONE 
NT2RM2001730, WEAKLY 
SIMILAR TO PROBABLE 
UBIQUITIN CARBOXYL- 
TERMINAL HYDROLASE 
K02C4.3 (EC 3.1.2.15) - Homo 
sapiens (Human), 737 aa. 


313..1036 
2. .729 : 


363/733 (49%) 
510/733(69%) ] 


0.0 



PFam analysis predicts that the NOV96a protein contains the domains shown in the 
Table 967. 



Table 96F. Domain Analysis of NOV96a 


Pfam Domain 


NOV96a Match 
Region 


Identities/ 
Similarities 
for the Matched 
Region 


Expect 
Value 


UIM: domain 1 of 1 


96.. 113 


9/18(50%) 
14/18(78%) 


8.4 


UCH-1: domain 1 of 
1 


162..193 


14/32 (44%) 
28/32 (88%) 


2.6e-ll 


UCH-2: domain 1 of 
1 


580..649 


26/72 (36%) 
56/72 (78%) 


1.5e-19 



Example 97. 

The NOV97 clone was analyzed, and the nucleotide and predicted polypeptide 
sequences are shown in Table 97A. 



Table 97 A. NOV97 Sequence Analysis 




SEQIDNO:279 1 1601 bp 


NOV97a, 

CG59559-01 DNA Sequence 


AGGGCAGAGGCCACAGCGCCATCCCCTTCCCCATGGTCTCCCTACCCCCAACCTGCAC 


TGGGCGCTCCGCCCAGAGGTGAGTCCCTCCCAGCCCTTCTCTCCTTCTGTCCTAGCCA 


TCC, K1L ' 3GGO "CATGA 


TCCTTCTGCGGGCCTCCGAAGTGCGGCAGCTGCTTCACAATAAGTTCGTGGTCATCCT 
GGGGGACTCTGTGCATAGGGCAGTATACAAGGACCTGGTGCTTCTGCTGCAGAAGGAC 
CGCCTGCTCACTCCCGGGCAGCTTAGAGCAAGGGGGGAGCTGAACTTCGAACAAGATG 
AGCTGGTGGACGGAGGCCAGCGGGGCCACATGCACAACGGCCTTAACTACCGTGAGGT 
CCGCGAGTTCCGCTCCGACCACCATCTGGTACGTTTTTACTTCCTCACCCGCGTGTAC 
TCCGATTACCTCCAGACCATCTTGAAAGAGCTGCAGTCGGGCGAGCACGCCCCCGACC 
TGGTCATCATGAATTCCTGCCTCTGGGACATCTCCAGGTATGGTCCGAACTCCTGGAG 
AAGCTACCTGGAGAACCTGGAGAACCTGTTCCAGTGCCTGGGCCAGGTGCTGCCCGAG 
TCTTGCCTCCTGGTGTGGAACACGGCCATGCCTGTGGGCGAGGAAGTCACCGGGGGTT 
TTCTTCCGCCCAAGCTCCGGCGGCAGAAGGCCACCTTCCTGAAAAACGAAGTGGTCAA 
AGCCAACTTCCACAGCGCCACCGAGGCACGTAAACATAACTTCGATGTACTGGACTTG 
CATTTCCACTTCCGCCACGCGAGGGAGAACCTGCACTGGGACGGGGTGCACTGGAATG 
GACGTGTGCACCGCTGCCTCTCCCAGCTGCTGCTGGCCCACGTGGCCGACGCCTGGGG 
TGTGGAGCTGCCCCACCGCCACCCCGTGGGCGAGTGGATCAAGAAGAAAAAACCTGGC 
CCGAGAGTCGAAGGGCCGCCCCAGGCCAACAGAAATCACCCGGCCTTACCTCTGTCCC 
CACCCTTACCTTCCCCCACATACCGCCCCCTGCTTGGGTTCCCACCCCAGCGCTTGCC 
GCTGCTCCCGCTCCTGTCCCCACAGCCTCCTCCTCCCATTCTCCATCACCAGGGAATG 
CCCCGGTTCCCACAGGGTCCCCCAGATGCCTGTTTTTCCTCAGACCATACTTTCCAGT 
CGGATCAATTCTATTGCCATTCAGATGTCCCCTCATCAGCCCATGCAGGTTTCTTCGT 
CGAAGACAATTTTATGGTTGGTCCTCAGCTGCCTATGCCCTTCTTCCCCACACCCCGT 
TATCAGCGGCCTGCCCCAGTGGTACATAGGGGTTTTGGCAGGTATCGTCCCCGTGGCC 
CCTATACGCCCTGGGGACAGCGGCCTCGACCTTCAAAGAGAAGGGCCCCAGCCAATCC 
TGAGCCAAGGCCTCAATAGACGGACCTAGGCCTTATTTCCTCTTTATGAACATGGATT 


GGACAGATCTGACACTTCCTTTCCATTGCTTGGCCTGAACAGACTGACCTTGTTAACT 


TAAGCCTGGAGTCCATGCCTCGTCTTCCTTTTGTT 




ORF Start: ATG at 171 


ORF Stop: TAG at 1467 




SEQ ID NO: 280 


432 aa jMW at 49726.6kD 


NOV97a, 


MILLRASEVRQLLHNKFWILGDSVHRAVYKDLVLLLQKDRLLTPGQLRARGELNFEQ 
i ] 11 _ c 1 E r r 1. " Fl i ^ r TfELQSGEHAP 
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CG5 95 59-01 Protein Sequence 



DLVIMNSCLWDISRYGPNSWRSYLENLENLFQCLGQVLPESCLLVWNTAMPVGEEVTG 
GFLPPKLRRQKATFLKNEWKANFHSATEARKHNFDVLDLHFHFRHARENLHWDGVHW 
NGRVHRCLSQLLLAHVADAWGVELPHRHPVGEWIKKKKPGPRVEGPPQANRNHPALPL 
SPPLPSPTYRPLLGFPPQRLPLLPLLSPQPPPPILHHQGMPRFPQGPPDACFSSDHTF 
QSDQFYCHSDVPSSAHAGFFVEDNFMVGPQLPMPFFPTPRYQRPAPWHRGFGRYRPR 
GPYTPWGQRPRPSKRRAPANPEPRPQ 



Further analysis of the NOV97a protein yielded the following properties shown in 
Table 97B. 



Table 97B. Protein Sequence Properties NOV97a 


PSort 
analysis: 


0.5937 probability located in mitochondrial matrix space; 0.5103 probability 
located in microbody (peroxisome); 0.4900 probability located in nucleus; 
0.3252 probability located in lysosome (lumen) 


SignalP 
analysis: 


No Known Signal Sequence Predicted 



A search of the NOV97a protein against the Geneseq database, a proprietary 
database that contains sequences published in patents and patent publication, yielded 
several homologous proteins shown in Table 97C. 



Table 97C. Geneseq Results for NOV97a 



Geneseq 
Identifier 


Protein/Organism/Length [Patent 
#, Date] 


NOV97a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Region 


Expect 
Value 


AAG74241 


Human colon cancer antigen protein 
SEQ ID NO:5005 - Homo sapiens, 
281 aa. [WO200122920-A2, 05- 
APR-2001] 


34..294 
1..266 


162/268 (60%) 
191/268 (70%) 


le-82 


AAE03639 


Human extracellular matrix and cell 
adhesion molecule-3 (XMAD-3) - 
Homo sapiens, 386 aa. 
[WO200142285-A2, 14-JUN-2001] 


1..421 
1..366 


197/435 (45%) 
231/435 (52%) 


2e-82 



In a BLAST search of public sequence databases, the NOV97a protein was found to 
have homology to the proteins shown in the BLASTP data in Table 97D. 
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Table 97D. Public BLASTP Results for NOV97a 


Protein 
Accession 
Number 


Protein/Organism/Length 


NOV97a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Portion 


Expect 
Value 


Q96HM7 


SIMILAR TO HYPOTHETICAL 
PROTEIN FLJ22376 - Homo sapiens 
(Human), 432 aa. 

^ — — ^ ~ 


1..432 
1..432 


432/432 (100%) 
432/432(100%) 


0.0 


Q96B20 


HYPOTHETICAL 3 1 .4 KDA 
PROTEIN - Homo sapiens (Human), 
279 aa. 


121.310 
1..190 


190/190(100%) 
190/190(100%) 


e-116 


Q9H1Q7 


BA12M19.1.3 (NOVEL PROTEIN) 
(CDNA FLJ31791 FIS, CLONE 
NT2RI2008749, WEAKLY 
SIMILAR TO SPLICEOSOME 
ASSOCIATED PROTEIN 49) - 
Homo sapiens (Human), 454 aa. 


1..421 
18..434 


234 437 (53%) 
273/437 (61%) 


e-111 


Q9H1Q6 


BA 1 2M 1 Q. ] . 1 (NOVEL PROTEIN ) 
- Homo sapiens (Human), 403 aa. 


1..421 
18..383 


197/435 (45%) 
231/435 (52%) 


7e-82 
le-81 


Q9H6D1 


CDNA: FLJ22376 FIS, CLONE 
HRC07327 - Homo sapiens j 
(Human), 403 aa. 


1..421 
1 8..383 


196/435 (45%) 
231/435 (53%) 



PFam analysis predicts that the NOV97a protein contains the domains shown in the 
Table 97E. 



Tabic 97E. Domain Analysis of NOV97a 


j I Identities/ 




Pfam Domain j NOV97a Match Region j Similarities 


Expect Value 


| 1 for the Matched Region 




No Significant Matches Found 


Example 98. 



The NOV98 clone was analyzed, and the nucleotide and predicted polypeptide 
sequences are shown in Table 98A. 
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Table «>8,\. NOV98 Sequence Analysis 



(SEQIDNO:281 1981 bp 



NOV98a GCGCCGGGTCCCAGAATCTAGTCCTACGCCACGGTTTTGACCACGCGTGACCCGCTGC 

r^r^n<c/:n m tynt a c ccagccggcccggccatcaggtggtccgtgtgtccctctgac atgtcgtcctgcagcc 
CuDVooy-Ul UNA Sequence gcgtggccctggtaactggggctaacaaaggcatcggctttgcgatcacgcgtgacct 
gtgtcggaaattctccggggacgtggtgctcacggcgcgggacgaggcgcggggccgc 
jgcggcggtgcagcagctgcaggcggagggcctgagcccacgcttccaccagctggaca 
Itcgacgacccgcagagcatccgtgcgctgcgcgactttctgcgcaaggagtacggggg 
jacttaacgtgctggtcaacaacgcgggcatcgcctttagaagtactgatctcacccac 

JTTTCACATTCTAAGAGAAGCTGCAATGAAAACTAACTTTTTTGGTACCCAGGCCGTCT 

, i -gagctactccctctaataaaaacccaaggtagagtggtgaatatatcaagcct 
Iaataagtctagaggccctgaaaaactgcagcctggagctacagcagaagtttcgaagt 
gagaccatcacagaggaggagctggtggggctcatgaacaagtttgtggaggatacaa 
Iagaaaggagtccatgcaaaagaaggctggcctaatagtgcatacggggtgtctaagat 
Itggagtgacagtcctgtccagaatccttgccaggaaactcaatgagcagaggagaggg 
Igacaagatccttctgaatgcctgctgccctggctgggtcagaaccgacatggcaggac 
Icacaagccaccaaaagcccagaagaaggagcagagacccctgtgtacttggccctttt 
Igcctccagatgcagagggacctcatgggcagtttgttcaagataaaaaagtggaacaa 
( tggtg aactcagctctttgtacagctcccatctgtagcctgtcctaaagggga 



ORF Si . . \ k i u 101 ouj'Stop:TGAat932 



ISEQ ID NO: 282 



277 aa MW at 30547.7kD 



WOV98a jMSSCSRVALVTGANKGIGFAITRDLCRKFSGDWLTARDEARGRAAVQQLQAEGLSPR 

'/--r^n^n m r> • c fhqldiddpqsiralrdflrkeygglnvlvnnagiafrstdlthfhilreaamktnff 
jCCj^yooy-Ul rrotem occjuence IgtqavcteijLpliktqgrvvnissijIslealkncslelqqkfrsetiteeelvglmnk 

I FVEDTKKGVHAKEGWPNSAYGVSKIGVTVLSRILARKLNEQRRGDKILLNACCPGWVR 
[ I TDMAGPQATKSPEEGAETPVYLALLPPDAEGPHGQFVQDKKVEQW 



Further analysis of the NOV98a protein yielded the following properties shown in 
Table 98B. 



Table 98B. Protein Sequence Properties NO\ «>s : i 


jPSort 
| analysis: 


0.4766 probability located in mitochondrial matrix space; 0.4500 probability 
located in cytoplasm; 0.1822 probability located in mitochondrial inner 
membrane; 0.1822 probability located in mitochondrial intermembrane space 


SignalP 
analysis: 


No Known Signal Sequence Predicted 



A search of the NOV98a protein against the Geneseq database, a proprietary 
database that contains sequences published in patents and patent publication, yielded 
several homologous proteins shown in Table 98C. 



Table 98C. Geneseq Results for NOV98a 


Geneseq 
Identifier 


Protein/Organism/Length [Patent 
#, Date] 


NOV98a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Region 


Expect 
Value 


AAW51011 


Human liver carbonyl reductase - 
Homo sapiens, 277 aa. 
[US5756299-A, 26-MAY-1998] 


1..277 
1..277 


236/277 (85%) 
252/277 (90%) 


e-134 
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AAU33100 


Novel human secreted protein 
#3591 - Homo sapiens, 175 aa. 
[WO2001 79449- A2, 25-OCT-2001] 


j 142..277 
J 39..174 


119/136(87%) 
128/136(93%) 


1 2e-66 




Human bone marrow expressed 
probe encoded protein SEQ ID NO: 
33947 - Homo sapiens, 123 aa. 
[WO200157276-A2, 09-AUG-2001] 


1..97 
1..97 


86/97 (88%) 
92/97 (94%) 


I 7e-43 


AAM6U948 


Human brain expressed single exon 
probe encoded protein SEQ ID NO: 
33053 - Homo sapiens, 123 aa. 
[WO200157275-A2, 09-AUG-2001] 


1..97 
1..97 


] 86/97 (88%) 
92/97 (94%) 


! 7e-43 

! 


AAM33832 


Peptide #7869 encoded by probe for 
measuring placental gene expression 
- Homo sapiens, 123 aa. 
[WO2001 57272- A2, 09-AUG-2001] 


1..97 
1..97 


86/97 (88%) 
92/97 (94%) 


7e-43 


In a BLAST search of public sequence databases, the NOV98a protein was found to 
have homology to the proteins shown in the BLASTP data in Table 98D. 


Table 98D. Public BLASTP Results for NOV98a j 


Protein 

Accession 
Number 


Protein/Organism/Length 


NOV98a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Portion 


Expect 
Value 


[ Q924V2 


CARBONYL REDUCTASE 2 - 
Cricetulus griseus (Chinese 
hamster), 277 aa. 


1..277 
1..277 


243/277 (87%) 
260/277 (93%) 


e-139 


Q91X28 

! 

— — — . i 


SIMILAR TO CARBONYL 
REDUCTASE 1 - Mus musculus 
(Mouse), 277 aa. 


1..277 
1..277 


244/277 (88%) 
256/277 (92%) 


e-139 


Q924V3 


CARBONYL REDUCTASE 1 - 
Cricetulus griseus (Chinese 
hamster), 277 aa. 


I. .277 
1..277 


241/277 (87%) 
256/277 (92%) 


e-137 


P48758 


Carbonyl reductase [NADPH] 1 (EC 
1 . 1 . 1 . 1 84) (NADPH-dependent 
carbonyl reductase 1) - Mus 
musculus (Mouse), 276 aa. 


2.211 
1.216 \ 


240/276 (86%) 
253/276(90%) 


e-136 


JC5284 




carbonyl reductase (NADPH) (EC 
1.1.1.1 84), inducible - rat, 277 aa. 


1.211 
1.211 


236/277 (85%) 
249/277 (89%) 


e-134 



PFam analysis predicts that the NOV98a protein contains the domains shown in the 
Table 98E. 
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Table 98E. Domain Analysis of NOV98a 


Pfam Domain 


NOV98a Match 
Region 


Identities/ 
Similarities 
for the Matched 
Region 


Expect 
Value 


adh short: domain 1 of 
1 


4..274 


67/286 (23%) 
185/286 (65%) 


1.6e-38 



Example 99. 

The NOV99 clone was analyzed, and the nucleotide and predicted polypeptide 
sequences are shown in Table 99A. 



Table 99 A. NOV99 Sequence Analysis 




SEQIDNO:283 1001 hp 


NOV99a, 

CG5 8624-01 DNA Sequence 


CTTGGTATAAGTAAGTGCTCGTCAATGTTGGCTACTCTCAATGTCAGAGCCGCAGCCG 


CGGGGCGCAGAGCGCGATCTCTACCGGGACACGTGGGTGCGATACCTGGGCTATGCCA 
ATGAGGTGGGCGAGGCTTTCCGCTCTCTTGTGCCAGCGGCGGTGGTGTGGCTGAGCTA 
TGGCGTGGCCAGCTCCTACGTGCTGGCGGATGCCATTGACAAAGGCAAGAAGGCTGGA 
GAGGTGCCCAGCCCTGAAGCAGGCCGCAGCGCCAGGGTGACTGTGGCTGTGGTGGACA 
CCTTTGTATGGCAGGCTCTAGCCTCTGTGGCCATTCCGGGCTTCACCATCAACCGCGT 
GTGTGCTGCCTCTCTCTATGTCCTGGGCACTGCCACCCGCTGGCCCCTGGCTGTCCGC 
AAGTGGACCACCACCGCGCTTGGGCTGTTGACCATCCCCATCATTATCCACCCCATTG 
ACAGGGATCATCCACTCTCCAGTGATGAGAGTGGATCATCCAGTCTCCAGCACGAAGG 
GCCAGGGGTCCCACAGGTGAGTGGAGCCCCAGCAGCCCCCTCAGCTCTGCGTGCCCAT 
GTACTGGTCTTCTCCCTGGCTCTATACTCAGTGTTCAAGGGGTTGGACGGGGCTTGGG 
CCGCGGAGCTGCGCCTGGCTTTGCTGCTCCACAAGGGCACCGTGGCTGTCAGCCTGTC 
CCTGCAACTGCTGCAGAGCCACGTAGGGTTACAGGTGGTGGCTGGCTGTGGGATCCAC 




GGCCTCTGCACCAGCTGGCCCAGTCTGTGCTAGAGGGCATGGTGGCTGGCACCTTCCT 
CTATACCACCTTTCTGGAAATCTTTCCACAGGAGCTGGCGACTTCTGAGCAAAGGATC 
CTCAAGGTCATTCTGCTCCTAGAAGGGTGTGCCCTGCTCACTGGCCTGCTCTTCATCC 
ATATCTAGGGGGCTT 




ORF Start: ATG at 41 lORF Stop: TAG at 992 




SEQ ID NO: 284 |317aa MW at 33737.8kD 1 


NOV99a, 

CG58624-01 Protein Sequence 


MSEPQPRGAERDLYRDTWVRYLGYANEVGEAFRSLVPAAWWLSYGVASSYVLADAID 
KGKKAGEVPSPEAGRSARVTVAWDTFVWQALAS VAI PGFT I NRVCAASLYVLGTATR 
WPLAVRKWTTTALGLLTIPIIIHPIDRDHPLSSDESGSSSLQHEGPGVPQVSGAPAAP 
SALRAHVLVFSLALYSVFKGLDGAWAAELRLALLLHKGTVAVSLSLQLLQSHVGLQW 
AGCG I HFLCMTLLG IRLGAALAQSAGPLHQLAQSVLEGMVAGTFLYTTFLE I FPQELA 
E IL LI LLTGLLFIKI 



Further analysis of the NOV99a protein yielded the following properties shown in 
Table 99B. 



Table 99B. Protein Sequence Propirtus NOV99a 


PSort 
analysis: 


0.6000 probability located in plasma membrane; 0.4000 probability located in 
Golgi body; 0.3000 probability located in endoplasmic reticulum (membrane); 
0.1000 probability located in mitochondrial inner membrane 


SignalP 
analysis: 


Likely cleavage site between residues 55 and 56 
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A search of the NOV99a protein against the Geneseq database, a proprietary 
database that contains sequences published in patents and patent publication, yielded 
several homologous proteins shown in Table 99C. 





Table 99C. Geneseq Results for NOV99a 


1 


Geneseq 
Identifier 


Protein/Organism/Length [Patent 
#, Date] 


NOV99a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Region 


Expect 
Value 


AAM93835 


Human polypeptide, SEQ ID NO: 
3905 - Homo sapiens, 324 aa. 
[EP1 130094-A2, 05-SEP-2001] 


140..317 
141. .324 


134/184 (72%) 
145/184 (77%) 




3e-63 


AAY52394 


Human transmembrane protein 
HP 10528 - Homo sapiens, 324 aa. 
[ W09955862-A2, 04-NOV-1999] 


140..317 
141.324 


134/184 (72%) 
145/184 (77%) 


3e-63 


AAY84895 


A human proliferation and 
apoptosis related protein - Homo 
sapiens, 324 aa. [WO200023589- 
A2, 27-APR-2000] 


140.31 7 
141.324 


134/184 (72%) 
145/184 (77%) 


3e-63 


AAB43291 


Human ORFX ORF3055 
polypeptide sequence SEQ ID 
NO: 61 10 - Homo sapiens, 323 aa. 
|WO 00058473-A2. 05-OCT-2000] 


140.317 
140.323 


134/184(72%) : 
145/184 (77%) 


3e-63 1 


AAM93650 


Human polypeptide, SEQ ID NO: 
3514 - Homo sapiens, 324 aa. 
[EP1130094-A2, 05-SEP-2001] 


140.317 
141.324 ; 


133/184 (72%) 
144/184(77%) ! 


2e-62 



In a BLAST search of public sequence databases, the NOV99a protein was found to 
have homology to the proteins shown in the BLASTP data in Table 99D. 



Table 99D. Public BLASTP Results for NOV99a 



Protein 
Accession 
Number 


Protein/Organism/Length 


NOV99a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Portion 


Expect 
Value 


Q9UDX5 


WUGSC:H_DJ0539M06.2 PROTEIN 
- Homo sapiens (Human), 1 66 aa. 


1-152 
1..152 


145/152 (95%) 
145/152 (95%) 


6e-78 


Q9CRB8 


26 1 0507A2 1RIK PROTEIN 
(1700020C1 1RIK PROTEIN) - Mus 
nu v. ilus .' \1 ii v 166 aa. 


1..168 
1..164 


133/168 (79%) 
143/168 (84%) 


8e-69 
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Q9CZX4 


2610507A21RIK PROTEIN - Mus 
musculus (Mouse), 166 aa. 


1..143 
1..143 


125/143 (87%) 
133/143 (92%) 


2e-68 


Q9NY26 


IRT1 PROTEIN (SIMILAR TO 

TRANSPORTER-LIKE) 
(HYPOTHETICAL 34.2 KDA 
PROTEIN) (UNKNOWN) (PROTEIN 
FOR MGC:14180) - Homo sapiens 
(Human), 324 aa. 


140..317 


134 '184 (72%) 

i qa anw\ 
14J/1S4 yl 1 /o) 


le-62 


Q9Y380 


CGI-71 PROTEIN - Homo sapiens 
(Human), 324 aa. 


140..317 
141. .324 


134/184 (72%) 
145/184 (77%) 


le-62 



PFam analysis predicts that the NOV99a protein contains the domains shown in the 
Table 99E. 



Table 99E. Domain Analysis of NOV99a 


Pfam Domain 


NOV99a Match 
Region 


Identities/ 
Similarities 
for the Matched 
Region 


Expect 
Value 


Syndecan: domain 1 of 

i 1 


235. .255 


9/21 (43%) 
16/21 (76%) 


6.9 | 

-J 


Zip: domain 1 of 1 


1 74.3 13 


52/178 (29%) 
108/178 (61%) 


2.3e-15 



Example 100. 

The NOV100 clone was analyzed, and the nucleotide and predicted polypeptide 
sequences are shown in Table 100A. 



Table 100A. ISO VI 00 Sequence Analysis 




SEQ ID NO: 285 |987 bp 


NOV 100a, 

CG59679-01 DNA Sequence 


AGACGCTCACACAGACAACCTCAAGTCCAGCAACATCTTAGTAGCCCAAAATCGACTG 


CTTTAGTTCTTCTG rG rCT TGI ,i,~TATGCCATCCTGCAGTC 


GCATTGCACTGGTGACTGGAGCTAATAAGGGCATTGGCTTTGCGATCACTCGTGACCT 
GTGTCAGCAATTCTCAGGGGATGTGGTGCTCACTGCACGGGACGAGGCACGGGGCCTT 
GCGGCAGTGCAGAAGCTGCAGGCTGAGGGCCTGATTCCTCGCTTCCACCAGCTGGACA 
TCAATGACCCTCAGAGCATCCATGCACTTCGCAACTTTCTGCTCAAGGAGTACGGAGG 




ACACCCTTCGACATTCAAGCTGAGGTGACACTGAAGACGAACTTTTTTGCCACTAGAA 
ATGTCTGCACTGAGTTACTGCCTATAATGAAACCACATGGTAGAGTGGTGAACATCAG 
CAGTCTGCAGGGGTTAAAAGCCCTTGAGAACTGCAGGGAAGATCTTCAGGAAAAGTTC 
CGATGTGACACACTTACCGAGGTGGACCTGGTCGACCTCATGAAAAAGTTTGTGGAGG 
ATACAAAAAATGAAGTCCATGAGAGGGAAGGTTGGCCAGACTCGGCTTACGGGGTGTC 
GAAGCTGGGGGTGACAGTCCTTACGAGGATCCTGGCCCGGCAGCTGGATGAAAAGAGG 
AAAGCGGACAGGATTCTGCTCAATGCCTGCTGCCCGGGATGGGTGAAGACCGACATGG 

TCTCCTGCCTCCAGATGCCACTGAACCTCACGGCCAGCTAGTCCGTGACAAAGTTGTG 
CAAACTTGGTGAACGTCTGCTCTGGGGCTTAATTGTTTGATAAACGTTAGCGGGAGAG 


A 
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lORF Start: ATG at 101 


ORF Stop: TGA at 938 




SEQ ID NO: 286 


279 aa ImW at 31007.2kD 


NOV 100a, 

CG5 9679-01 Protein Sequence 


MPSCSRIALVTGANKGIGFAITRDLCQQFSGDWLTARDEARGLAAVQKLQAEGLIPR 
FHQLDINDPQSIHALRNFLLKEYGGLDVLVNNAGIGVLFKVDDPTPFDIQAEVTLKTN 
FFATRNVCTELLPIMKPHGRWNISSLQGLKALENCREDLQEKFRCDTLTEVDLVDLM 
KKFVEDTKNEVHEREGWPDSAYGVSKLGVTVLTRILARQLDEKRKADRILLNACCPGW 
VKTDMARDQGSRTVEEGAETPVYLALLPPDATEPHGQLVRDKWQTW 



Further analysis of the NOV 100a protein yielded the following properties shown in 
Table 100B. 



Table 100B. Protein Sequence Properties NOVlOOa 


PSort 
analysis: 


0.3600 probability located in mitochondrial matrix space; 0.3000 probability 
located in microbody (peroxisome); 0.1808 probability located in lysosome 
(lumen); 0.0000 probability located in endoplasmic reticulum (membrane) 


SignalP 
analysis: 


No Known Signal Sequence Predicted 



A search of the NOVlOOa protein against the Geneseq database, a proprietary 
database that contains sequences published in patents and patent publication, yielded 
several homologous proteins shown in Table 100C. 



Table 100C. Geneseq Results for NOVlOOa 


Geneseq 
Identifier 


Protein/Organism/Length 
[Patent #, Date] 


NOVlOOa 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Region 


Expect 
Value 

e-112 1 


AAW51011 


Human liver carbonyl reductase - 
Homo sapiens, 277 aa. 
[US5756299-A, 26-MAY-1998] 


1..279 
1..277 


198/279 (70%) 
233/279 (82%) 


AAU33100 


Novel human secreted protein 
#3591 - Homo sapiens, 175 aa. 
[WO200179449-A2, 25-OCT- 
2001] 


145..279 
40.. 174 


88/135 (65%) 
110/135 (81%) 


r 

2e-48 


AAG46601 


Arabidopsis thaliana protein 
fragment SEQ ID NO: 58644 - 
Arabidopsis thaliana, 302 aa. 
[EPI033405-A2, 06-SEP-2000] 


3..259 
20..283 


106/268(39%) 
157/268 (58%) 


6e-43 


AAG46600 


Arabidopsis thaliana protein 
fragment SEQ ID NO: 58643 - 
Arabidopsis thaliana, 316 aa. 
[EP1033405-A2, 06-SEP-2000] 


3..259 
34..297 


106/268 (39%) 
157/268 (58%) 


6e-43 
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AAG46599 


Arabidopsis thaliana protein 
fragment SEQ ID NO: 58642 - 
Arabidopsis thaliana, 327 aa. 
[EP1033405-A2, 06-SEP-2000] 


3..259 
45..308 


106/268 (39%) 
157/268 (58%) 


6e-43 


In a BLAST search of public sequence databases, the NOVlOOa protein was found 
to have homology to the proteins shown in the BLASTP data in Table 100D. 


Table I00D. Public BLASTP Results for NOVlOOa 


Protein 
Accession 
Number 


Protein/Organism/Length 

, 


NOVlOOa 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Portion 


Expect 
Value 


Q9JJN7 


CARBONYL REDUCTASE (EC 
1.1.1.184) (CARBONYL 
REDUCTASE 3) - Cricetulus 
griseus (Chinese hamster), 277 aa. 


1..279 
1..277 


246/279 (88%) 
262/279 (93%) 


e-140 


AAH02812 


CARBONYL REDUCTASE 3 - 
Homo sapiens (Human), 277 aa. 


1..279 
1..277 


227/279 (81%) 
246/279 (87%) 


e-126 


075828 


Carbonyl reductase [NADPH] 3 
(EC 1.1.1.184) (NADPH- 
dependent carbonyl reductase 3) - 
Homo sapiens (Human), 276 aa. 


3..279 
2..276 


226/277 (81%) 
245/277(87%) 


e-126 


Q924V2 


CARBONYL REDUCTASE 2- 
Cricetulus griseus (Chinese 
hamster), 277 aa. 


1..279 
1..277 


206/279(73%) 
244/279 (86%) 


e-119 


Q91X28 


SIMIIAR IOC \RBONYL 
REDUCTASE 1 - Mus musculus 
(Mouse), 277 aa. 


1..279 
1..277 


204/279(73%) 
240/279 (85%) 


e-116 



PFam analysis predicts that the NOVlOOa protein contains the domains shown in 
the Table 100E. 



Table 100E. Domain Analysis of NOVlOOa 


Pfam Domain 


NOVlOOa Match 
Region 


Identities/ 
Similarities 
for the Matched 
Region 


Expect 
Value 


adh short: domain 1 of 
1 


4..277 


77/316(24%) 
186/316(59%) 


5.2e-31 
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Example 101. 

The NOV101 clone was analyzed, and the nucleotide and predicted polypeptide 
sequences are shown in Table 101 A. 



Table 101A. NOV101 Sequence Analysis 




SEQIDNO:287 1 1011 bp 


NOVlOla, 

CG5 9644-01 DNA Sequence 


CTCCTCGGGGGGGCGGCGGCGGCGATGT n P < 1 rCTCGTACGGGCGGCTGGTGG 
CCCGCGCCGTGCTCGGCGGCCTCTCGCAGACCGACCCCAGGGCCGGCGGCGGCGGCGG 
CGGCGCCGTGCTCGGCGGCCTCTCGCAGACCGACCCCAGGGCCGGCGGCGGCGGCGGC 
GGCGACTACGGACTGGTGACGGCCGGCTGCGGCTTCGGGAAGGACTTCCGTAAGGGCC 
TCCTCAAGAAGGGCGCGTGCTACGGGGACGACGCGTGCTTCGTGGCCCGGCACCGTTC 
CGCGGACGTGCTCGGTGTTGCAGATGGTGTAGGAGGCTGGAGAGACTATGGAGTTGAT 
CCATCTCAATTCTCAGGGACTTTAATGCGGACGTGTGAACGTTTAGTAAAAGAAGGAC 
GGTTCGTACCTAGTAATCCCATTGGAATTCTCACCACAAGCTACTGTGAGTTGCTGCA 
AAATAAAGTCCCTTTGCTCGGTAGCAGCACCGCCTGCATTGTGGTGCTGGACAGAACC 
AGCCACCGCTTACACACAGCAAACCTGGGCGATTCAGGCTTCCTGGTTGTCAGGGGTG 
GTGAAGTCGTGCACCGATCAGATGAGCAGCAGCATTACTTCAACACTCCATTCCAGCT 
CTCAATCGCTCCCCCTGAAGCCGAGGGAGTCGTCTTGAGCGACAGTCCGGATGCTGCT 
GATAGCACGTCTTTCGATGTCCAGCTAGGAGACATTATCCTGACGGCAACAGATGGAC 
TCTTTGACAACATGCCTGATTATATGATTCTTCAGGAGCTAAAAAAGTTAAAGAATTC 
AAATTATGAGAGTATACAACAGACTGCCAGAAGCATTGCTGAGCAAGCTCATGAGCTG 
GCCTATGACCCAAATTATATGTCACCTTTTGCACAGTTTGCATGTGACAATGGATTGA 
ATGTGAGAGGTGGTGGAAAGCCAGATGACATCACCGTCCTTCTTTCAATAGTGGCTGA 
GTATACAGACTAGCTGAGGTGTCAA 




ORF Start: ATG at 25 


ORF Stop: TAG at 997 




SEQ ID NO: 288 


324 aa |m\\ at 543 1 l.lkl) 


NOVlOla, 

CG59644-01 Protein Sequence 


MFSVLSYGRLVARAVLGGLSQTDPRAGGGGGGAVLGGLSQTDPRAGGGGGGDYGLVTA 
GCGFGKDFRKGLLKKGACYGDDACFVARHRSADVLGVADGVGGWRDYGVDPSQFSGTL 
MRTCERLVKEGRFVPSNPIGILTTSYCELLQNKVPLLGSSTACIWLDRTSHRLHTAN 
LGDSGFLWRGGEWHRSDEQQHYFNTPFQLSIAPPEAEGWLSDSPDAADSTSFDVQ 
LGDI I LTATDGLFDNMPDYMILQELKKLKNSNYES IQQTARS I AEQAHELAYDPNYMS 
iJ ,_ T- ' E L [ S _ i 



Further analysis of the NOVlOla protein yielded the following properties shown in 
5 Table 10 IB. 



Table 101B. Protein Sequence Properties NOVlOla 


PSort 
analysis: 


0.5708 probability located in mitochondrial matrix space; 0.4996 probability 
located in mitochondrial intermembrane space; 0.2852 probability located in 
mitochondrial inner membrane; 0.2852 probability located in mitochondrial 
outer membrane 


SignalP 
analysis: 


Likely cleavage site between residues 23 and 24 



A search of the NOVlOla protein against the Geneseq database, a proprietary 
database that contains sequences published in patents and patent publication, yielded 
several homologous proteins shown in Table 101C. 
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Table 101 C. Gcneseq Results for NOV10 la 


Geneseq 
Identifier 


Protein/Organism/Length [Patent 
#, Date] 


NOVlOla 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Region 


Expect 
Value 


AAB85357 


Human phosphatase (PP) (clone ID 
3402521CD1) - Homo sapiens, 304 
aa. [WO200153469-A2, 26-JUL- 
2001] 


1..324 
1..304 


304/324 (93%) 
304/324 (93%) 


e-173 


AAU32112 


Novel human secreted protein 
#2603 - Homo sapiens, 304 aa. 
[WO200179449-A2, 25-OCT- 
2001] 


25..324 
6..304 


272/300 (90%) 
274/300 (90%) 


e-156 


AAG52267 


Arabidopsis thaliana protein 
fragment SEQ ID NO: 66421 - 
Arabidopsis thaliana, 348 aa. 
[EP1033405-A2, 06-SEP-2000] 


71. .320 
99..340 


101/261 (38%) 
133/261 (50%) 


4e-33 


AAG52266 


Arabidopsis thaliana protein 
fragment SEQ ID NO: 66420 - 
Arabidopsis thaliana, 374 aa. 
[EP1033405-A2, 06-SEP-2000] 


71..320 
125..366 


101/261 (38%) 
133/261 (50%) 


4e-33 


AAG52265 


Arabidopsis thaliana protein 
fragment SEQ ID NO: 66419 - 
Arabidopsis thaliana, 467 aa. 
[EP1033405-A2, 06-SEP-2000] 


71..320 
218..459 


101/261 (38%) 
133/261 (50%) 


4e-33 



In a BLAST search of public sequence databases, the NOVlOla protein was found 
to have homology to the proteins shown in the BLASTP data in Table 101D. 



Table 101D. Public BLASTP Results for NOVlOla 



Protein 
Accession 
Number 


Protein/Organism/Length 




NOVlOla 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Portion 


Expect 
Value 


Q9W0E2 
Q9W3R1 


CGI 2091 PROTEIN - Drosophila 
melanogaster (Fruit fly), 321 aa. 


1..320 
1..320 


163/322 (50%) 
218/322 (67%) 


le-83 


CGI 5035 PROTEIN - Drosophila 
melanogaster (Fruit fly), 374 aa. 


55..319 
109..373 


127/266 (47%) 
178/266 (66%) 


le-64 


018183 


W09D1 0.4 PROTEIN - 
Caenorhabditis elegans, 330 aa. 


4..320 
7..330 


130/33 I (41%) 
198/331 (59%) 


2e-60 
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Q9VAH4 


CG7615 PROTEIN - Drosophila 
melanogaster (Fruit fly), 314 aa. 


35..319 
26..309 


122/285 (42%) 
168/285 (58%) 


2e-56 


Q9SUK9 


HYPOTHETICAL 36.2 KDA 
PROTEIN - Arabidopsis thaliana 
(Mouse-ear cress), 335 aa. 


71..320 
86..327 


101/261 (38%) 
133/261 (50%) 


le-32 

_ 



PFam analysis predicts that the NOV 10 la protein contains the domains shown in 
the Table 101E. 



Table 101E. Domain Analysis of NOVlOla 


Pfam Domain 


NOVlOla Match Region 


Identities/ 
Similarities 
for the Matched Region 


Expect Value 


PP2C: domain 1 of 1 


147..191 


13/48 (27%) 
36/48 (75%) 


0.26 



Example 102. 

The NOV 102 clone was analyzed, and the nucleotide and predicted polypeptide 
sequences are shown in Table 102A. 



Table 102A. NOV102 Sequence Analysis 




SEQ ID NO: 289 


523 bp 


NOV 102a, 

CG59662-01 DNA Sequence 


AGT T T i ATG T AI r CACCATGTTCTTCGACGTTGCTGTCGAC 
AGTGAGCCCTTGGACCACGTCTCCTTTGAGCTGTTTGCAGAAAAGTTTCCAAAGACAG 
CAGAAAACGTTCGTGCTCTGAGCACTGAAGAGAAAGGATTTGGTTATAAGGGTCCCTG 
CTTTCACAGAATTATACCAGCATTTATGTGTCAGGGTGGTGACTTCACGCACCATAAT 
GGCACTGGTGGCAAGTCCATCTACGGGGAGAAATTTGAAGATGAGAAATTTATCCTAA 
AGCGTACAGGTCCTGGCATCTTGTCCATGGCAAATTCTGGACCCAACACAAACTGTTC 
CGTTTTTTTCATCTGCACTGCCAAGACGGGGTGGTTGGATGGCAAGCATGTAGTCTTT 
GGCAAGGTGAAAGAAGGCATGAATATTTTGGAGGCCATAGAGCAATTTGGGTCCAGGA | 
ATGGCAAGACCAGCAAGAAGACCACCATTGCTGACTGTGGACAGCTCTGGTAAGTTTG 




ORF Start: ATG at 20 


ORF Stop: TAA at 515 




SEQ ID NO 290 


165 aa ImW at 18237.7kD 


NOV102a. 

CG59662-01 Protein Sequence 


MVNHTMFFDVAVDSEPLDHVSFELFAEKFPKTAENVRALSTEEKGFGYKGPCFHRIIP 
AFMCQGGDFTHHNGTGGKSIYGERFEDEKFILKRTGPGILSMANSGPNTNCSVFFICT 
L H r r f 1 n 3 _u 



Further analysis of the NOV102a protein yielded the following properties shown in 
Table 102B. 



Table 102B. Protein Sequence Properties NO VI 02a 


PSort 
analysis: 


0.6400 probability located in microbody (peroxisome); 0.4500 probability 
located in cytoplasm; 0.1000 probability located in mitochondrial matrix 
space; 0.1000 probability located in lysosome (lumen) 
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SignalP 
analysis: 



No Known Signal Sequence Predicted 



A search of the NOV 102a protein against the Geneseq database, a proprietary 
database that contains sequences published in patents and patent publication, yielded 
several homologous proteins shown in Table 102C. 



Table 102C. Geneseq Results for NOV102a 


Geneseq 
Identifier 


Protein/Organism/Length [Patent 
#, Date] 


NOV102a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Region 


Expect 
Value 


AAU01195 


Human cyclophilin A protein - 
Homo sapiens, 165 aa. 
[WO200132876-A2, 10-MAY- 
2001] 


L.164 
1..164 


141/164 (85%) 
148/164 (89%) 


le-80 


AAW56028 


Calcineurin protein - Mammalia, 
165 aa. [WO9808956-A2, 05- 
MAR-1998] 


1.164 
1..164 


141/164 (85%) 
148/164 (89%) 


le-80 


AAG65275 


Haematopoietic stem cell 
proliferation agent related human 
protein #2 - Homo sapiens, 1 64 aa. 
[JP2001163798-A, 19-JUN-2001] 


2..164 
1-163 


140/163 (85%) 
147/163 (89%) 


5e-80 


AAP90431 


Cyclophilin - Homo sapiens 
(human), 164 aa. [EP326067-A, 02- 
AUG-1989] 


2.. 164 
1-163 


140/163 (85%) 
147/163 (89%) 


5e-80 | 


AAG03831 


Human secreted protein, SEQ ID 1 1 . . 1 64 
NO: 79 1 2 - Homo sapiens, 1 65 aa. I 1 . . 1 64 
[EP1 033401 -A2, 06-SEP-2000] ! 


140/164 (85%) 
147/164 (89%) 


8e-80 



In a BLAST search of public sequence databases, the NOV 102a protein was found 
to have homology to the proteins shown in the BLASTP data in Table 102D. 



Table 1021). Public BLASTP Results for NO\T02a 


i 

Protein 
Accession 
Number 


Protein/Organism/Length 


NO VI 02a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Portion 


Expect 
Value 


CAC39529 


SEQUENCE 26 FROM PATENT 


1-164 
L.164 


141/164 (85%) 
148/164 (89%) 


4e-80 
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(Human), 165 aa. 








) Q9BRU4 


PFPTTTYVT PPOT VT TQDMFR AQF 

A (CYCLOPHILIN A) - Homo 
sapiens (Human), 165 aa. 


1..164 


14U/ lo4 
147/164 (89%) 


2e-79 


P05092 


Peptidyl-prolyl cis-trans isomerase 
A (EC 5.2.1.8) (PPIase) (Rotamase) 
(Cyclophilin A) (Cyclosporin A- 
binding protein) - Homo sapiens 
(Human),, 164 aa. 


2. .164 
1..163 


14U/]o3 (85%) 
147/163 (89%) 


2e-79 

S 


Q96IX3 


PEPTIDYLPROLYL ISOMERASE 
A (CYCLOPHILIN A) - Homo 
sapiens (Human), 1 65 aa. 


1.164 
1..164 


140/164 (85%) 
147/164 (89%) 


5e-79 1 


P04374 


Peptidyl-prolyl cis-trans isomerase 
A (EC 5.2.1.8) (PPIase) (Rotamase) 
(Cyclophilin A) (Cyclosporin A- 
binding protein) - Bos taurus 
(Bovine), and, 163 aa. 


2.. 164 
1..163 


138/163 (84%) 
147/163 (89%) \ 


7e-79 | 



PFam analysis predicts that the NOV 102a protein contains the domains shown in 
the Table 102E. 



Table 102E. Domain Analysis of NOYT02a 


Pfam Domain 


NOV102a Match 
Region 


Identities/ 
Similarities 
for the Matched 
Region 


Expect 
Value 


pro isomerase: domain 1 
of 1 


5..165 


105/180 (58%) 
141/180 (78%) 


4.2e-91 



Example 103. 

The NOV 103 clone was analyzed, and the nucleotide and predicted polypeptide 
sequences are shown in Table 103 A. 



Table 103 A. NOV! 03 Sequence Analysis 




SEQIDNO:291 1 8860 bp 


NOV103a, 

CG59773-01 DNA Sequence 


GGATCCTTGAGGGCACTGGTGCGACTTTCAGGTGAGGTCTTAGCAGATGAAAGCGGCT 
T i P CTCGCCGTGAGCCAGGTGTGC 
AACCGGATTT' -. GGC AGGGTCG < I IGCAT 3 AAGCCC 
GCTGACCGGA "T? ' \ ! - :CCAGAAGAGCCTTGTGGAGGCCGCAGACGCGAAGCCGCT 
'GA.TCTTGAAATCTGATCrT-1 rCCCCGAGGCl ; ^TPGCC 
GCTGCTGCTCCGGGAGCCCAGTCTGCTAAAAGGGGAGGACGTTGAGGACGCGGCGGCT 
GGCGGGAGAG 2 AG TGC J " iGAGACATGGCAGGGTCGGAGCGCGGCCTGCGCCTCTG 

ACGGCTCTGGT i ' GTGGGG 

ACCCCACTGGGCTGTGCCATGCTGACCGGAGACCACCGAGGCGGGAGACAGAGCGCGG 
CGAAGAGCCATTGAGTGGTCACCCAGTAGCCGCCGCCGCCGCCGCCTCGGGAAGCTTG 
CCACCCGCTAGGAGGGAAGATGAAGGAGATTTGCAGGATCTGTGCCCGAGAGCTGTGT 

GGAARCCAGC3GCGCTGGATCTTCCACACGGCGTCCARGCTC&ATCTCCAGGTTCTGC 
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! TTTCGCACGTCTTGGGCAAGGATGTCCCCCGCGATGGCAAAGCCGAGTTCGCTTGCAG 
i CAAGTGTGCTTTCATGCTTGATCGAATCTATCGATTCGACACAGTTATTGCCCGGATT 
GAAGCGCTTTCTATTGAGCGCTTGCAAAAGCTGCTACTGGAGAAGGATCGCCTCAAGT 
TCTGCATTGCCAGTATGTATCGGAAGAATAACGATGACTCTGGCGCGGAGATCAAGGC 
GGGGAATGGGACGGTTGACATGTCCGTCTTACCCGATGCGAGATACTCTGCACTGCTC 
CAGGAGGACTTCGCCTATTCAGGGTTTGAGTGCTGGGTGGAGAATGAGGATCAGArCC 
AGGAGCCACACAGCTGCCATGGTTCAGAAGGCCCTGGAAACCGACCCAGGAGATGCCG 
TGGTTGTGCCGCTTTGCGGGTTGCTGATTCTGACTATGAAGCCATTTGTAAGGTACCT 
CGAAAGGTGGCCAGAAGTATCTCCTGCGGCCCTTCTAGCAGGTGGTCGACCAGCATTT 
GCACTGAAGAACCAGCGTTGTCTGAGGTTGGGCCACCCGACTTAGCAAGCACAAAGGT 
ACCCCCAGATGGAGAAAGCATGGAGGAAGAGACGCCTGGTTCCTCTGTGGAATCTTTG 
GATGCAAGCGTCCAGGCTAGCCCTCCACAACAGAAAGATGAGGAGACTGAGAGAAGTG 
CAAAGGAACTTGGAAAGTGTGACTGTTGTTCAGATGATCAGGCTCCGCAGCATGGGTG 
TAATCACAAGCTGGAATTAGCTCTTAGCATGATTAAAGGTCTTGATTATAAGCCCATC 
CAGAGCCCCCGAGGGAGCAGGCTTCCGATTCCAGTGAAATCCAGCCTACCTGGAGCCA 
AGCCTGGCCCTAGCATGACAGATGGAGTTAGTTCCGGTTTCCTTAACAGGTCTTTGAA 
; ACCCCTTTACAAGACACCTGTGAGTTATCCCTTGGAGCTTTCAGACCTGCAGGAGCTG 
TGGGATGATCTCTGTGAAGATTATTTGCCGCTCCGGGTCCAGCCCATGACTGAAGAGT 
TGCTGAAACAACAAAAGCTGAATTCACATGAGACCACTATAACTCAGCAGTCTGTATC 
TGATTCCCACTTGGCAGAACTCCAGGAAAAAATCCAGCAAACAGAGGCCACCAACAAG 
ATTCTTCAAGAGAAACTTAATGAAATGAGCTATGAACTAAAGTGTGCTCAGGAGTCGT 
CTCAAAAGCAAGATGGTACAATTCAGAACCTCAAGGAAACTCTGAAAAGCAGGGAACG 
TGAGACTGAGGAGTTGTACCAGGTAATTGAAGGTCAAAATGACACAATGGCAAAGCTT 
CGAGAAATGCTGCACCAAAGCCAGCTTGGACAACTTCACAGCTCAGAGGGTACTTCTC 
CAGCTCAGCAACAGGTAGCTCTGCTTGATCTTCAGAGTGCTTTATTCTGCAGCCAACT 
TGAAATACAGAAGCTCCAGAGGGTGGTACGACAGAAAGAGCGCCAACTGGCTGATGCC 
AAACAATGTGTGCAATTTGTAGAGGCTGCAGCACACGAGAGTGAACAGCAGAAAGAGG 
CTTCTTGGAAACATAACCAGGAATTGCGAAAAGCCTTGCAGCAGCTACAAGAAGAATT 
GCAGAATAAGAGCCAACAGCTTCGTGCCTGGGAGGCTGAAAAATACAATGAGATTCGA 
ACCCAGGAACAAAACATCCAGCACCTAAACCATAGTCTGAGTCACAAGGAGCAGTTGC 
TTCAGGAATTTCGGGAGCTCCTACAGTATCGAGATAACTCAGACAAAACCCTTGAAGC 
AAATGAAATGTTGCTTGAGAAACTTCGCCAGCGAATACATGATAAAGCTGTTGCTCTG 
GAGCGGGCTATAGATGAAAAATTCTCTGCTCTAGAAGAGAAAGAAAAAGAACTGCGCC 
AGCTTCGTCTTGCTGTGAGAGAGCGAGATCATGACTTAGAGAGACTGCGCGATGTCCT 
CTCCTCCAATGAAGCTACTATGCAAAGTATGGAGAGTCTCCTGAGGGCCAAAGGCCTG 
GAAGTGGAACAGTTATCTACTACCTGTCAAAACCTCCAGTGGCTGAAAGAAGAAATGG 
AAACCAAATTTAGCCGTTGGCAGAAGGAACAAGAGAGTATCATTCAGCAGTTACAGAC 
GTCTCTTCATGATAGGAACAAAGAAGTGGAGGATCTTAGTGCAACACTGCTCTGCAAA 
CTTGGACCAGGGCAGAGTGAGATAGCAGAGGAGCTGTGCCAGCGTCTACAGCGAAAGG 
AAAGGATGCTGCAGGACCTTCTAAGTGATCGAAATAAACAAGTGCTGGAACATGAAAT 
GGAGATTCAAGGCCTGCTTCAGTCTGTGAGCACCAGGGAGCAGGAAAGCCAAGCTGCT 
GCAGAGAAGTTGGTGCAAGCCTTAATGGAAAGAAATTCAGAATTACAGGCCCTGCGCC 
AATATTTAGGAGGGAGAGACTCCCTGATGTCCCAAGCACCCATCTCTAACCAACAAGC 
TGAAGTTACCCCCACTGGCCGTCTTGGAAAACAGACTGATCAAGGTTCAATGCAGATA 
CCTTCCAGAGATGATAGCACTTCATTGACTGCCAAAGAGGATGTCAGCATACCCAGAT 
CCACATTAGGAGACTTGGACACAGTTGCAGGGCTGGAAAAAGAACTGAGTAATGCCAA 
AGAGGAACTTGAACTCATGGCTAAAAAAGAAAGAGAAAGTCAGATGGAACTTTCTGCT 
CTACAGTCCATGATGGCTGTGCAGGAAGAAGAGCTGCAGGTGCAGGCTGCTGATATGG 
AGTCTCTGACCAGGAACATACAGATTAAAGAAGATCTCATAAAGGACCTGCAAATGCA 
ACTGGTTGATCCTGAAGACATACCAGCTATGGAACGCCTGACCCAGGAAGTCTTACTT 
CTTCGGGAAAAAGTTGCTTCAGTAGAATCCCAGGGTCAAGAAATTTCAGGAAACCGAA 
GACAACAGTTGCTGCTGATGCTAGAAGGACTAGTAGATGAACGGAGTCGGCTCAATGA 
GGCCTTACAAGCAGAGAGACAGCTCTATAGCAGTCTGGTGAAGTTCCATGCCCATCCA 
GAGAGCTCTGAGAGAGACCGAACTCTGCAGGTGGAACTGGAAGGGGCTCAGGTGTTAC 
GCAGTCGGCTAGAAGAAGTTCTTGGAAGAAGCTTGGAGCGCTTAAACAGGCTGGAGAC 
CCTGGCCGCCATTGGAGGTGCAGCTGCAGGGGATGACACCGAAGATACAAGCACTGAG 
TTCACTGACAGTATTGAGGAGGAGGCTGCACACCATAGTCACCAGCAACTTGTCAAGG 
TGGCTTTGGAGAAAAGTCTGGCAACTGTGGAGACCCAGAACCCATCTTTTTCCCCTCC 
TTCTCCGATGGGAGGGGACAGTAACAGGTGTCTTCAGGAAGAAATGCTCCACCTGAGG 
GCTGAGTTCCACCAGCACTTAGAAGAGAAGAGGAAAGCTGAGGAGGAACTGAAGGAGC 
TAAAGGCTCAAATTGAGGAAGCAGGATTCTCCTCAGTGTCCCACATCAGGAACACCAT 
GCTGAGCCTTTGCCTTGAGAATGCGGAGCTGAAAGAGCAGATGGGAGAAGCAATGTCT 
GATGGATGGGAGATCGAGGAAGACAAGGAGAAGGGCGAGGTGATGGTTGAGACTGTGG 
TAACCAAAGAGGGTCTGAGTGAGAGTAGCCTTCAGGCTGAGTTCAGAAAGCTCCAGGG 
AAAACTGAAGAATGCCCACAATATCATCAACCTCCTCAAAGAACAACTTGTGCTGAGT 
AGCAAGGAAGGGAATAGTAAACTTACTCCAGAGCTCCTTGTGCATCTGACCAGCACCA 
TTGAAAGAATAAACACAGAACTGGTTGGTTCCCCTGGGAAGCACCAACACCAAGAGGA 
GGGGAATGTGACTGTGAGGCCTTTCCCCAGACCCCAGAGCCTTGACCTTGGGGCTACC 
TTCACAGTGGATGCCCACCAATTGGATAACCAGTCCCAGCCTCGTGACCCTGGGCCTC 
AGTCAGCGTTTAGCCTACCAGGGTCCACCCAGCACCTGCGCTCCCAGCTGTCACAATG 
CAAACAACGCTATCAAGATCTCCAGGAGAAGCTGCTGCTATCAGAAGCCACTGTCTTT 
GCTCAGGCTAACGAGCTGGAGAAATACAGAGTTATGCTTACAGGTGAATCCTTGGTGA 
AGCAGGACAGCAAGCAGATCCAGGTGGACCTCCAGGACCTGGGCTATGAGACTTGTGG 
CCGAAGCGAGAATGAGGCTGAACGGGAGGAAACCACCAGTCCTGAGTGTGAGGAGCAC 
AACAGCCTCAAGGAAATGGTCCTGATGGAGGGGCTGTGCTCTGAGCAGGGACGCCGGG 
GCTCAACACTGGCTAGTTCCTCTGAGAGGAAGCCCTTGGAGAACCAGCTAGGGAAGCA 
GGAAGAGTTCCGGGTATATGGAAAGTCAGAAAACATCTTGGTCCTACGAAAGGACATC 
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AAAGATCTGAAGGCCCAGCTGCAGAATGCCAACAAGGTCATTCAAAACCTCAAGAGCC 
GGGTCCGGTCCCTCTCAGTTACAAGTGATTATTCGTCTAGTCTGGAAAGACCCCGGAA 
GCTGAGAGCTGTTGGCACCTTGGAGGGGTCTTCACCTCATAGTGTCCCTGATGAGGAT 
GAGGGGTGGCTGTCTGATGGCACTGGGGCTTTCTACTCTCCAGGGCTTCAGGCCAAAA 
AGGACCTGGAGAGTCTCATCCAGAGAGTATCCCAGCTGGAGGCCCAGCTCCCAAAAAA 
TGGACTAGAAGAGAAGCTGGCTGAGGAGCTGAGATCAGCCTCGTGGCCTGGGAAATAT 
GATTCCCTGATTCAGGATCAGGCCCGGGAACTGTCTTACCTACGGCAAAAAATACGAG 
AAGGGAGAGGTATTTGTTATCTTATCACCCGGCATGCAAAAGATACAGTAAAATCTTT 
TGAGGATCTCCTAAGGAGCAATGACATTGACTACTACCTGGGACAGAGCTTCCGGGAG 
CAACTCGCCCAGGGAAGCCAGCTGACAGAGAGGCTCACCAGCAAACTCAGCACCAAGG 
ATCATAAAAGTGAGAAAGATCAAGCTGGACTTGAGCCACTGGCCCTCAGGCTCAGCAG 
GGAGCTGCAGGAGAAGGAGAAAGTGATTGAAGTCCTGCAGGCCAAGCTGGATGCTCGG 
TCCCTCACACCCTCCAGCAGCCATGCCTTGTCTGACTCCCACCGCTCTCCCAGCAGCA 
CCTCTTTCCTGTCTGATGAACTGGAAGCCTGCTCTGACATGGACATAGTCAGCGAGTA 
CACACACTATGAAGAGAAGAAAGCTTCTCCCAGTCACTCAGATTCCATCCATCATTCG 
AGTCATTCTGCTGTGTTGTCTTCTAAACCATCATCAACCAGTGCATCTCAGGGGGCTA 
AGGCCGAATCCAACAGCAACCCCATCAGCTTGCCAACTCCCCAGAATACCCCCAAGGA 
GGCCAACCAGGCCCATTCAGGCTTTCATTTTCACTCCATACCCAAGCTGGCTAGCCTT 
CCTCAGGCACCATTGCCCTCAGCTCCATCCAGCTTCCTGCCTTTCAGCCCCACTGGCC 
CTCTCCTCCTTGGCTGCTGTGAGACACCAGTGGTCTCCTTGGCTGAGGCTCAGCAGGA 
GCTACAGATGCTGCAGAAGCAGTTGGGAGAAAGTGCCAGCACTGTTCCTCCTGCTTCC 
ACAGCTACATTGCTGAGCAACGACTTGGAAGCCGACTCTTCCTACTACCTCAACTCTG 
CCCAGCCTCACTCTCCTCCAAGGGGCACCATAGAACTGGGAAGAATCCTAGAGCCTGG 
GTACCTGGGCAGCAGTGGCAAGTGGGATGTGATGAGGCCTCAGAAAGGGAGTGTATCT 
GGGGACCTATCCTCAGGCTCCTCTGTGTACCAGCTTAACTCCAAACCCACAGGGGCTG 
ACCTGCTGGAAGAGCATCTTGGTGAAATCCGGAACCTGCGCCAGCGCCTGGAGGAGTC 
CATCTGCATCAATGACCGCCTACGGGAGCAACTGGAACACCGGCTGACCTCTACTGCT 
CGTGGAAGGGGATCCACTTCTAACTTCTACAGTCAGGGCCTGGAGTCCATACCTCAGC 
TCTGCAATGAGAACAGAGTCCTCAGGGAAGACAATCGAAGACTTCAGGCTCAACTGAG 
TCATGTTTCCAGAGAGCACTCCCAGGAAACAGAAAGCCTGAGGGAGGCTCTGCTGTCC 
TCTCGATCCCACCTTCAAGAGCTGGAAAAGGAGCTGGAGCACCAGAAGGTGGAAAGGC 
AGCAGCTTTTGGAAGACTTGAGGGAGAAGCAGCAAGAGGTCTTGCATTTCAGGGAGGA 
ACGTCTTTCCCTCCAGGAAAACGACTCCAGACTGCAGCACAAGCTGGTTCTCCTGCAG 
CAACAGTGTGAAGAGAAACAGCAGCTCTTTGAGTCCCTCCAGTCAGAGCTACAAATCT 
ACGAGGCACTTTATGGCAATTCCAAGAAGGGGCTGAAAGGCTTGGGTTTGGATACTTC 
TCCAGTAATGAAGACCCCTCCCAAGCTAGAGGGTGATGCTACTGATGGCTCCTTTGCC 
AATAAGCATGGCCGCCATGTCATTGGCCACATTGATGACTACAGTGCCCTAAGACAGC 
AGATTGCGGAGGGCAAGCTGCTGGTCAAAAAGATAGTGTCTCTTGTGAGATCAGCGTG 
CAGCTTCCCTGGCCTTGAAGCCCAAGGCACAGAGGTGCTAGGCAGCAAAGGTATTCAT 
GAGCTTCGGAGCAGCACCAGTGCCCTGCACCATGCCCTAGAGGAGTCGGCTTCCCTCC 
TCACCATGTTCTGGAGAGCAGCCCTGCCAAGCACCCACATCCCTGTGCTGCCTGGCAA 
AGTGGGAGAATCAACAGAAAGGGAACTTCTGGAACTGAGAACCAAAGTATCCAAACAG 
GAGCGGCTCCTTCAGAGCACAACTGAGCATCTGAAGAACGCCAACCAGCAGAAGGAGA 
GCATGGAGCAGTTCATCGTCAGCCAGCTAACCAGAACACATGATGTTTTAAAGAAGGC 
AAGGACTAACTTAGAGGTGAAATCCCTAAGGGCTCTGCCATGTACTCCAGCCTTGTGA 
CCCTTGCCTTCCAGGAACCATGCAAGAAGCGCAGCCACCAGAAGTCCTTAAAACAGCA 


GGAAAGGTGGGCCTGTCCCCCTTTTGTGCAGCTACCTATCTGCTGAGGAGCATCTGGG 
CCTCATTCCTCCAAGTCCACGGGAGGGTCCAGAAGAGGGAGTCAGAGATGTATCCTGG 


TGGAGCTGGGAGAAAGGCAGAAAGCCTTTCTGACAGCTATGGAATACGATTAGCCAAG 


GTCCACTTGGCCCAGCACTAAGAAAAAGATGCGTAGTTTGCACAGAAGGTTTTGTGAT 


CCTGCCTCTCAACAGCCCCAGCAGCTTGCGAACTAGCAAGAGCACATTTCTTGrfTCA 


TCAGCTGTr - A TCAGTGGATATAGGACCCTGATTCCGATGAAAGGG 


GC - G P3GTC -CAATGCTGGAGCTCCTCTGGCAGGTTCTAAAAGCACACTACTGAGCA 


GCGGTGCCCTGCCGGACACTGCTGGCGGGGGCTCAGTGAGCACTACTCACAGATCCAC 


ACCTGACCCTGTTGGGTCGAGTCAGGCTGGGCCTTGGTCTGCACTGTAGCACCTGTGT 


TCTTTGAGTTCACATCATGAATGTGGTGACTTCCCAGATACCATCTCAGGCTTAACCT 


AGCACATCCTATTTCTTTTCTTCTATGATATCCAAATTGGACTGACCTCACTTCAAAG 


TTGCTGTCCCATTTTGTCACCCTATCTTATCTCGGGGAAATTGCAGACTGATGGCCAG 


ACCAACTCTGTTGAAATTCTTGCATAGAGCAAACCTGTGCTCATTTTTAAGTGGCATG 


GGAGAGGCCCCCAGCCTAGTAAAGCCTAGTCTGTGTCTTCACAGTGCTGGTAGAATGT 


GTTTGTGTGTATAAATATATGATATAGATTTATATATGTTGCTAACGCCATATATTGA 


AGGCCAACATAACTGGTGGACAGGGTGGGTGACAGAAAATGAAAGCCTTTTTGGTGAT 
TGTTAAAGCAAGATGTGTATAAAGAAATAAATAGTTTTTCTTTC 




ORF Start: ATG at [oRJ Stop li A at 7828 
658 




SEQ ID NO 292 2390 aa M\V at 26884 -, 7kD 


NOV103a, 
CG59773-01 Protein 
Sequence 


MKEICRICARELCGNQRRWIFHTASKLNLQVLLSHVLGKDVPRDGKAEFACSKCAFML 
DRI YRFDTVI AR I EALS I ERLQKLLLEKDRLKFC I ASMYRKNNDDSGAE I KAGNGTVD 
MSVLPDARYSALLQEDFAYSGFECWVENEDQIQEPHSCHGSEGPGNRPRRCRGCAALR 
VADSDYEAICKVPRKVARSISCGPSSRWSTSICTEEPALSEVGPPDLASTKVPPDGES 
MEEETPGSSVESLDASVQASPPQQKDEETERSAKELGKCDCCSDDQAPQHGCNHKLEL 
ALSMIKGLDYKPIQSPRGSRLPIPVKSSLPGAKPGPSMTDGVSSGFLNRSLKPLYKTP 
VSYPLELSDLQELWDDLCEDYLPLRVQPMTEELLKQQKLNSHETT I TQQSVSDSHLAE 
□Fino-F - L . UZ E E - „ STLKSRERETEELY 
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w 





QVIEGQKDTMAKLREMLHQSQLGQLHSSEGTSPAQQQVALLDLQSALFCSQLEIt I ' . 
RWRQKERQLADAKQCVQFVEAAAHESEQQKEASWKHNQELRKALQQLQEELQNKSQQ 
LRAWEAEKYNEIRTQEQNIQHLNHSLSHKEQLLQEFRELLQYRDNSDKTLEANEMLLE 
KLRQRIHDKAVALERAIDEKFSALEEKEKELRQLRLAVRERDHDLERLRDVLSSNEAT 
MQSMESLLRAKGLEVEQLSTTCQNLQWLKEEMETKFSRWQKEQES I IQQLQTSLHDRN 
KEVEDLSATLLCKLGPGQSEIAEELCQRLQRKERMLQDLLSDRNKQVLEHEME I QGLL 
QSVSTREQESQAAAEKLVQALMERNSELQALRQYLGGRDSLMSQAPISNQQAEVTPTG 
RLGKQTDQGSMQIPSRDDSTSLTAKEDVSIPRSTLGDLDTVAGLEKELSNAKEELELM 
AKKERESQMELSALQSMMAVQEEELQVQAADMESLTRKIQIKEDLIKDLQMQLVDPED 
IPAMERLTQEVLLLREKVASVESQGQEISGNRRQQLLLMLEGLVDERSRLNEALQAER 
QLYSSLVKFHAHPESSERDRTLQVELEGAQVLRSRLEEVLGRSLERLNRLETLAAIGG 
AAAGDDTEDTSTEFTDSIEEEAAHHSHQQLVICVALEKSLATVETQNPSFSPPSPMGGD 
SNRCLQEEMLHLRAEFHQHLEEKRKAEEELKELKAQIEEAGFSSVSHIRNTMLSLCLE 
NAELKEQMGEAMSDGWEIEEDKEKGEVMVETWTKEGLSESSLQAEFRKLQGKLKNAH 
NIINLLKEQLVLSSKEGNSKLTPELLVHLTSTIERINTELVGSPGKHQHQEEGNVTVR 
PFPRPQSLDLGATFTVDAHQLDNQSQPRDPGPQSAFSLPGSTQHLRSQLSQCKQRYQD 
LQEKLLLSEATVFAQANELEKYRVMLTGESLVKQDSKQIQVDLQDLGYETCGRSENEA 
EREETTSPECEEHNSLKEMVLMEGLCSEQGRRGSTLASSSERKPLENQLGKQEEFRVY 
GKSENILVLRKDIKDLKAQLQNANKVIQNLKSRVRSLSVTSDYSSSLERPRKLRAVGT 
LEGSSPHSVPDEDEGWLSDGTGAFYSPGLQAKKDLESLIQRVSQLEAQLPKNGLEEKL 
AEELRSASWPGKYDSLIQDQARELSYLRQKIREGRGICYLITRHAKDTVKSFEDLLRS 

KVIEVLQAKLDARSLTPSSSHALSDSHRSPSSTSFLSDELEACSDMDIVSEYTHYEEK 

GFHFHS I PKLASLPQAPLPSAPSSFLPFSPTGPLLLGCCETPWSLAEAQQELQMLQK 
QLGESASTVPPASTATLLSNDLEADSSYYLNSAQPHSPPRGTIELGRILEPGYLGSSG 
KWDVMRPQKGSVSGDLSSGSSVYQLNSKPTGADLLEEHLGE I RNLRQRLEES I C INDR 
LREQLEHRLTSTARGRGSTSNFYSQGLESIPQLCNENRVLREDNRRLQAQLSHVSREH 
SQETESLREALLSSRSHLQELEKELEHQKVERQQLLEDLREKQQEVLHFREERLSLQE 
NDSRLQHKLVLLQQQCEEKQQLFESLQSELQIYEALYGNSKKGLKGLGLDTSPVMKTP 
PKLEGDATDGSFANKHGRHVIGHIDDYSALRQQIAEGKLLVKKIVSLVRSACSFPGLE 
AQGTEVLGSKGIHELRSSTSALHHALEESASLLTMFWRAALPSTHIPVLPGKVGESTE 
RELLELRTKVSKQERLLQSTTEHLKNANQQKESMEQFIVSQLTRTHDVLKKARTNLEV 




SEQIDNO:293 (7161 bp 


NOV 103b, 

CG59773-02 DNA Sequence 


FGAG 5 "CGGGCACGCTCCTCCCCATGGGTTGCCCATCATGTCTAATGGAT 


ATCGCACTCTGTCCCAGCACCTCAATGACCTGAAGAAGGAGAACTTCAGCCTCAAGCT 
GCGCATCTACTTCCTGGAGGAGCGCATGCAACAGAAGTATGAGGCCAGCCGGGAGGAC 
ATCTACAAGCGGAACATTGAGCTGAAGGTTGAAGTGGAGAGCTTGAAACGAGAACTCC 
AGGACAAGAAACAGCATCTGGATAAAACATGGGCTGATGTGGAGAATCTCAACAGTCA 
GAATGAAGCTGAGCTCCGACGCCAGTTTGAGGAGCGACAGCAGGAGACGGAGCATGTT 
TATGAGCTCTTGGAGAATAAGATCCAGCTTCTGCAGGAGGAATCCAGGCTAGCAAAGA 
ATGAAGCTGCGCGGATGGCAGCTCTGGTGGAAGCAGAGAAGGAGTGTAACCTGGAGCT 
CTCAGAGAAACTGAAGGGAGTCACCAAAAACTGGGAAGATGTACCAGGAGACCAGGTC 
AAGCCCGACCAATACACTGAGGCCCTGGCCCAGAGGGACAGGAGAATTGAAGAACTGA 
ATCAGAGCCTGGCTGCCCAGGAGAGGCTTGTAGAACAGCTATCTCGGGAGAAACAACA 
ACTGCTACATCTGTTGGAGGAGCCAACTAGCATGGAAGTGCAGCCCATGACTGAAGAG 
TTGCTGAAACAACAAAAGCTGAATTCACATGAGACCACTATAACTCAGCAGTCTGTAT 
CTGATTCCCACTTGGCAGAACTCCAGGAAAAAATCCAGCAAACAGAGGCCACCAACAA 
GATTCTTCAAGAGAAACTTAATGAAATGAGCTATGAACTAAAGTGTGCTCAGGAGTCG 
TCTCAAAAGCAAGATGGTACAATTCAGAACCTCAAGGAAACTCTGAAAAGCAGGGAAC 
GTGAGACTGAGGAGTTGTACCAGGTAATTGAAGGTCAAAATGACACAATGGCAAAGCT 
TCGAGAAATGCTGCACCAAAGCCAGCTTGGACAACTTCAGAGCTCAGAGGGTACTTCT 
CCAGCTCAGCAACAGGTAGCTCTGCTTGATCTTCAGAGTGCTTTATTCTGCAGCCAAC 
TTGAAATACAGAAGCTCCAGAGGGTGGTACGACAGAAAGAGCGCCAACTGGCTGATGC 
CAAACAATGTGTGCAATTTGTAGAGGCTGCAGCACACGAGAGTGAACAGCAGAAAGAG 
GCTTCTTGGAAACATAACCAGGAATTGCGAAAAGCCTTGCAGCAGCTACAAGAAGAAT 
TGCAGAATAAGAGCCAACAGCTTCGTGCCTGGGAGGCTGAAAAATACAATGAGATTCG 
AACCCAGGAACAAAACATCCAGCACCTAAACCATAGTCTGAGTCACAAGGAGCAGTTG 
CTTCAGGAATTTCGGGAGCTCCTACAGTATCGAGATAACTCAGACAAAACCCTTGAAG 
CAAATGAAATGTTGCTTGAGAAACTTCGCCAGCGAATACATGATAAAGCTGTTGCTCT 
GGAGCGGGCTATAGATGAAAAATTCTCTGCTCTAGAAGAGAAAGAAAAAGAACTGCGC 
CAGCTTCGTCTTGCTGTGAGAGAGCGAGATCATGACTTAGAGAGACTGCGCGATGTCC 
TCTCCTCCAATGAAGCTACTATGCAAAGTATGGAGAGTCTCCTGAGGGCCAAAGGCCT 
GGAAGTGGAACAGTTATCTACTACCTGTCAAAACCTCCAGTGGCTGAAAGAAGAAATG 
GAAACCAAATTTAGCCGTTGGCAGAAGGAACAAGAGAGTATCATTCAGCAGTTACAGA 
CGTCTCTTCATGATAGGAACAAAGAAGTGGAGGATCTTAGTGCAACACTGCTCTGCAA 
ACTTGGACCAGGGCAGAGTGAGATAGCAGAGGAGCTGTGCCAGCGTCTACAGCGAAAG 
GAAAGGATGCTGCAGGACCTTCTAAGTGATCGAAATAAACAAGTGCTGGAACATGAAA 
TGGAGATTCAAGGCCTGCTTCAGTCTGTGAGCACCAGGGAGCAGGAAAGCCAAGCTGC 
TGCAGAGAAGTTGGTGCAAGCCTTAATGGAAAGAAATTCAGAATTACAGGCCCTGCGC 
CAATATTTAGGAGGGAGAGACTCCCTGATGTCCCAAGCACCCATCTCTAACCAACAAG 
CTGAAGTTACCCCCACTGGCCGTCTTGGAAAACAGACTGATCAAGGTTCAATGCAGAT 
ACCTTCCAGAGATGATAGCACTTCATTGACTGCCAAAGAGGATGTCAGCATACCCAGA 
' - " J - --~ n A ■ "-TTTGGACACAGTTGCAGGGCTGGAAAAAGAACTGAGTAATGCCA 
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AAGAGGAACTTGAACTCATGGCTAAAAAAGAAAGAGAATCACAGATGGAACTTTCTGC 
TCTACAGTCCATGATGGCTGTGCAGGAAGAAGAGCTGCAGGTGCAGGCTGCTGATATG 
GAGTCTCTGACCAGGAACATACAGATTAAAGAAGATCTCATAAAGGACCTGCAAATGC 
AACTGGTTGATCCTGAAGACATACCAGCTATGGAACGCCTGACCCAGGAAGTCTTACT 
TCTTCGGGAAAAAGTTGCTTCAGTAGAATCCCAGGGTCAAGAAATTTCAGGAAACCGA 
AGACAACAGCAGTTGCTGCTGATGCTAGAAGGACTAGTAGATGAACGGAGTCGGCTCA 
ATGAGGCCTTACAAGCAGAGAGACAGCTCTATAGCAGTCTGGTGAAGTTCCATGCCCA 
TCCAGAGAGCTCTGAGAGAGACCGAACTCTGCAGGTGGAACTGGAAGGGGCTCAGGTG 
TTACGCAGTCGGCTAGAAGAAGTTCTTGGAAGAAGCTTGGAGCGCTTAAACAGGCTGG 
AGACCCTGGCCGCCATTGGAGGTGCAGCTGCAGGGGATGACACCGAAGATACAAGCAC 
I TGAGTTCACTGACAGTATTGAGGAGGAGGCTGCACACCATAGTCAC CAGCAACTTGTC 
AAGGTGGCTTTGGAGAAAAGTCTGGCAACTGTGGAGACCCAGAACCCATCTTTTTCCC 
CTCCTTCTCCGATGGGAGGGGACAGTAACAGGTGTCTTCAGGAAGAAATGCTCCACCT 
GAGGGCTGAGATCCACCAGCACTTAGAAGAGAAGAGGAAAGCTGAGGAGGAACTGAAG 
GAGCTAAAGGCTCAAATTGAGGAAGCAGGATTCTCCTCAGTGTCCCACATCAGGAACA 
CCATGCTGAGCCTTTGCCTTGAGAATGCGGAGCTGAAAGAGCAGATGGGAGAAACAAT 
GTCTGATGGATGGGAGATCGAGGAAGACAAGGAGAAGGGCGAGGTGATGGTTGAGACT 
GTGGTAACCAAAGAGGGTCTGAGTGAGAGTAGCCTTCAGGCTGAGTTCAGAAAGCTCC 
AGGGAAAACTGAAGAATGCCCACAATATCATCAACCTCCTCAAAGAACAACTTGTGCT 
GAGTAGCAAGGAAGGGAATAGTAAACTTACTCCAGAGCTCCTTGTGCATCTGACCAGC 
ACCATCGAAAGAATAAACACAGAACTGGTTGGTTCCCCTGGGAAGCACCAACACCAAG 
AGGAGGGGAATGTGACTGTGAGGCCTTTCCCCAGACCCCAGAGCCTTGACCTTGGGGC 
TACCTTCACAGTGGATGCCCACCAACAGTTGGATAACCAGTCCCAGCCTCGTGACCCT 
GGGCCTCAGCCAGCGTTTAGCCTACCAGGGTCCACCCAGCACCTGCGCTCCCAGCTGT 
CACAATGCAAACAACGCTATCAAGATCTCCAGGAGAAGCTGCTGCTATCAGAAGCCAC 
TGTCTTTGCTCAGGCTAACGAGCTGGAGAAATACAGAGTTATGCTTAGTGAATCCTTG 
GTGAAGCAGGACAGCAAGCAGATCCAGGTGGACTTCCAGGACCTGGGCTATGAGACTT 
GTGGCCGAAGCGAGAATGAGGCTGAACGGGAGGAAACCACCAGTCCTGAGTGTGAGGA 
GCACAACAGCCTCAAGGAAATGGTCCTGATGGAGGGGCTGTGCTCTGAGCAGGGACGC 
CGGGGCTCAACACTGGCTAGTTCCTCTGAGAGGAAGCCCTTGGAGAACCAGCTAGGGA 
AGCAGGAAGAGTTCCGGGTATATGGAAAGTCAGAAAACATCTTGGTCCTACGAAAGGA 
CATCGAAGATCTGAAGGCCCAGCTGCAGAATGCCAACAAGGTCATTCAAAACCTCAAG 
AGCCGGGTCCGGTCCCTCTCAGTTACAAGTGATTATTCGTCTAGTCTGGAAAGACCCC 
GGAAGCTGAGAGCTGTTGGCACCTTGGAGGGGTCTTCACCTCATAGTGTCCCTGATGA 
GGATGAGGGGTGGCTGTCTGATGGCACTGGGGCTTTCTACTCTCCAGGGCTTCAGGCC 
AAAAAGGACCTGGAGAGTCTCATCCAGAGAGTATCCCAGCTGGAGGCCCAGCTCCCAG 
AAAATGGACTAGAAGAGAAGCTGGCTGAGGAGCTGAGATCAGCCTCGTGGCCTGGGAA 
ATATGATTCCCTGATTCAGGATCAGGCCCGGGAACTGTCTTACCTACGGCAAAAAATA 
CGAGAAGGGAGAGGTATTTGTTATCTTATCACCCAGCATGCAAAAGATACAGTAAAAT 
CTTTTGAGGATCTCCTAAGGAGCAATGACATTGACTACTACCTGGGACAGAGCTTCCG 
GGAGCAACTCGCCCAGGGAAGCCAGCTGACAGAGAGGCTCACCAGCAAACTCAGCACA 
GAGGATCATAAAAGTGAGAAAGATCAAGCTGGACTTGAGCCACTGGCCCTCAGGCTCA 
GCAGGGAGCTGCAGGAGAAGGAGAAAGTGATTGAAGTCCTGCAGGCCAAGCTGGATGC 
TCGGTCCCTCACACCCTCCAGCAGCCGTGCCTTGTCTGACTCCCACCGCTCTCCCAGC 
AGCACCTCTTTCCTGTCTGATGAGCTGGAAGCCTGCTCTGACATGGACATAGTCAGCG 
AGTACACACACTATGAAGAGAAGAAAGCTTCTCCCAGTCACTCAGGTAGCAGTGCATC 
TCAGGGGGCTAAGGCCGAATCCAACAGCAACCCCATCAGCTTGCCAACTCCCCAGAAT 
ACCCCCAAGGAGGCCAACCAAGCCCATTCAGGCTTTCATTTTCACTCCATACCCAAGC 
TGGCTAGCCTTCCTCAGGCACCATTGCCCTCAGCTCCATCCAGCTTCCTGCCTTTCAG 
CCCCACTGGCCCTCCCCTCCTTGGCTGCTGTGAGACACCAGAGGTCTCCTTGGCTGAG 
TCTCAGCAGGAGCTACAGATGCTGCAGAAGCAGTTGGGAGAAAGTAGCACTGTTCCTC 
CTGCTTCCACAGCTACATTGCTGAGCAACGACTTGGAAGCCGACTCTTCCTACTACCT 
CAACTCTGCCCAGCCTCACTCTCCTCCAAGGGGCACCATAGAACTGGGAAGAATCCTA 
GAGCCTGGGTACCTGGGCAGCAGTGGCAAGTGGGATGTGATGAGGCCTCAGAAAGGGA 
GTGTATCTGGGGACCTATCCTCAGGCTCCTCTGTGTACCAGCTTAACTCCAAACCCAC 
AGGGGCTGACCTGCTGGAAGAGCATCTTGGTGAAATCTGGAACCTGCGCCAGCGCCTG 
GAGGAGTCCATCTGCATCAATGACTGCCTACGGGAGCAACTGGAACACCGGCTGACCT 
CTACTGCTCGTGGAAGGGGATCCACTTCTAACTTCTACAGTCAGGGCCTGGAGTCCAT 
ACCTCAGCTCTGCAATGAGAACAGAGTCCTCAGGGAAGAAAATCGAAGACTTCAGGCT 
CAACTGAGTCATGTTTCCAGAGGTCACTCCCAGGAAACAGAAAGCCTGAGGGAGGCTC 
TGCTGTCCTCTCGATCCCACCTTCAAGAGCTGGAAAAGGAGCTGGAGCACCAGAAGGT 
GGAAAGGCAGCAGCTTTTGGAAGACTTGAGGGAGAAGCAGCAAGAGGTCTTGCATTTC 
AGGGAGGAACGTCTTTCCCTCCAGGAAAACGACTCCAGACTGCAGCACAAGCTGGTTC 
TCCTGCAGCAACAGTGTGAAGAGAAACAGCAGCTCTTTGAGTCCCTCCAGTCAGAGCT 
ACAAATCTACGAGGCACTTTATGGCAATTCCAAGAAGGGGCTGAAAGCTTACAGCCTG 
GATGCCTGTCACCAAATCCCTTTGAGCAGTGACCTGAGCCACCTGGTGGCAGAGGTAC 
AAGCTCTGAGAGGGCAGCTGGAGCAGAGCATTCAGGGGAACAATTGTCTGCGACTGCA 
GCTGCAACAGCAGCTGGAGAGCGGTGCTGGCAAAGCCAGCCTCAGCCCCTCCTCCATT 
AACCAGAACTTCCCAGCCAGCACTGACCCTGGAAACAAGCAGCTGCTCCTCCAAGGTT 
CAGCTGTGTCCCCTCCAGTCCGGGATGTTGGTATGAATTCCCCAGCTCTGGTCTTCCC 
CAGCTCTGCTTCCTCTACTCCTGGCTCAGATTCAGTTGTGTTGTCATTTTCTTTTTCA 
GGCTTGGGTTTGGATACTTCTCCAGTAATGAAGACCCCTCCCAAGCTAGAGGGTGATG 
CTACTGATGGCTCCTTTGCCAATAAGCATGGCCGCCATGTCATTGGCCACATTGATGA 
CTACAGTGCCCTAAGACAGCAGATTGCGGAGGGCAAGCTGCTGGTCAAAAAGATAGTG 

GCAAAGGCATTCATGAGCTTCGGAGCAGCACCAGTGCCCTGCACCATGCCCTAGAGGA 
GTCGGCTTCCCTCCTCACCATGTTCTGGAGAGCGGCCCTGCCAAGCACCCACATCCCT 
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NOV103b, 
( (.59773-02 Protein 
| Sequence 



NOV 103c, 

CG59773-03 DNA Sequence 



GTGCTGCCTGGCAAACAGGGAGAATCAACAGAAAGGGAACTTCTGGAACTGAGAACCA 
AAGTATCCAAACAGGAGCAGCTCCTTCAGAGCACAACTGAGCATCTGAAGAACGCCAA 
CCAGCAGAAGGAGAGCATGGAACAGTTCATTGTCAGCGTAACCAGAACACATGATGTT 
TTAAAGAAGGCAAGGACTAACTTAGAGGTGAAATCCCTAAGGGCTCTGCCGTGTACTC 
CAGCCTTGTS ACCCTTGCCTTCCAGGAACCATGCAAGAAGCGCAGCCACCAGAAGTCC 



TTAAAACAGCAGGAAAGGTGAGCCTGTCCCCCTTTTGTGC/ 



GAGCATCTGGGCCTCATTCCTCCAAGT 



:tgctgag 



ORF Start: ATG at 46 



SEQ ID NO: 294 



ORF Stop: TGA at 7027 



2327 aa MW at 263034.6kD 



msngyrtlsqhlndlkkenfslklriyfleermqqkyeasrediykrnielkvevesl 

KRELQDKKQHLDKTWADVENLNSQNEAELRRQFEERQQETEHVYELLENKIQLLQEES 
RLAKNEAARMAALVEAEKECNLELSEKLKGVTKNWEDVPGDQVKPDQYTEALAQRDRR 
IEELNQSLAAQERLVEQLSREKQQLLHLLEEPTSMEVQPMTEELLKQQKLNSHETTIT 
QQSVSDSHLAELQEKIQQTEATNKILQEKLNEMSYELKCAQESSQKQDGTIQNLKETL 
KSRERETEELYQVIEGQNDTMAKLREMLHQSQLGQLQSSEGTSPAQQQVALLDLQSAL 
FCSQLEIQKLQRWRQKERQLADAKQCVQFVEAAAHESEQQKEASWKHNQELRKALQQ 
LQEELQNKSQQLRAWEAEKYNEIRTQEQNIQHLNHSLSHKEQLLQEFRELLQYRDNSD 
KTLEANEMLLEKLRQRIHDKAVALERAIDEKFSALEEKEKELRQLRLAVRERDHDLER 
LRDVLSSNEATMQSMESLLRAKGLEVEQLSTTCQNLQWLKEEMETKFSRWQKEQESII 
QQLQTSLHDRNKEVEDLSATLLCKLGPGQSEIAEELCQRLQRKERMLQDLLSDRNKQV 
LEHEMEIQGLLQSVSTREQESQAAAEKLVQALMERNSELQALRQYLGGRDSLMSQAPI 
SNQQAEVTPTGRLGKQTDQGSMQIPSRDDSTSKTAKEDVSIPRSTLGDLDTVAGLEKE 
LSNAKEELELMAKKERESQMELSALQSMMAVQEEELQVQAADMESLTRNIQIKEDLIK 
DLQMQLVDPEDIPAMERLTQEVLLLREKVASVESQGQEISGNRRQQQLLLMLEGLVDE 
RSRLNEALQAERQLYSSLVKFHAHPESSERDRTLQVELEGAQVLRSRLEEVLGRSLER 
LNRLETLAAIGGAAAGDDTEDTSTEFTDSIEEEAAHHSHQQLVKVALEKSLATVETQN 
PSFSPPSPMGGDSNRCLQEEMLHLRAEIHQHLEEKRKAEEELKELKAQIEEAGFSSVS 
HIRNTMLSLCLENAELKEQMGETMSDGWEIEEDKEKGEVMVETWTKEGLSESSLQAE 
FRKLQGKLKNAHNIINLLKEQLVLSSKEGNSKLTPELLVHLTSTIERINTELVGSPGK 
HQHQEEGNVTVRPFPRPQSLDLGATFTVDAHQQLDNQSQPRDPGPQPAFSLPGSTQHL 
RSQLSQCKQRYQDLQEKLLLSEATVFAQANELEKYRVMLSESLVKQDSKQIQVDFQDL 
GYETCGRSENEAEREETTSPECEEHNSLKEMVLMEGLCSEQGRRGSTLASSSERKPLE 
NQLGKQEEFRVYGKSENI LVLRKDIEDLKAQLQNANKVI QNLKSRVRSLSVTSDYSSS 
LERPRKLRAVGTLEGSSPHSVPDEDEGWLSDGTGAFYSPGLQAKKDLESLIQRVSQLE 
AQLPENGLEEKLAEELRSASWPGKYDSLIQDQARELSYLRQKIREGRGICYLITQHAK 
DTVKSFEDLLRSNDIDYYLGQSFREQLAQGSQLTERLTSKLSTEDHKSEKDQAGLEPL 
ALRLSRELQEKEKVIEVLQAKLDARSLTPSSSRALSDSHRSPSSTSFLSDELEACSDM 
DIVSEYTHYEEKKASPSHSGSSASQGAKAESNSNPISLPTPQNTPKEANQAHSGFHFH 
SIPKLASLPQAPLPSAPSSFLPFSPTGPPLLGCCETPEVSLAESQQELQMLQKQLGES 
STVPPASTATLLSNDLEADSSYYLNSAQPHSPPRGTIELGRILEPGYLGSSGKWDVMR 
PQKGSVSGDLSSGSSVYQLNSKPTGADLLEEHLGEIWNLRQRLEESICINDCLREQLE 
HRLTSTARGRGSTSNFYSQGLESIPQLCNEWRVLREENRRLQAQLSHVSRGHSQETES 
LREALLSSRSHLQELEKELEHQKVERQQLLEDLREKQQEVLHFREERLSLQENDSRLQ 
HKLVLLQQQCEEKQQLFESLQSELQIYEALYGNSKKGLKAYSLDACHQIPLSSDLSHL 
VAEVQALRGQLEQSIQGNNCLRLQLQQQLESGAGKASLSPSSINQNFPASTDPGNKQL 
LLQGSAVSPPVRDVGMNSPALVFPSSASSTPGSDSWLSFSFSGLGLDTSPVMKTPPK 
LEGDATDGSFANKHGRHVIGHIDDYSALRQQIAEGKLLVKKIVSLVRSACSFPGLEAQ 
GTEGSKGIHELRSSTSALHHALEESASLLTMFWRAALPSTHIPVLPGKQGESTERELL 
ELRTKVSKQEQLLQSTTEHLKNANQQKESMEQFIVSVTRTHDVLKKARTNLEVKSLRA 
LPCTPAL 



SEQ ID NO: 295 



1 7084 bp 



GTTGAGGGGGCAATCGGGCACGCTCCTCCCCATGGGTTGCCCATCATGTCTAATGGAT 
A TCGCACTCTGTCCC AG CACCTCAATGACCT GAA GAAGGAGAACTTCAGCCTCAAGCT 

j I T-, IT 1 TGCAACAGAAGT ATGAGGCCAGCCGGGAGGAC 

ATCTACAAGCGGGGGTGATGTGGAGAATCTCAACAGTCAGAATGAAGCTGAGCTCCGA 
CGCCAGTTTGAGGAGCGACAGCAGGAGACGGAGCATGTTTATGAGCTCTTGGAGAATA 
AGATCCAGCTTCTGCAGGAGGAATCCAGGCTAGCAAAGAATGAAGCTGCGCGGATGGC 
AGCTCTGGTGGAAGCAGAGAAGGAGTGTAACCTGGAGCTCTCAGAGAAACTGAAGGGA 
GTCACCAAAAACTGGGAAGATGTACCAGGAGACCAGGTCAAGCCCGACCAATACACTG 
AGACCCTGGCCCAGAGGGACAAGAGAATTGAAGAACTGAATCAGAGCCTGGCTGCCCA 
GGAGAGGCTTGTAGAACAGCTATCTCGGGAGAAACAACAACTGCTACATCTGTTGGAG 
GAGCCAACTAGCATGGAAGTGCAGCCCATGACTGAAGAGTTGCTGAAACAACAAAAGC 
TGAATTCACATGAGACCACTATAACTCAGCAGTCTGTATCTGATTCCCACTTGGCAGA 
ACTCCAGGAAAAAATCCAGCAAACAGAGGCCACCAACAAGATTCTTCAAGAGAAACTT 
AATGAAATGAGCTATGAACTAAAGTGTGCTCAGGAGTCGTCTCAAAAGCAAGATGGTA 
CAATTCAGAACCTCAAGGAAACTCTGAAAAGCAGGGAACGTGAGACTGAGGAGTTGTA 
CCAGGTAATTGAAGGTCAAAATGACACAATGGCAAAGCTTCGAGAAATGCTGCACCAA 
AGCCAGCTTGGACAACTTCAGAGCTCAGAGGGTACTTCTCCAGCTCAGCAACAGGTAG 
CTCTGCTTGATCTTCAGAGTGCTTTATTCTGCAGCCAACTTGAAATACAGAAGCTCCA 
GAGGGTGGTACGACAGAAAGAGCGCCAACTGGCTGATGCCAAACAATGTGTGCAATTT 
GTAGAGGCTGCAGCACACGAGAGTGAACAGCAGAAAGAGGCTTCTTGGAAACATAACC 
AGGAATTGCGAAAAGCCTTGCAGCAGCTACAAGAAGAATTGCAGAATAAGAGCCAACA 
GCTTCGTGCCTGGGAGGCTGAAAAATACAATGAGATTCGAACCCAGGAACAAAACATC 
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CAGCACCTAAACCATAGTCTGAGTCACAAGGAGCAGTTGCTTCAGGAATTTCGGGAGC 
TCCTACAGTATCGAGATAACTCAGACAAAACCCTTGAAGCAAATGAAATGTTGCTTGA 
GAAACTTCGCCAGCGAATACATGATAAAGCTGTTGCTCTGGAGCGGGCTATAGATGAA 
AAATTCTCTGCTCTAGAAGAGAAAGAAAAAGAACTGCGCCAGCTTCGTCTTGCTGTGA 
GAGAGCGAGATCATGACTTAGAGAGACTGCGCGATGTCCTCTCCTCCAATGAAGCTAC 
TATGCAAAGTATGGAGAGTCTCCTGAGGGCCAAAGGCCTGGAAGTGGAACAGTTATCT 
ACTACCTGTCAAAACCTCCAGTGGCTGAAAGAAGAAATGGAAACCAAATTTAGCCGTT 
GGCAGAAGGAACAAGAGAGTATCATTCAGCAGTTACAGACGTCTCTTCATGATAGGAA 
CAAAGAAGTGGAGGATCTTAGTGCAACACTGCTCTGCAAACTTGGACCAGGGCAGAGT 
GAGATAGCAGAGGAGCTGTGCCAGCGTCTACAGCGAAAGGAAAGGATGCTGCAGGACC 
TTCTAAGTGATCGAAATAAACAAGTGCTGGAACATGAAATGGAGATTCAAGGCCTGCT 
TCAGTCTGTGAGCACCAGGGAGCAGGAAAGCCAAGCTGCTGCAGAGAAGTTGGTGCAA 
GCCTTAATGGAAAGAAATTCAGAATTACAGGCCCTGCGCCAATATTTAGGAGGGAGAG 
ACTCCCTGATGTCCCAAGCACCCATCTCTAACCAACAAGCTGAAGTTACCCCCACTGG 
CCGTCTTGGAAAACAGACTGATCAAGGTTCAATGCAGATACCTTCCAGAGATGATAGC 
ACTTCATTGACTGCCAAAGAGGATGTCAGCATACCCAGATCCACATTAGGAGATTTGG 
ACACAGTTGCAGGGCTGGAAAAAGAACTGAGTAATGCCAAAGAGGAACTTGAACTCAT 
GGCTAAAAAAGAAAGAGAATCACAGATGGAACTTTCTGCTCTACAGTCCATGATGGCT 
GTGCAGGAAGAAGAGCTGCAGGTGCAGGCTGCTGATATGGAGTCTCTGACCAGGAACA 
TACAGATTAAAGAAGATCTCATAAAGGACCTGCAAATGCAACTGGTTGATCCTGAAGA 
CATACCAGCTATGGAACGCCTGACCCAGGAAGTCTTACTTCTTCGGGAAAAAGTTGCT 
TCAGTAGAATCCCAGGGTCAAGAAATTTCAGGAAACCGAAGACAACAGCAGTTGCTGC 
TGATGCTAGAAGGACTAGTAGATGAACGGAGTCGGCTCAATGAGGCCTTACAAGCAGA 
GAGACAGCTCTATAGCAGTCTGGTGAAGTTCCATGCCCATCCAGAGAGCTCTGAGAGA 
GACCGAACTCTGCAGGTGGAACTGGAAGGGGCTCAGGTGTTACGCAGTCGGCTAGAAG 
AAGTTCTTGGAAGAAGCTTGGAGCGCTTAAACAGGCTGGAGACCCTGGCCGCCATTGG 
AGGTGCAGCTGCAGGGGATGACACCGAAGATACAAGCACTGAGTTCACTGACAGTATT 
GAGGAGGAGGCTGCACACCATAGTCACCAGCAACTTGTCAAGGTGGCTTTGGAGAAAA 



GGACAGTAACAGGTGTCTTCAGGAAGAAATGCTCCACCTGAGGGCTGAGATCCACCAG 
CACTTAGAAGAGAAGAGGAAAGCTGAGGAGGAACTGAAGGAGCTAAAGGCTCAAATTG 
AGGAAGCAGGATTCTCCTCAGTGTCCCACATCAGGAACACCATGCTGAGCCTTTGCCT 
TGAGAATGCGGAGCTGAAAGAGCAGATGGGAGAAACAATGTCTGATGGATGGGAGATC 
GAGGAAGACAAGGAGAAGGGCGAGGTGATGGTTGAGACTGTGGTAACCAAAGAGGGTC 
TGAGTGAGAGTAGCCTTCAGGCTGAGTTCAGAAAGCTCCAGGGAAAACTGAAGAATGC 
CCACAATATCATCAACCTCCTCAAAGAACAACTTGTGCTGAGTAGCAAGGAAGGGAAT 
AGTAAACTTACTCCAGAGCTCCTTGTGCATCTGACCAGCACCATCGAAAGAATAAACA 
CAGAACTGGTTGGTTCCCCTGGGAAGCACCAACACCAAGAGGAGGGGAATGTGACTGT 
GAGGCCTTTCCCCAGACCCCAGAGCCTTGACCTTGGGGCTACCTTCACAGTGGATGCC 
CACCAACAGTTGGATAACCAGTCCCAGCCTCGTGACCCTGGGCCTCAGCCAGCGTTTA 
GCCTACCAGGGTCCACCCAGCACCTGCGCTCCCAGCTGTCACAATGCAAACAACGCTA 
TCAAGATCTCCAGGAGAAGCTGCTGCTATCAGAAGCCACTGTCTTTGCTCAGGCTAAC 
GAGCTGGAGAAATACAGAGTTATGCTTAGTGAATCCTTGGTGAAGCAGGACAGCAAGC 
AGATCCAGGTGGACTTCCAGGACCTGGGCTATGAGACTTGTGGCCGAAGCGAGAATGA 
GGCTGAACGGGAGGAAACCACCAGTCCTGAGTGTGAGGAGCACAACAGCCTCAAGGAA 
ATGGTCCTGATGGAGGGGCTGTGCTCTGAGCAGGGACGCCGGGGCTCAACACTGGCTA 
GTTCCTCTGAGAGGAAGCCCTTGGAGAACCAGCTAGGGAAGCAGGAAGAGTTCCGGGT 
ATATGGAAAGTCAGAAAACATCTTGGTCCTACGAAAGGACATCGAAGATCTGAAGGCC 
CAGCTGCAGAATGCCAACAAGGTCATTCAAAACCTCAAGAGCCGGGTCCGGTCCCTCT 
CAGTTACAAGTGATTATTCGTCTAGTCTGGAAAGACCCCGGAAGCTGAGAGCTGTTGG 
CACCTTGGAGGGGTCTTCACCTCATAGTGTCCCTGATGAGGATGAGGGGTGGCTGTCT 
GATGGCACTGGGGCTTTCTACTCTCCAGGGCTTCAGGCCAAAAAGGACCTGGAGAGTC 
TCATCCAGAGAGTATCCCAGCTGGAGGCCCAGCTCCCAGAAAATGGACTAGAAGAGAA 
GCTGGCTGAGGAGCTGAGATCAGCCTCGTGGCCTGGGAAATATGATTCCCTGATTCAG 
GATCAGGCCCGGGAACTGTCTTACCTACGGCAAAAAATACGAGAAGGGAGAGGTATTT 
GTTATCTTATCACCCAGCATGCAAAAGATACAGTAAAATCTTTTGAGGATCTCCTAAG 
GAGCAATGACATTGACTACTACCTGGGACAGAGCTTCCGGGAGCAACTCGCCCAGGGA 
AGCCAGCTGACAGAGAGGCTCACCAGCAAACTCAGCACAGAGGATCATAAAAGTGAGA 
AAGATCAAGCTGGACTTGAGCCACTGGCCCTCAGGCTCAGCAGGGAGCTGCAGGAGAA 
GGAGAAAGTGATTGAAGTCCTGCAGGCCAAGCTGGATGCTCGGTCCCTCACACCCTCC 
AGCAGCCGTGCCTTGTCTGACTCCCACCGCTCTCCCAGCAGCACCTCTTTCCTGTCTG 
ATGAGCTGGAAGCCTGCTCTGACATGGACATAGTCAGCGAGTACACACACTATGAAGA 
GAAGAAAGCTTCTCCCAGTCACTCAGGTAGCAGTGCATCTCAGGGGGCTAAGGCCGAA 
TCCAACAGCAACCCCATCAGCTTGCCAACTCCCCAGAATACCCCCAAGGAGGCCAACC 
AAGCCCATTCAGGCTTTCATTTTCACTCCATACCCAAGCTGGCTAGCCTTCCTCAGGC 
ACCATTGCCCTCAGCTCCATCCAGCTTCCTGCCTTTCAGCCCCACTGGCCCTCCCCTC 
CTTGGCTGCTGTGAGACACCAGAGGTCTCCTTGGCTGAGTCTCAGCAGGAGCTACAGA 
TGCTGCAGAAGCAGTTGGGAGAAAGTAGCACTGTTCCTCCTGCTTCCACAGCTACATT 
GCTGAGCAACGACTTGGAAGCCGACTCTTCCTACTACCTCAACTCTGCCCAGCCTCAC 
TCTCCTCCAAGGGGCACCATAGAACTGGGAAGAATCCTAGAGCCTGGGTACCTGGGCA 
GCAGTGGCAAGTGGGATGTGATGAGGCCTCAGAAAGGGAGTGTATCTGGGGACCTATC 
CTCAGGCTCCTCTGTGTACCAGCTTAACTCCAAACCCACAGGGGCTGACCTGCTGGAA 
GAGCATCTTGGTGAAATCTGGAACCTGCGCCAGCGCCTGGAGGAGTCCATCTGCATCA 
ATGACTGCCTACGGGAGCAACTGGAACACCGGCTGACCTCTACTGCTCGTGGAAGGGG 
ATCCACTTCTAACTTCTACAGTCAGGGCCTGGAGTCCATACCTCAGCTCTGCAATGAG 
AACAGAGTCCTCAGGGAAGAAAATCGAAGACTTCAGGCTCAACTGAGTCATGTTTCCA 
GAGGTCACTCCCAGGAAACAGAAAGCCTGAGGGAGGCTCTGCTGTCCTCTCGATCCCA 
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CCTTCMGAGCTGGAAAAGGAGCTGGAGCACCAGAAGGTGGAAAGGCAGCAGCTTTTG 
GAAGACTTGAGGGAGAAGCAGCAAGAGGTCTTGCATTTCAGGGAGGAACGTCTTTCCC 




AGAGAAACAGCAGCTCTTTGAGTCCCTCCAGTCAGAGCTACAAATCTACGAGGCACTT 
TATGGCAATTCCAAGAAGGGGCTGAAAGCTTACAGCCTGGATGCCTGTCACCAAATCC 
CTTTGAGCAGTGACCTGAGCCACCTGGTGGCAGAGGTACAAGCTCTGAGAGGGCAGCT 
GGAGCAGAGCATTCAGGGGAACAATTGTCTGCGACTGCAGCTGCAACAGCAGCTGGAG 
AGCGGTGCTGGCAAAGCCAGCCTCAGCCCCTCCTCCATTAACCAGAACTTCCCAGCCA 
GCACTGACCCTGGAAACAAGCAGCTGCTCCTCCAAGGTTCAGCTGTGTCCCCTCCAGT 
CCGGGATGTTGGTATGAATTCCCCAGCTCTGGTCTTCCCCAGCTCTGCTTCCTCTACT 
CCTGGCTCAGATTCAGTTGTGTTGTCATTTTCTTTTTCAGGCTTGGGTTTGGATACTT 

CAATAAGCATGGCCGCCATGTCATTGGCCACATTGATGACTACAGTGCCCTAAGACAG 
CAGATTGCGGAGGGCAAGCTGCTGGTCAAAAAGATAGTGTCTCTTGTGAGATCAGCGT 
GCAGCTTCCCTGGCCTTGAAGCCCAAGGCACAGAGGGCAGCAAAGGCATTCATGAGCT 
TCGGAGCAGCACCAGTGCCCTGCACCATGCCCTAGAGGAGTCGGCTTCCCTCCTCACC 
ATGTTCTGGAGAGCGGCCCTGCCAAGCACCCACATCCCTGTGCTGCCTGGCAAACAGG 
GAGAATCAACAGAAAGGGAACTTCTGGAACTGAGAACCAAAGTATCCAAACAGGAGCA 
GCTCCTTCAGAGCACAACTGAGCATCTGAAGAACGCCAACCAGCAGAAGGAGAGCATG 
GAACAGTTCATTGTCAGCGTAACCAGAACACATGATGTTTTAAAGAAGGCAAGGACTA 
ACTTAGAGGTGAAATCCCTAAGGGCTCTGCCGTGTACTCCAGCCTTGTGACCCTTGCC 
TTCCAGGAACCATGCAAGAAGCGCAGCCACCAGAAGTCCTTAAAACAGCAGGAAAGGT 


GAGCCTGTCCCCCTTTTGTGCAGCTACCTATCTGCTGAGGA ;c L TC TG< ; 3CCTCATTC 


CTCCAAGT 


L — 


OPvF Start: ATG at 
155 


ORF Stop: TGA at 6950 




SEQ ID NO: 296 


2265 aa JMW at 25508 1.5kD 


NOV 103c, 
CG59773-03 Protein 
Sequence 


MRPAGRTSTSGGDVENLNSQNEAELRRQFEERQQETEHVYELLENKIQLLQEESRLAK 
NEAARMAALVEAEKECNLELSEKLKGVTKNWEDVPGDQVKPDQYTETLAQRDKRIEEL 
NQSLAAQERLVEQLSREKQQLLHLLEEPTSMEVQPMTEELLKQQKLNSHETTITQQSV 
SDSHEAELQEKIQQTEATNKILQEKLNEMSYELKCAQESSQKQDGTIQNLKETLKSRE 
RETEELYQVIEGQNDTMAKLREMLHQSQLGQLQSSEGTSPAQQQVALLDLQSALFCSQ 
LEIQKLQRWRQKERQLADAKQCVQFVEAAAHESEQQKEASWKHNQELRKALQQLQEE 
LQNKSQQLRAWEAEKYNEIRTQEQNIQHLNHSLSHKEQLLQEFRELLQYRDNSDKTLE 
ANEMLLEKLRQRIHDKAVALERAIDEKFSALEEKEKELRQLRLAVRERDHDLERLRDV 
LSSNEATMQSKESLLRAXGLEVEQLSTTCQNLQWLKEEMETKFSRWQKEQESIIQQLQ 
TSLHDRNKEVEDLSATLLCKLGPGQSEIAEELCQRLQRKERMLQDLLSDRNKQVLEHE 
MEIQGLLQSVSTREQESQAAAEKLVQALMERNSELQALRQYLGGRDSLMSQAPISNQQ 
AEVTPTGRLGKQTDQGSMQIPSRDDSTSLTAKEDVSIPRSTLGDLDTVAGLEKELSNA 
KEELELMAKKERESQMELSALQSMMAVQEEELQVQAADMESLTRNIQIKEDLIKDLQM 
QLVDPEDIPAMERLTQEVLLLREKVASVESQGQEISGNRRQQQLLLMLEGLVDERSRL 
NEALQAERQLYSSLVKFHAHPESSERDRTLQVELEGAQVLRSRLEEVLGRSLERLNRL 
ETLAAIGGAAAGDDTEDTSTEFTDSIEEEAAHHSHQQLVKVALEKSLATVETQNPSFS 
PPSPMGGDSNRCLQEEMLHLRAEIHQHLEEKRKAEEELKELKAQIEEAGFSSVSHIRN 
TMLSLCLENAELKEQMGETMSDGWEIEEDKEKGEVMVETWTKEGLSESSLQAEFRKL 
QGKLKNAHNIINLLKEQLVLSSKEGNSKLTPELLVHLTSTIERINTELVGSPGKHQHQ 
EEGNVTVRPFPRPQSLDLGATFTVDAHQQLDNQSQPRDPGPQPAFSLPGSTQHLRSQL 
SQCKQRYQDLQEKLLLSEATVFAQANELEKYRVMLSESLVKQDSKQIQVDFQDLGYET 
CGRSENEAEREETTSPECEEHNSLKEMVLMEGLCSEQGRRGSTLASSSERKPLENQLG 
KQEEFRVYGKSENILVLRKDIEDLKAQLQNANKVIQNLKSRVRSLSVTSDYSSSLERP 
RKLRAVGTLEGSSPHSVPDEDEGWLSDGTGAFYSPGLQAKKDLESLIQRVSQLEAQLP 
ENGLEEKLAEELRSASWPGKYDSLIQDQARELSYLRQKIREGRGICYLITQHAKDTVK 
SFEDLLRSNDIDYYLGQSFREQLAQGSQLTERLTSKLSTEDHKSEKDQAGLEPLALRL 
SRELQEKEKVIEVLQAKLDARSLTPSSSRALSDSHRSPSSTSFLSDELEACSDMDIVS 
EYTHYEEKKASPSHSGSSASQGAKAESNSNP I SLPTPQNTPKEANQAHSGFHFHS I PK 
LASLPQAPLPSAPSSFLPFSPTGPPLLGCCETPEVSLAESQQELQMLQKQLGESSTVP 
PASTATLLSNDLEADSSYYLNSAQPHSPPRGTIELGRILEPGYLGSSGKWDVMRPQKG 
SVSGDLSSGSSVYQLNSKPTGADLLEEHLGEIWNLRQRLEESICINDCLREQLEHRLT 
STARGRGSTSNFYSQGLES I PQLCNENRVLREENRRLQAQLSHVSRGHSQETESLREA 
LLSSRSHLQELEKELEHQKVERQQLLEDLREKQQEVLHFREERLSLQENDSRLQHKLV 
LLQQQCEEKQQLFESLQSELQIYEALYGNSKKGLKAYSLDACHQIPLSSDLSHLVAEV 
QALRGQLEQS I QGNNCLRLQLQQQLESGAGKASLS PSSI NQNF P ASTD PGNKQLLLQG 
SAVSPPVRDVGMNSPALVFPSSASSTPGSDSWLSFSFSGLGLDTSPVMKTPPKLEGD 
ATDGSFANKHGRHVIGHIDDYSALRQQIAEGKLLVKKIVSLVRSACSFPGLEAQGTEG 
SKGIHELRSSTSALHHALEESASLLTMFWRAALPSTHIPVLPGKQGESTERELLELRT 
KVSKQEQLLQSTTEHLKNANQQKESMEQFIVSVTRTHDVLKKARTNLEVKSLRALPCT 
PAL 



Sequence comparison of the above protein sequences yields the following sequence 
relationships shown in Table 103B. 
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Table 10311. Comparison of NOV103a against NOV103b through NOV103c. 


Protein Sequence 


NOV103a Residues/ 
Match Residues 


Identities/ 
Similarities for the Matched Region 


NOV103b 


365. .2196 
202..2016 


1510/1834(82%) 
1518/1834(82%) 


NOV 103c 


365..2196 
140..1954 


1510/1834 (82%) 
1518/1834 (82%) 



Further analysis of the NOV103a protein yielded the following properties shown in 
Table 103C. 



Table 103C Protein Sequence Properties NOV 103a 


PSort 
analysis: 


0.5855 probability located in mitochondrial matrix space; 0.4200 probability 
located in nucleus; 0.3000 probability located in microbody (peroxisome); 
0.2957 probability located in mitochondrial inner membrane 


SignalP 

analysis: 


Likely cleavage site between residues 39 and 40 



A search of the NOV 103a protein against the Geneseq database, a proprietary 
database that contains sequences published in patents and patent publication, yielded 
several homologous proteins shown in Table 103D. 



Table 103D. Geneseq Results for NO VI 03a 



Geneseq 
Identifier 


Protein/Organism/Length 
[Patent #, Date] 


NOV103a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Region 


Expect 
Value 


AAY71159 


Human phosphodiesterase 
interacting protein, myomegalin - 
Homo sapiens, 2517 aa. 
[WO200027861-A1, 18-MAY- 
2000] 


1..2196 
1..2204 


2193/2204 (99%) 
2193/2204 (99%) 


0.0 


AAM40183 


Human polypeptide SEQ ID NO 
3328 - Homo sapiens, 1883 aa. 
[WO200153312-A1, 26-JUL- 
2001] 


635..2196 
1..1570 


1557/1570 (99%) 
1559/1570 (99%) 


0.0 


AAY71158 


Rat phosphodiesterase interacting 
protein, myomegalin - Rattus sp, 
2326 aa. [WO200027861-A1, 18- 
MAY-2000] 


365..2197 
202..2017 


1433/1837(78%) 
1572/1837(85%) 


0.0 
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AAY67600 


Human adipose tissue protein #3 - 
Homo sapiens, 944 aa. 
[JP2000037190-A, 08-FEB-2000] 


1..934 
1..934 


925/934 (99%) 
927/934 (99%) 


0.0 


AAU01768 


Human secreted protein #47 - 
Homo sapiens, 934 aa. 
[WO200123546-A1, 05-APR- 
2001] 


365..1102 
j 1 97..934 


730/738 (98%) 
733/738 (98%) 


0.0 


In a BLAST search of public sequence databases, the NOV103a protein was found 
to have homology to the proteins shown in the BLASTP data in Table 103E. 


Table 103E. Public BLASTP Results for NOV103a 


Protein 
Accession 
Number 


Protein/ Or ganism/Len gth 


NOV103a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for the 
Matched Portion 


Expect 
Value 


075042 


KIAA0454 PROTEIN - Homo 
sapiens (Human), 1882 aa 
(fragment). 


636..2196 
1..1569 


1558/1569 (99%) 
1558/1569 (99%) 


0.0 j 


Q9WUJ3 


MYOMEGALIN - Rattus 
norvegicus (Rat), 2324 aa. 


365..2197 
202..2015 


1444/1838(78%) 
1581/1838 (85%) 


0.0 


075065 


KIAA0477 PROTEIN - Homo 
sapiens (Human), 1 132 aa. 


1..1132 
1..1132 


1132/1132(100%) 
1132/1132(100%) 


0.0 


Q25893 


LIVER STAGE ANTIGEN - 
Plasmodium falciparum (isolate 
NF54), 1909aa. 


356..1459 
605.. 1651 


243/1129 (21%) 
488/1129(42%) 


4e-35 


Q13439 


Golgi autoantigen, golgin 
subfamily A 4 (Trans-Golgi p230) 
(256 kDa golgin) (Golgin-245) 
(72.1 protein) - Homo sapiens 
(Human), 2230 aa. 


229.. 1749 
267..1814 


349/1638 (21%) 
679/1638 (41%) 


4e-34 



PFam analysis predicts that the NOV 103a protein contains the domains shown in 
the Table 103F. 
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Table 103F. Domain Analysis of NO VI 03a 


Pfam Domain 


NO VI 03a Match 
Region 


Identities/ 
Similarities 
for the Matched 
Region 


Expect 
Value 


Somatomedin B: domain 1 1 150.. 189 
of 1 


14/47 (30%) 
25/47 (53%) 


7.6 


recA: domain 1 of 1 


621. .650 


8/30 (27%) 
22/30 (73%) 


8.1 


Ribosomal L10: domain 1 of 
1 


604.. 695 


20/109(18%) 
59/109 (54%) 


9.9 


Dishevelled: domain 1 of 1 


844..914 


19/74(26%) 
37/74 (50%) 


2.7 


Transposase 22: domain 1 of 
1 


1135. .1416 


71/376(19%) 
127/376 (34%) 




4.6 


Phe tPvNA-synt N: domain 1 
of 1 


2079..2152 




13/79 (16%) 
49/79 (62%) 


4.9 



Example 104. 

The NOV104 clone was analyzed, and the nucleotide and predicted polypeptide 
sequences are shown in Table 104A. 



Table 104A. NOV104 Sequence Analysis 




SEQIDNO:297 J 736 bp 


NOV104a, 

CG57460-01 DNA Sequence 


AAAGCACCCGAGATGACCCCGGCTCCTCCACCAGGAGCGCGGCCGGGCGCGGCGTCCC 
TAGCGGGCTTCGCCGGGGTGGCGTCTCTGGGGCCTGGGGACCCCCGCCGCGCCGCTGA 
CCCGCGCCCTCTGCCCCCAGCGCTGTGCTTCGCCGTGAGCCGCTCGCTGCTGCTGACG 
TGCCTGGTGCCGGCCGCGCTGCTGGGCCTGCGCTACTACTACAGCCGCAAGGTGATCC 
GCGCCTACCTGGAGTGCGCGCTGCACACGGACATGGCGGACATCGAGCAGTACTACAT 

GCCGTGCTGGATGGCAACGTGGTGGGCATTGTGGCTGCACGGGCCCACGAGGAGGACA 
ACACGGTGGAGCTGCTGCGGATGTCTGTGGACTCACGTTTCCGAGGCAAGGGCATCGC 
CAAGGCGCTGGGCCGGAAGGTGCTGGAGTTCGCCGTGGTGCACAACTACTCCGCGGTG 
GTGCTGGGCACGACGGCCGTCAAGGTGGCCGCCCACAAGCTCTACGAGTCGCTGGGCT 
TCAGACACATGGGCGCCAGTGACCACTACGTGCTGCCGGGCATGACCCTCTCGCTGGC 
TGAGCGCCTCTTCTTCCAGGTCCGCTACCACCGCTACCGCCTGCAGCTGCGCGAGGAG 
TGA 1 




ORF Start: ATG at 13 :ORF Stop: TGA at 697 




SEQ ID NO: 298 (228 aa ]MW at 24767.5kD 


NOV104a, 

CG57460-01 Protein Sequence 


MTPAPPPGARPGAASLAGFAGVASLGPGDPRRAADPRPLPPALCFAVSRSLLLTCLVP 
AALLGLRYYYSRKVIRAYLECALHTDMADIEQYYMKPPGVSLTALSPAGSCFWVAVLD 
GNWGIVAARAHEEDNTVELLRMSVDSRFRGKGIAKALGRKVLEFAWHNYSAWLGT 
AHKLiESLGI hSDl I 1TLSLAERLFFQ E U 3 
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Further analysis of the NOV104a protein yielded the following properties shown in 
Table 104B. 



Table 104B. Protein Sequence Properties NOV104a 


PSort 
analysis: 


0.6400 probability located in plasma membrane; 0.4600 probability located in 
Golgi body; 0.3700 probability located in endoplasmic reticulum (membrane); 
0.1000 probability located in endoplasmic reticulum (lumen) 


SignalP 
analysis: 


Likely cleavage site between residues 64 and 65 



A search of the NOV 104a protein against the Geneseq database, a proprietary 
database that contains sequences published in patents and patent publication, yielded 
several homologous proteins shown in Table 104C. 



Table 104C. Geneseq Results for NO VI 04a 


Geneseq 
Identifier 


Protein/Organism/Length 
[Patent #, Date] 


NOV104a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Region 


Expect 
Value 


AAB 19986 


Human camello 3 (Hcml3) protein 
(partial) - Homo sapiens, 144 aa. 
[WO200077024-A1, 21-DEC- 
2000] 


42.. 195 
1..144 


144/154 (93%) 
144/154 (93%) 


7e-76 


AAB 19985 


Human camello 2 (Hcml2) protein 
- Homo sapiens, 227 aa. 
[WO200077024-A1, 21-DEC- 
2000] 




47..200 
56..203 


63/158(39%) 
92/158 (57%) 


le-21 


AAB 19984 


Human camello 1 (Hcmll) protein 
- Homo sapiens, 227 aa. 
[WO200077024-A1, 21-DEC- 
20001 


41. .196 
50..199 


60/160 (37%) 
88/160 (54%) 


7e-20 


AAY57959 


Human TSC501 protein SEQ ID 
NO:l - Homo sapiens, 227 aa. 
[JP1 1332579-A, 07-DEC-1999] 


n.,196 

50.. 199 


59/160 (36%) 
87/160 (53%) 


4e-19 


AAB 19987 


Mouse camello 1 (Mcmll) protein 
- Mus sp, 222 aa. [WO200077024- 
A1.21-DEC-2000] 


41. .194 
50.. 197 


63/158(39%) j le-18 
87/158 (54%) J 



In a BLAST search of public sequence databases, the NOV 104a protein was found 
to have homology to the proteins shown in the BLASTP data in Table 104D. 



412 





Table 1041). Public BLASTP Results for NOV 104a 




Protein 
Accession 
Number 


Protein/Organism/Length 


NOVKMa 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Portion 


Expect 
Value 


Q9UHF3 


PUTATIVE N- 
ACETYLTRANSFERASE 
CAMELLO 2 - Homo sapiens 
(Human), 227 aa. 


47..200 
56..203 


63/158 (39%) 
92/158 (57%) 


5e-21 


Q9UHE5 


PUTATIVE N- 

ACETYLTRANSFERASE CML1 
- Homo sapiens (Human), 227 aa. 


41..196 
50..199 


60/160(37%) I 
88/160(54%) 


3e-19 


Q9UQ17 


GLA PROTEIN - Homo sapiens 
(Human), 227 aa. 


41 196 
50 .199 


60/160 (37%) 
88/160(54%) 1 


3e-19 


Q96QI8 


KIDNEY-AND LIVER-SPECIFIC 
GENE - Homo sapiens (Human), 
227 aa. 


41..196 
50.. 199 


59/160 (36%) 
87/160(53%) ! 


le-18 


075839 


TSC501 PROTEIN - Homo 
sapiens (Human), 227 aa. 


41. 196 
50..199 


59/160 (36%) 
87/160 (53%) 


le-18 



PFam analysis predicts that the NOV 104a protein contains the domains shown in 
the Table 104E. 



Table 104E. Domain Analysis of NOV104a 


Pfam Domain 


NOVKMa Match 
Region 


Identities/ 
Similarities 
for the Matched 
Region 


Expect 
Value 


Acetyltransf: domain 1 of 

1 


1 1 1.-191 


28/82 (34%) 
64/82 (78%) 


2.2e-17 

, 



Example 105. 

The NOV105 clone was analyzed, and the nucleotide and predicted polypeptide 
sequences are shown in Table 1 05A. 



Table 105A. NOV105 Sequence Analysis 




SEQIDNO:299 1 1230 bp 


NOV105a, 

CG57464-01 DNA Sequence 


CTTCCCGGCGGCTGCGCGATGGACAGCCCCGAGGTGACCTTCACTCTCGCCTATCTGG 
TGTTCGCCGTGTGCTTCGTGTTCACGCCCAACGAGTTCCACGCGGCGGGGCTCACGGT 
GCAGAACCTGCTGTCGGGCTGGCTGGGCAGCGAGGACGCCGCCTTCGTGCCCTTCCAC 
TTGCGCCGCACGGCCGCCACGCTGTTGTGCCACTCGCTGCTGCCGCTCGGTGAGGCTG 
CTCGGGCCGGCCGGCCGCATCCTCTCCTGCGCAGGGCTTGCTGGGAGGTCAGGAGGAG 
GCCTCCGCCAGCTCCCCGAGGCCCCGAAAGCGCCTGGGCGCAGCTGGGGAGAGGCGCC 
GGTCCTCATCCAGAGGGACCGCGGCGTGGGCTGAGCGCGCTTAGGGGTGCCGCCGGCC 
TGGCCTGGCGGCTCTTCCTGCTGCTGGCCGTGACCCTCCCCTCCATCGCCTGCATCCT 
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GATCTACTACTGGTCCCGTGACCGGTGGGCCTGCCACCCACTGGCGCGCACCCTGGCC 

TCCGGCGGATTGACAAGTTTGCCACCGGTGCACCAGGTGCCCGTGTGATTGTGACAGA 
CACGTGGGTGATGAAGGTAACCACCTACCGAGTGCACGTGGCCCAGCAGCAGGACGTG 
CACCTGACTGTGACGGAGTCTCGGCAGCATGAGCTCTCGCCAGACTCGAACTTGCCCG 
TGCAGCTCCTCACCATCCGTGTGGCCAGCACCAACCCTGCTGTGCAGGCCTTTGACAT 
CAGGCTGAACTCCACTGAGTACGGGGAGCTCTGCGAGAAGCTCCGGGCACCCATCCGC 
AGGGCAGCCCATGTGGTCATCCACCAGAGCCTGGGCGACCTGTTCCTGGAGACATTTG 
CCTCCCTGGTAGAGGTCAACCCGGCCTACTCAGTGCCCAGCAGCCAGGTGGGGGGCCT 
GGAGGCCTGCATAGGCTGCATGCAGACACGTGCCAGCGTGAAGCTGGTGAAGACCTGC 
CAGGAGGCAGCCACAGGCGAGTGCCAGCAGTGTTACTGCCGCCCCATGTGGTGCCTCA 
CCTGCATGGGCAAGTGGTTCGCCAGCCGCCAGGACCCCCTGCGCCCTGACACCTGGCT 
GGCCAGCCGCGTGCCCTGCCCCACCTGCCGCGCACGCTTCTGCATCCTGGATGTGTGC 
ACCGTGCGCTGA 




ORF Start: ATG at 19 


ORF Stop: TGA at 1228 




SEQ ID NO: 300 


403 aa jMW at 44585.0kD 


NOV105a, 

CG57464-01 Protein Sequence 


MDSPEVTFTLAYLVFAVCFVFTPNEFHAAGLTVQNLLSGWLGSEDAAFVPFHLRRTAA 
TLLCHSLLPI/3EAARAGRPHPLLRRACWEVRRRPPPAPRGPESAWAQLGRGAGPHPEG 
PRRGLSALRGAAGLAWRLFLLLAVTLPSIACILI YYWSRDRWACHPLARTLALYALPQ 
SGWQAVASSVNTEFRRIDKFATGAPGARVIVTDTWVMKVTTYRVHVAQQQDVHLTVTE 
SRQHELSPDSNLPVQLLTIRVASTNPAVQAFDIRLNSTEYGELCEKLRAPIRRAAHW 
IHQSLGDLFLETFASLVEVNPAYSVPSSQVGGLEACIGCMQTRASVKLVKTCQEAATG 
ECQQCYCRPMWCLTCMGKWFASRQDELF^r S-TU r n I 



Further analysis of the NOV 105a protein yielded the following properties shown in 
Table 105B. 



Table 105R. Protein Sequence Properties NOV105a 


PSort 
analysis: 


0.6760 probability located in plasma membrane; 0.1000 probability located in 
endoplasmic reticulum (membrane); 0.1000 probability located in 
endoplasmic reticulum (lumen); 0.1000 probability located in outside 


SignalP 
analysis: 


Likely cleavage site between residues 29 and 30 



A search of the NOV 105a protein against the Geneseq database, a proprietary 
database that contains sequences published in patents and patent publication, yielded 
several homologous proteins shown in Table 105C. 



Table I05C. Geneseq Results for NOV 105a 


I 

Geneseq 
I Identifier 

! 


Protein/Organism/Length 
[Patent #, Date] 


NO VI 05a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Region 


Expect 
Value 


1 AAG81377 


Human AFP protein sequence SEQ 
ID NO:272 - Homo sapiens, 362 
aa. [WO200129221-A2, 26-APR- 
2001] 


1..403 
1..362 


344/409 (84%) 
345/409 (84%) 


0.0 
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In a BLAST search of public sequence databases, the NOV 105a protein was found 
to have homology to the proteins shown in the BLASTP data in Table 105D. 



Table 105D. Public BLASTP Results for NO VI 05a 


Protein 
Accession 
Number 


Protein/Organism/Length 


NOV105a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Portion 


Expect 
Value 


CAC38627 


SEQUENCE 271 FROM 
PATENT WOO 129221 - Homo 
sapiens (Human), 362 aa. 


1..403 
1..362 


345/409 (84%) 
346/409 (84%) 


0.0 


Q9DCF3 


0610039G24RIK PROTEIN - 
Mus musculus (Mouse), 362 aa. 


1..403 
1..362 


311/403 (77%) 
328/403 (81%) 


e-176 


Q96GP5 


SIMILAR TO RIKEN CDNA 
0610039G24 GENE - Homo 
sapiens (Human), 232 aa. 


1..265 
1..226 


211/271 (77%) 
212/271 (77%) 


e-109 


Q9VN16 


CGI 4646 PROTEIN - Drosophila 
melanogaster (Fruit fly), 409 aa. 


1..399 
1..383 


123/409 (30%) 
202/409 (49%) 


le-55 


Q95TM4 


LD3981 IP -Drosophila 
melanogaster (Fruit fly), 393 aa. 


20..399 
4..367 


117/390 (30%) 
192/390 (49%) 


le-51 



PFam analysis predicts that the NOV105a protein contains the domains shown in 
the Table 105E. 



Table 105E. Domain Analysis of NOV105a 




Identities/ 1 


Pfam Domain 


NOV105a Match Region 1 Similarities 8 Expect Value 




1 for the Matched Region | 


No Significant Matches Found 


Example 106. 



The NOV 106 clone was analyzed, and the nucleotide and predicted polypeptide 
sequences are shown in Table 106A. 



Table 106 A. NOV 106 Sequence Analysis 




SEQIDNO:301 1 1136 bp 


NOV! Una. 

CG5 7466-01 DNA Sequence 


TTTCTGCAATGGGAGCCTCCGCCACCCACCCTGGAGCCACAGAAGGCCCAGAAGCCAA 
ATGGACAGCTGGTGAACCCCAACAACTTCTGGAAGAACCCGAAAGATGTGTGCGCCCA 
CGCCCATGGCCTCTCAGGGCCCAGGCCTGGGACGTGACCACCACTAACTGCTCAGCCA 
ATATCAACTTGACCCACCAGCCCTGGTTCCAGGTCCTGGAGCCGCAGTTCCGGCAGTT 
TCTCTTCTACCGCCACTGCCGCTACTTCCCCATGCTGCTGAACCACCCGGAGAAGTGC 
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CGTGCGCACCCTCTTCCTGCTGGGCACGGCCTCCAAGCAGGAGSAGCGCACGCACTAC 
CAGCAGCTGCTGGCCTACGAAGACGCCCTCTACGGCGACATCCTGCAGTGGGGCTTTC 
TCGACACCTTCTTCAACCTGACCCTCAAGGAGATCCACTTCCTCAAGTGGCTGGACAT 
CTACTGCCCCCACGTCCCCTTCATTTTCAAAGGCGACGATGACGTCTTCGTCAACCCC 
ACCAACCTGCTAGAATTTCTGGCTGACCGGCAGCCACAGGAAAACCTGTTCGTGGGCG 
ATGTCCTGCAGCACGCTCGGCCCATTCGCAGGAAAGACAACAAATACTACATCCCGGG 
GGCCCTGTACGGCAAGGCCAGCTATCCGCCGTATGCAGGCGGCGGTGGCTTCCTCATG 
GCCGGCAGCCTGGCCCGGCGCCTGCACCATGCCTGCGACACCCTGGAGCTCTACCCGA 
TCGACGACGTCTTTCTGGGCATGTGCCTGGAGGTGCTGGGCGTGCAGCCCACGGCCCA 
CGAGGGCTTCAAGACTTTCGGCATCTCCCGGAACCGCAACAGCCGCATGAACAAGGAG 
CCGTGCTTTTTCCGCGCCATGCTCGTGGTGCACAAGCTGCTGCCCCCTGAGCTGCTCG 
CCATGTGGGGGCTGGTGCACAGCAATCTCACCTGCTCCCGCAAGCTCCAGGTGCTCTG 
ACCCCAGCCGGGCTACTAGGACAGGCCAGGGCAC 




ORF Start: ATG at 9 


ORF Stop: TGA at 1101 




SEQ ID NO: 302 


364 aa |mW at 41853. 8kD 


NOV106a, 

CG57466-01 Protein Sequence 


MGASATHPGATEGPEAKWTAGEPQQLLEEPERCVRPRPWPLRAQAWDVTTTNCSANIN 
LTHQPWFQVLEPQFRQFLFYRHCRYFPMLLNHPEKCRGDVYLLVWKSVITQHDRREA 
IRQTWARAAVRGWGPSAVRTLFLLGTASKQEERTHYQQLLAYEDALYGDILQWGFLDT 
FFNLTLKEIHFLKWLDIYCPHVPFIFKGDDDVFVNPTNLLEFLADRQPQENLFVGDVL 
QHARP I RRKDNKYY I PGALYGKASYP PYAGGGGFLMAGSLARRLHHACDT LELYPIDD 
VFLGMCLEVLGVQPTAHEGFKTFGISRNRNSRMNKEPCFFRAMLWHKLLPPELLAMW 
GLVHSNLTCSRKLQVL 



Further analysis of the NOV106a protein yielded the following properties shown in 
Table 106B. 



Tabic 106B. Protein Sequence Properties NOVI06a 


PSort 

analysis: 


0.6400 probability located in microbody (peroxisome); 0.4500 probability 
located in cytoplasm; 0.3122 probability located in lysosome (lumen); 0.1000 
probability located in mitochondrial matrix space 


SignalP 
analysis: 


No Known Signal Sequence Predicted 



A search of the NOV 106a protein against the Geneseq database, a proprietary 
database that contains sequences published in patents and patent publication, yielded 
several homologous proteins shown in Table 106C. 



Table 106C. Geneseq Results for NO VI 06a 


Geneseq 
Identifier 


Protein/Organism/Length [Patent 
#, Date] 


NOV106a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Region 


Expect 
Value 


AAB24035 


Human PR04397 protein sequence 
SEQ ID NO:42 - Homo sapiens, 
402 aa. [WO200053750-A1, 14- 
SEP-2000] 


72..352 
84..380 


149/300(49%) 
191/300 (63%) 


4e-76 


AAU29167 


Human PRO polypeptide sequence 
#144 - Homo sapiens, 372 aa. 
[WO200168848-A2, 20-SEP-2001] 


26..363 
27..371 


149/348 (42%) 
207/348 (58%) 


9e-76 
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AAB88404 


Human membrane or secretory 
protein clone PSEC01 59 - Homo 
sapiens, 372 aa. [EP1067182-A2, 
10-JAN-2001] 


26. .363 
27..371 


149/348 (42%) 
207/348 (58%) 


9e-76 


fy/AD^y 1 JU 


Human beta 1,3-N- 
acetylglucosamine transferase 
protein G4 - Homo sapiens, 372 aa. 
[WO200100848-A1, 04-JAN-2001] 


26 363 
27..371 


207/348 (58%) 




AAB49749 


Human beta 1,3-N- 
acetylglucosamine transferase 
protein G4 - Homo sapiens, 372 aa. 
[WO200100848-A1, 04-JAN-2001] 


26.363 
27..371 


149/348 (42%) 
207/348 (58%) 


9e-76 



In a BLAST search of public sequence databases, the NOV 106a protein was found 
to have homology to the proteins shown in the BLASTP data in Table 1 06D. 



Table 106D. Public BLASTP Results for NOV106a 



Protein 

Accession 
Number 

] 


Protein/Organism/Length 


NOV106a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities 
for the 
Matched 
Portion 


Expect 
Value 


AAL32295 


BETA-3-GAI \CTOSYI TRANSFERASE - 
Brachydanio rerio (Zebrafish) (Zebra danio), 
418 aa. 


46.364 
101. .417 


199/319 
(62%) 

249/319 
(77%) 


e-121 


AAL32297 


BETA-3-GALACTOSYLTRANSFERASE - 
Brachydanio rerio (Zebrafish) (Zebra danio), 
412 aa. 


29.360 
82..409 


180/337 
(53%) 

244/337 
(71%) 


e-104 


Q96EK0 


UNKNOWN (PROTEIN FOR MGC:20513) - 
Homo sapiens (Human), 377 aa. 


60.352 
46.355 


152/313 
(48%) 

198/313 
(62%) 


9e-76 


CAC39768 


SEQUENCE 175 FROM PATENT 
EP1067182 - Homo sapiens (Human), 372 aa. 


26.363 
27.371 


149/348 
(42%) 

207/348 
(58%) 


3e-75 


Q9C0J2 


BETA-13-N- | 26.363 
ACETYLGLUCOSAMINYLTRANSFERASE J 27.371 
BGNT-3 - Homo sapiens (Human), 372 aa. | 


149/348 
(42%) 

207/348 
(58%) 


3e-75 
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PFam analysis predicts that the NOV 106a protein contains the domains shown in 
the Table 106E. 



Table 106E. Domain Analysis of NOV106a 


Pfam Domain 


NOV106a Match 
Region 


Identities/ 
Similarities 
for the Matched 
Region 


Expect 
Value 


PI3 P14 kinase: domain 1 
of 1 


195..205 


8/12 (67%) 
10/12(83%) 


8.5 


Galactosyl_T: domain 1 of 


112..308 

„ — _ — „ — _ 


69/212 (33%) 
148/212 (70%) 


7.7e-45 

,,„,.,„,,,,„„„„,,„„,„„ 



Example 107. 

The NOV107 clone was analyzed, and the nucleotide and predicted polypeptide 
sequences are shown in Table 1 07A. 



Table 107 A. NOVT07 Sequence Analysis 




SEQIDNO:303 |4091 bp 


NOV 107a, 

CG57468-01 DNA Sequence 


T . vrs r i - • 

CGAGCGCGGAGGGCGACTTTGAACTGGGCATCAGCAGCAAACAAAAAAGGAAAAAAAC 
GAAGACAGTGAAAATGATTGGAGTATTAACATTGTTTCGATACTCCGATTGGCAGGAT 
AAATTGTTTATGTCGCTGGGTACCATCATGGCCATAGCTCACGGATCAGGTCTCCCCC 
TCATGATGATAGTATTTGGAGAGATGACTGACAAATTTGTTGATACTGCAGGAAACTT 
CTCCTTTCCAGTGAACTTTTCCTTGTCGCTGCTAAATCCAGGCAAAATTCTGGAAGAA 




CCTATATACAAGTTTCATTTTGGACTTTGGCAGCTGGTCGACAGATCAGGAAAATTAG 
GCAGAAGTTTTTTCATGCTATTCTACGACAGGAAATAGGATGGTTTGACATCAATGAC 
ACCACTGAACTCAATACGCGGCTAACAGATGACATCTCCAAAATCAGTGAAGGAATTG 
GTGACAAGGTTGGAATGTTCTTTCAAGCAGTAGCCACGTTTTTTGCAGGATTCATAGT 
GGGATTCATCAGAGGATGGAAGCTCACCCTTGTGATAATGGCCATCAGCCCTATTCTA 
GGACTCTCTGCAGCCGTTTGGGCAAAGATACTCTCGGCATTTAGTGACAAAGAACTAG 

GATAGCTTTCGGGGGCCAGAACAAAGAGCTGGAAAGGTATCAGAAACATTTAGAAAAT 
GCCAAAGAGATTGGAATTAAAAAAGCTATTTCAGCAAACATTTCCATGGGTATTGCCT 

ATCAAAAGAATATACTATTGGAAATGCAATGACAGTTTTTTTTTCAATCCTAATTGGA 
GCTATGGCCATCGGAGAAACGCTCGTTTTGGCTCCTGAATATTCCAAAGCCAAATCGG 
GGGCTGCGCATCTGTTTGCCTTGTTGGAAAAGAAACCAAATATAGACAGCCGCAGTCA 
AGAAGGGAAAAAGCCAGTAAGCGACACATGTGAAGGGAATTTAGAGTTTCGAGAAGTC 
TCTTTCTTCTATCCATGTCGCCCAGATGTTTTCATCCTCCGTGGCTTATCCCTCAGTA 
TTGAGCGAGGAAAGACAGTAGCATTTGTGGGGAGCAGCGGCTGTGGGAAAAGCACTTC 
TGTTCAACTTCTGCAGAGACTTTATGACCCCGTGCAAGGACAAGTGGATGGTGTGGAT 
GCAAAAGAATTGAATGTACAGTGGCTCCGTTCCCAAATAGCAATCGTTCCTCAAGAGC 
CTGTGCTCTTCAACTGCAGCATTGCTGAGAACATCGCCTATGGTGACAACAGCCGTGT 
GGTGCCATTAGATGAGATCAAAGAAGCCGCAAATGCAGCAAATATCCATTCTTTTATT 
GAAGGTCTCCCTGAGAAATACAACACACAAGTTGGACTGAAAGGAGCACAGCTTTCTG 
GCGGCCAGAAACAAAGACTAGCTATTGCAAGGGCTCTTCTCCAAAAACCCAAAATTTT 
ATTGTTGGATGAGGCCACTTCAGCCCTCGATAATGACAGTGAGTGGCAGGTGGTTCAG 
CATGCCCTTGATAAAGCCAGGACGGGAAGGACATGCCTAGTGGTCACTCACAGGCTCT 
CTGCAATTCAGAACGCAGATTTGATAGTGGTTCTGCACAATGGAAAGATAAAGGAACA 
AGGAACTCATCAAGAGCTCCTGAGAAATCGAGACATATATTTTAAGTTAGTGAATGCA 
CAGTCAGCGAGCAAAGGTCGGACTACAATCGTGGTAGCACACCGACTTTCTACTATTC 
GAAGTGCAGATTTGATTGTGACCCTAAAGGATGGAATGCTGGCGGAGAAAGGAGCACA 
TGCTGAACTAATGGCAAAACGAGGTCTATATTATTCACTTGTGATGTCACAGGTAATG 
CTTATGGGGACTCTTTCAGACTGTGGTAATAGTCTTCCTGAAGTCTCTCTATTAAAAA 
TTTTAAAGTTAAACAAGCCTGAATGGCCTTTTGTGGTTCTGGGGACATTGGCTTCTGT 
TCTAAATGGAACTGTTCATCCAGTATTTTCCATCATCTTTGCAAAAATTATAACCGTA 
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TGTTTTTCCTCAAACAAGGTTTCAGCGTAGATTTTTGTTTGTTTGCTTTTCAGGGATT 
ATTTTACGGCAGAGCAGGGGAAATTTTAACGATGAGATTAAGACACTTGGCCTTCAAA 
GCCATGTTATATCAGGATATTGCCTGGTTTGATGAAAAGGAAAACAGCACAGGAGGCT 
TGACAACAATATTAGCCATAGATATAGCACAAATTCAAGGAGCAACAGGTTCCAGGAT 
TGGCGTCTTAACACAAAATGCAACTAACATGGGACTTTCAGTTATCATTTCCTTTATA 
TATGGATGGGAGATGACATTCCTGATTCTGAGTATTGCTCCAGTACTTGCCGTGACAG 
GAATGATTGAAACCGCAGCAATGACTGGATTTGCCAACAAAGATAAGCAAGAACTTAA 
GCATGCTGGAAAGGTAAAGATAGCAACTGAAGCTTTGGAGAATATACGTACTATAGTG 
TCATTAACAAGGGAAAAAGCCTTCGAGCAAATGTATGAAGAGATGCTTCAGACTCAAC 
ACAGGAGAAATACCTCGAAGAAAGCACAGATTATTGGAAGCTGTTATGCATTCAGCCA 
TGCCTTTATATATTTTGCCTATGCGGCAGGGTTTCGATTTGGAGCCTATTTAATTCAA 
GCTGGACGAATGTCAAATGCTTTATCTTTTGATAGAGTTTTTACTGCAATTGCATATG 
GAGCTATGGCCATCGGAGAAACGCTCGTTTTGGCTCCTGAATATTCCAAAGCCAAATC 
GGGGGCTGCGCATCTGTTTGCCTTGTTGGAAAAGAAACCAAATATAGACAGCCGCAGT 
CAAGAAGGGAAAAAGCCACTTTCACAGGACACATGTGAAGGGAATTTAGAGTTTCGAG 
AAGTCTCTTTCTTCTATCCATGTCGCCCAGATGTTTTCATCCTCCGTGGCTTATCCCT 

ACTTCTGTTCAACTTCTGCAGAGACTTTATGACCCCGTGCAAGGACAACAGCTGTTTG 
ATGGTGTGGATGCAAAAGAATTGAATGTACAGTGGCTCCGTTCCCAAATAGCAATCGT 
TCCTCAAGAGCCTGTGCTCTTCAACTGCAGCATTGCTGAGAACATCGCCTATGGTGAC 
AACAGCCGTGTGGTGCCATTAGATGAGATCAAAGAAGCCGCAAATGCAGCAAATATCC 
ATTCTTTTATTGAAGGTCTCCCTAAATACAACACACAAGTTGGACTGAAAGGAGCACA 
GCTTTCTGGCGGCCAGAAACAAAGACTAGCTATTGCAAGGGCTCTTCTCCAAAAACCC 
AAAATTTTATTGTTGGATGAGGCCACTTCAGCCCTCGATAATGACAGTGAGAAGGTAC 
AGGTGGTTCAGCATGCCCTTGATAAAGCCAGGACGGGAAGGACATGCCTAGTGGTCAC 
TCACAGGCTCTCTGCAATTCAGAACGCAGATTTGATAGTGGTTCTGCACAATGGAAAG 
ATAAAGGAACAAGGAACTCATCAAGAGCTCCTGAGAAATCGAGACATATATTTTAAGT 
TAGTGAATGCACAGTCAGCGAGCAAAGGTCGGACTACAATCGTGGTAGCACACCGACT 
TTCTACTATTCGAAGTGCAGATTTGATTGTGACCCTAAAGGATGGAATGCTGGCGGAG 
AAAGGAGCACATGCTGAACTAATGGCAAAACGAGGTCTATATTATTCACTTGTGATGT 
CACAGGTAATGCTTATGTGACATAATGCTAT 




ORF Start: ATG at 
16 


ORF Stop: TGA at 4078 




SEQ ID N< >: 304 


1354 aa M\\ at 149167.3kD 


NOV107a, 
CG57468-01 Protein 
Sequence 


MDLEAAKNGTAWRPTSAEGDFELGISSKQKRKKTKTVKMIGVLTLFRYSDWQDKLFMS 
LGTIMAIAHGSGLPLMMIVFGEMTDKFVDTAGNFSFPVNFSLSLLNPGKILEEEMTRY 
AYYYSGLGAGVLVAAYI QVSFWTLAAGRQIRKI RQKFFHAI LRQE IGWFDINDTTELN 
TRLTDDISKISEGIGDKVGMFFQAVATFFAGFIVGFIRGWKLTLVIMAISPILGLSAA 
VWAKILSAFSDKELAAYAKAGAVAEEALGAIRTVIAFGGQNKELERYQKHLENAKEIG 
IKKAISANISMGIAFLLIYASYALAFWYGSTLVISKEYTIGNAMTVFFSILIGAMAIG 
ETLVLAPEYSKAKSGAAHLFALLEKKPNIDSRSQEGKKPVSDTCEGNLEFREVSFFYP 
CRPDVFILRGLSLSIERGKTVAFVGSSGCGKSTSVQLLQRLYDPVQGQVDGVDAKELN 
VQWLRSQIAIVPQEPVLFNCSIAENIAYGDNSRWPLDEIKEAANAANIHSFIEGLPE 
KYNTQVGLKGAQLSGGQKQRLAIARALLQKPKILLLDEATSALDNDSEWQWQHALDK 
ARTGRTCLWTHRLSAIQNADLIWLHNGKIKEQGTHQELLRNRDIYFKLVNAQSASK 
GRTTIWAHRLSTIRSADLIVTLKDGMLAEKGAHAELMAKRGLYYSLVMSQVMLMGTL 
SDCGNSLPEVSLIiK I LKLNKPEWPFWLGTLASVLNGTVHPVFS 1 1 FAKI I TVMFGNN 
DLLFFLKI FLYSFLLFFLKQGFSVDFCLFAFQGLFYGRAGE ILTMRLRHLAFKAMLYQ 
DIAWFDEKENSTGGLTTILAIDIAQIQGATGSRIGVLTQNATNMGLSVIISFIYGWEM 
TFLILSIAPVLAVTGMIETAAMTGFANKDKQELKHAGKVKIATEALENIRTIVSLTRE 
KAFEQMYEEMLQTQHRRNTSKKAQIIGSCYAFSHAFI YFAYAAGFRFGAYLIQAGRMS 
NALSFDRVFTAIAYGAMAIGETLVIAPEYSKAKSGAAHLFALLEKKPNIDSRSQEGKK 
PLSQDTCEGNLEFREVSFFYPCRPDVFILRGLSLSIERGKTVAFVGSSGCGKSTSVQL 
LQRLYDPVQGQQLFDGVXiAKELNVQWLRSQIAIVPQEPVLFNCSIAENIAYGDNSRW 
PLDEIKEAANAANIHSFIEGLPKYNTQVGLKGAQLSGGQKQRLAIARALLQKPKILLL 
DEATSALDNDSEKVQWQHALDKARTGRTCLWTHRLSAIQNADLIWLHNGKIKEQG 
THQELLRNRDIYFKLVNAQSASKGRTTIWAHRLSTIRSADLIVTLKDGMLAEKGAHA 
ELMAKRGLYYSLVMSQVMLM 



Further analysis of the NOV107a protein yielded the following properties shown in 
Table 107B. 



Table 107B. Protein Sequence Properties NOV107a 


PSort 
analysis: 


0.6000 probability located in plasma membrane; 0.4000 probability located in 
Golgi body; 0.3000 probability located in endoplasmic reticulum (membrane); 
0.3000 probability located in microbody (peroxisome) 
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SignalP j No Known Signal Sequence Predicted 
analysis: 



A search of the NOV 107a protein against the Geneseq database, a proprietary 
database that contains sequences published in patents and patent publication, yielded 
several homologous proteins shown in Table 107C. 



Table 107C. Geneseq Results for NOV107a 


Geneseq 
Identifier 


Protein/Organism/Length 
[Patent #, Date] 


NOV107a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Region 


Expect 
Value 


AAB81064 


Cynomologous monkey P- 
glycoprotein variant 1 - Macaca 
fascicularis, 1280 aa. 
[WO200123565-A1, 05-APR- 
2001] 


1..1299 
1..1278 


750/1312(57%) 
964/1312(73%) 


0.0 


AAB81065 


Cynomologous monkey P- 
glycoprotein variant 2 - Macaca 
fascicularis, 1283 aa. 
[WO200123565-A1, 05-APR- 
2001] 


1..1299 
1..1281 


749/1312(57%) 
967/1312(73%) 


0.0 


AAB81959 


Human MDR1 - Homo sapiens, 
1280 aa. [WO200121762-A2, 29- 
MAR-2001] 


1..1299 
1..1278 


749/1324 (56%) 
967/1324 (72%) 


0.0 


AAY58186 


Human wild-type multidrug 
resistance- 1 (MDR-1) protein - 
Homo sapiens, 1280 aa. 
[W09961589-A2, 02-DEC-1999] 


1..1299 
1..1278 


749/1324 (56%) 
967/1324 (72%) 


0.0 


AAW44073 


Human multidrug resistance P- 
glycoprotein MDR1 - Homo 
sapiens, 1280 aa. [WO9740160- 
A1,30-OCT-1997] 


1..1299 
1..1278 


749/1324 (56%) 
967/1324 (72%) 


0.0 



In a BLAST search of public sequence databases, the NOV 107a protein was found 
to have homology to the proteins shown in the BLASTP data in Table 1 07D. 
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Table 107D. Public BLASTP Results for NOV107a 


Protein 

Accession 
Number 


Protein/Organism/Length 


NOV107a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Portion 


Expect 
Value 


P23174 


Multidrug resistance protein 3 (P- 
glycoprotein 3) - Cricetulus 
griseus (Chinese hamster), 1281 
aa. 


1..1299 
1..1279 


818/1303 (62%) 
999/1303 (75%) 


0.0 


P21440 


Multidrug resistance protein 2 (P- 
glycoprotein 2) - Mus musculus 
(Mouse), 1276 aa. 


1..1299 
1..1274 


823/1306 (63%) 
998/1306 (76%) 


0.0 


Q08201 


Multidrug resistance protein 2 (P- 
glycoprotein 2) - Rattus 
norvegicus (Rat), 1278 aa. 


1-1299 
1..1276 


823/1309 (62%) 
999/1309 (75%) 


0.0 


CAC37764 


SEQUENCE 1 FROM PATENT 
WOO 123 565 - Macaca fascicularis 
(Crab eating macaque) 
(Cynomolgus monkey), 1280 aa. 


1..1299 
1..1278 


750 1312 (57%) 
964/1312(73%) 


0.0 


CAC37765 


SEQUENCE 3 FROM PATENT 
WO0123565 - Macaca fascicularis 
(Crab eating macaque) 
(Cynomolgus monkey), 1283 aa. 


1.-1299 
1-1281 


749/1312(57%) 
967/1312 (73%) 


0.0 



PFam analysis predicts that the NOV 107a protein contains the domains shown in 
the Table 107E. 



Table 107E. Domain Analysis of NOV107a 


Pfam Domain 


NOV107a Match 
Region 


Identities/ 
Similarities 
for the Matched 
Region 


Expect 
Value 


ABC membrane: domain 1 
of 2 


57..350 


115/301 (38%) 
252/301 (84%) 


3.3e-83 


MVIN: domain 1 of 1 


57..447 


70/531 (13%) 
263/531 (50%) 


5.8 


SAA_proteins: domain 1 of 
1 


51 8..524 


6/7 (86%) 
7/7(100%) 


3 


ABC tran: domain 1 of 2 


424..609 


76/199(38%) 
150/199(75%) 


3.1e-56 
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DsbD: domain 1 of 1 


722..926 


39/249 (16%) 
126/249 (51%) 


9.6 


ABC membrane: domain 2 
of 2 


722..1008 


80/297 (27%) 
222/297 (75%) 


2.2e-43 


ABC tran: domain 2 of 2 


1083..1270 


77/202 (38%) 
154/202 (76%) 


7.1e-54 


GidB: domain 1 of 1 


1170..1312 


29/202 (14%) 
97,202 (48%) 


6.6 



Example 108. 

The NOV108 clone was analyzed, and the nucleotide and predicted polypeptide 
sequences are shown in Table 108A. 



Table 108A. NOV108 Sequence Analysis 




SEQIDNO:305 (520 bp 


NOV108a, 

CG59609-01 DNA Sequence 


CCCCGTTCTATCAGCCATGGTCAACCCCACCAGGTTCTTAGACATCATCGTGGATGGT 
GAGCTCTTGGGACGTGTCTCCTTTGAGCTGTTTGCAGACAAGATTCCAAAGACAGCAG 
AAAATTTTTGTGCTCTAATCATTGGAGAGAAAGGATTTGGTTATAAAGGTTCCTACTT 
TCACAGAATTGTTCCTGGGTTTATGTGTCAGGGTGGTGACTTCACACAGCATAATGGC 
ACTGGTGGCAAGTCCATCTACGGGAAGAAATTTGATGATGAGAACTTCGTCCTAAATT 
ATACAGGTCCTGGCATCTTGTCCGTGGAGAATGCTGGACCCAACACAAATGGTTCCCA 
GTTTTTCATCTGCACTGCCATGTCTGAGTGGTTGGATGGCATGCAGGTGGTCTTTGGC 
AAGGGAAGGAAGGTGAGTATTGTGGAAGCCATGGAGTGCTTTGGGTCCACAAATGGCA 
AGACCAGCAAGAAGATCACCATTGCTGACTGTGGACAACTCTAATAGGTTTGACTT 




ORF Start: ATG at 17 


ORF Stop: TAA at 506 




SEQ ID NO: 306 


163 aa |mW at 17734. lkD 


NOV 108a, 

CG59609-01 Protein Sequence 


MVNPTRFLDI IVDGELLGRVSFELFADKIPKTAENFCALI IGEKGFGYKGSYFHRIVP 
GFMCQGGDFTQHNGTGGKSIYGKKFDDENFVLNYTGPGILSVENAGPNTNGSQFFICT 
AMSEWLDGMQWFGKGRKVSIVEAMECFGSTNGKTSKKITIADCGQL 



Further analysis of the NOV108a protein yielded the following properties shown in 
Table 108B. 



Table 108B. Protein Sequence Properties NOV108a 


PSort 
analysis: 


0.6400 probability located in microbody (peroxisome); 0.6000 probability 
located in plasma membrane; 0.4500 probability located in cytoplasm; 0.1000 
probability located in mitochondrial matrix space 


SignalP 
analysis: 


No Known Signal Sequence Predicted 



A search of the NOV 108a protein against the Geneseq database, a proprietary 
database that contains sequences published in patents and patent publication, yielded 
several homologous proteins shown in Table 108C. 
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Table 108C. Geneseq Results for NOV108a 



Geneseq 
Identifier 


Protein/Organism/Length [Patent 
#, Date] 


NOV108a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Region 


Expect 
Value 


AAU01195 


Human cyclophilin A protein - 
Homo sapiens, 165 aa. 
[WO200132876-A2, 10-MAY- 
2001] 


1..163 
1..164 


134/164(81%) 
147/164 (88%) 


2e-75 


AAW56028 


Calcineurin protein - Mammalia, 
165 aa. [WO9808956-A2, 05- 
MAR-1998] 


1..163 
1 ..164 


134/164 (81%) 
147/164 (88%) 


2e-75 


AAR13726 


Bovine cyclophilin - Bos taurus, 
163 aa. [US5047512-A, 10-SEP- 
1991] 


2..163 
1..163 


133 '163 (81%) 
146/163 (88%) 


5e-75 


AAG65275 


Haematopoietic stem cell 
proliferation agent related human 
protein #2 - Homo sapiens, 164 aa. 
[JP2001163798-A, 19-JUN-2001] 


2..163 
1..163 


133/163 (81%) 
146/163 (88%) 


9e-75 


AAP90431 


Cyclophilin - Homo sapiens 
(human), 164 aa. [EP326067-A, 02- 
AUG-1989J 


2..163 
1..163 


133/163 (81%) 
146/163 (88%) 


9e-75 



In a BLAST search of public sequence databases, the NOV 108a protein was found 
to have homology to the proteins shown in the BLASTP data in Table 1 08D. 



Table 108D. Public BLASTP Results for NOV108a 


Protein 
Accession 
Number 


Protein/Organism/Length 


NOV108a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Portion 


Expect 
Value 


CAC39529 


SEQUENCE 26 FROM PATENT 
WOO 132876 - Homo sapiens 
(Human), 165 aa. 


1..163 
1..164 


134/164 (81%) 
147/164 (88%) 


8e-75 


Q9BRU4 


PEPTTDYLPROLYL ISOMERASE 
A (CYCLOPHILIN A) - Homo 
sapiens (Human), 165 aa. 


1..163 
1..164 


134/164 (81%) 
146/164 (88%) 


2e-74 


P04374 


Peptidyl-prolyl cis-trans isomerase 
A (EC 5.2.1.8) (PPIase) (Rotamase) 
(Cyclophilin A) (Cyclosporin A- 
binding protein) - Bos taurus 
(Bovine), and, 1 63 aa. 


2..163 
1-163 


133/163 (81%) 
146/163 (88%) 


2e-74 
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P05092 


Peptidyl-prolyl cis-trans isomerase 
A (EC 5.2.1.8) (PPIase) (Rotamase) 
(Cyclophilin A) (Cyclosporin A- 
binding protein) - Homo sapiens 
(Human),, 1 64 aa. 


2..163 
1..163 


133/163 (81%) 
146/163 (88%) 


3e-74 


Q9TTC6 


CYCLOPHILIN 18 - Oryctolagus 
cuniculus (Rabbit), 1 64 aa. 


1..163 
1..164 


133/164 (81%) 
147/164 (89%) 


5e-74 



PFam analysis predicts that the NOV 108a protein contains the domains shown in 
the Table 108E. 



Table 108E. Domain Analys 


is ofNOV108a 


Pfam Domain 


NOV108a Match 
Region 


Identities/ 
Similarities 
for the Matched 
Region 


Expect 
Value 


pro isomerase: domain 1 
of 1 


5..163 


101/181 (56%) 
137/181 (76%) 


5.2e-79 



Example 109. 



The NOV109 clone was analyzed, and the nucleotide and predicted polypeptide 
sequences are shown in Table 109A. 



Table 109A. NOV109 Sequence Analysis 




SEQIDNO:307 1 887 bp 


NOV109a, 

CG59613-01 DNA Sequence 


GATATCATTTTTTATG^jCAGCCATr'iTTAA-jC'rrC'CAijAACCTATACCTTTAAAATSG 
TTAACAGATAAGCCAGTTTGGATAGAACAATGGCCACTGAGTAAAGAGAAACTGGAGG 

ACACAGCATGAAAGTATCACACATTTCCTTAGTGCAGCTAACCCTGTGTGACCAGGGC 
TTCAACACATACCACTGTGACCACAACCTAGCCATGAGCATGAGCCTCACCAGCATGT 
CCAAAATGCTAAAATACAACAATGGCAGTGAAGACATCACTACATGGAGGGCTGAAGG 
TACTATGGATCTCTTGGTGCTAGAATTTGAAGCACTAAATCAAGAGAACTTTGTGGAC 
TGTGAATTGAAGTTAATGACTCTAGATGTTGAGCAACTTGAAATTCCAGAACAAGAGT 
ACAGCTGTGTAATAAAGATGCATTCTAGTGAATTTGTTCATATATGCCAAGATCTCAG 

AATGGAGAACTTGGACATGGAAACATTGCCACAATTGCCCAAACAAGTAATTACAATA 
AAGAAGAGGAGGCTGTTGCCATAATGATGAATGGGCCAGTTCAGCTAACTTTTGCACT 
AAGTTACTTAAATTTCTTTATAACAGGCACTCCACTCTCTCAGATGCACCCCTTGCTG 
GAGAGTATAAGATTGCCGGATATGGAACATTTAAAGTATTATTTGGCTCCCAAAATTG 
AGGATGAAAAAGGATTTTAGAAATTCTTAGAATCCAAGAAAATAAAACTAAGCTCTTT 


G AAAATTG CT T CTG AG A 




ORE Start: ATG at 14IORF Stop: TAG at 830 




Si;o ID NO: 308 |272 aa |MW at 30831. lkD 


NOV109a, 

CG596 13-01 Protein Sequence 


MAAIVKPPEPIPLKWLTDKPVWIEQWPLSKEKLEALEDLVTEQFSIIIFQKVNLHSMK 
VSHISLVQLTLCDQGFNTYHCDHNLAMSMSLTSMSKMLKYNNGSEDITTWRAEGTMDL 
LVLEFEALNQENFVDCELKLMTLDVEQLE I PEQE YSCVI KMHSSEFVHI CQDLSH I GE 
SAIISCAKDGVNFSANGELGHGNIATIAQTSNYNKEEEAVAIMMNGPVQLTFALSYLN 
PFITGTPLSQMHPLLE IRLPDMEHLKY [i^xIEDSKGF 
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Further analysis of the NOV 109a protein yielded the following properties shown in 
Table 109B. 



Table 109B. Protein Sequence Properties NO VI 09a 


PSort 
analysis: 


0.6500 probability located in cytoplasm; 0.1000 probability located in 
mitochondrial matrix space; 0.1000 probability located in lysosome (lumen); 
0.0000 probability located in endoplasmic reticulum (membrane) 


SignalP 
analysis: 


Likely cleavage site between residues 19 and 20 



A search of the NOV 109a protein against the Geneseq database, a proprietary 
database that contains sequences published in patents and patent publication, yielded 
several homologous proteins shown in Table 109C. 



Table 109C. Geneseq Results tor NOV 109a 


; 

Geneseq 
Identifier 


Protein/Organism/Length [Patent 
#, Date] 


NOV109a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Region 


Expect 
Value 


AAY51639 


Human PCNA protein fragment - 
Homo sapiens, 261 aa. 
LWO200008164-A2, 17-FEB-2000] 


25. .271 
8..260 


158/255 (61%) 
184/255 (71%) 


8e-78 




Human PCNA protein - Homo 
sapiens, 261 aa. [DE 19840771 -A 1, 
10-FEB-2000] 


25..271 
8..260 


158/255 (61%) 
184/255 (71%) 


8e-78 


j AAB43712 


Human cancer associated protein 
sequence SEQ ID NO: 1 1 57 - Homo 
sapiens, 269 aa. [WO200055350- 
Al,21-SEP-2000] 


25 .271 
16..268 


158/255 (61%) 
184/255 (71%) 


8e-78 


AAG75139 


Human colon cancer antigen 
protein SEQ IDNO:5903 - Homo 
sapiens, 268 aa. [WO200 122920- 
A2, 05-APR-2001] 


25..269 
16..266 


1 57/253 (62%) 
182/253 (71%) 


5e-77 


AAW90758 


Human PCNA protein fragment #2 

- Homo sapiens, 236 aa. 

[DE 19840771 -A 1, 10-FEB-2000] 


39..268 
1..236 


149/238 (62%) 
171/238(71%) 


7e-73 



In a BLAST search of public sequence databases, the NOV 109a protein was found 
to have homology to the proteins shown in the BLASTP data in Table 109D. 
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Table 109D. Public BLASTP Results for NOV109a 


Protein 
Accession 
Number 


Protein/Organism/Length 


NO VI 09a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Portion 


Expect 
Value 


P12004 


Proliferating cell nuclear antigen 
(PCNA) (Cyclin) - Homo sapiens 
(Human), 261 aa. 


25..271 
8..260 


158/255 (61%) 
184/255 (71%) 


3e-77 


P04961 


Proliferating cell nuclear antigen 
(PCNA) (Cyclin) - Rattus norvegicus 
(Rat), 261 aa. 


25..271 
8. .260 


158/255 (61%) 
185/255 (71%) 


5e-77 


P57761 


Proliferating cell nuclear antigen 
(PCNA) - Cricetulus griseus 
(Chinese hamster), 261 aa. 


25..271 
8..260 


158/255 (61%) 
184/255 (71%) 


7e-77 

. . . 


Q91ZH2 


1 1 DAYS EMBRYO CDNA, 
RIKEN FULL-LENGTH 
ENRICHED LIBRARY, 
CLONE:2700095L20, FULL 
INSERT SEQUENCE - Mus 
musculus (Mouse), 26 1 aa. 

— ■ - - ■ — — — — — 


25..272 
8..261 


156/256 (60%) 
183/256(70%) 


le-75 


PI 79 18 


Proliferating cell nuclear antigen 
(PCNA) (Cyclin) - Mus musculus 
(Mouse), 261 aa. 


25..270 
8..259 


155/254 (61%) 
182/254 (71%) 


5e-75 



PFam analysis predicts that the NOV 109a protein contains the domains shown in 
the Table 109E. 



Table 109E. Domain Analysis of NOV ! 09a 


Pfam Domain 


NOV109a Match 
Region 


Identities/ 
Similarities 
for the Matched 
Region 


Expect 
Value 


PCNA: domain 1 of 1 


23..143 


46/128 (36%) 
83/128 (65%) 


2.3e-20 


PCNA C: domain 1 of 
1 


145..265 


59/131 (45%) 
98/131 (75%) 


1.6e-45 



Example 110. 



The NOV1 10 clone was analyzed, and the nucleotide and predicted polypeptide 
sequences are shown in Table 1 10A. 



426 



Table 110A. NOV110 Sequence Analysis 




SEQIDNO:309 [ 1233 bp 


NOVllOa, 

CG596 19-01 DNA Sequence 


TGGCAATGGAAGAAGAGATCCCCGCGCTCTTCATTGACAATGGCTCCGGCATGTGGAA 
AGCAGCTTTGCTGGGAGACAATGCCCTCCGAGCCATATTCCCCTCCATCATCGGGCAC 
CCCCGGCACCAGGGCGTGATGGTGGGCATGGGCCAGAAGGACTCCTACGTGGGCGACC 
AGGCCCAGAGCAAGTGCGGCATCCTGACCCTGAAGTACCCCATCAAGCATGGCATCGT 
CACAAACTGGGACGACATGGAGAAGATCTGGCACCATGTTTTCTACAACGAGCTGTGC 
GTGGCCCTGGAGGAGCAGGTGGTGCTGCTGACCGAGGCCCCGCTAAACCCCAGGGCCA 
ATAGGGAGAAGATGACTCAGATCATGTTTAAGACCTTCAACACCCAGGCCATGTACGT 
GGCCATTCAGGCCGTGCTGACCCTCCACAGCTCTGGTTGCACCACTGGCATTGTCATG 
GACTCTGGAGATGGGGTCACCCACACAGTGCCCATCTACGAGCGCCACACCCTCCCTC 
ACACCATCTTGCATCTGGACCTGGCTGGCCAGGACCTTACTGACTACCTCATGAAGAT 
CCCTACCTACCGCAGCTATAGCTTCAACACCATGGCCAAGTGGAAAATCGTGCGCAAC 
ATCAAGGAGAAGCTATGCTATGTCGCTCTGGACTTCGAGGAGGAGATGGCCACTGCTG 
CATCCTCCTCCTCCCTGGAGAAGAGCTACGAGCTGCCTGACAGCCAGGCCATCATTAT 
TAGCAATGAGCGGTTCCGGTGTCCGGAGGCACTGTTCCAGCCTTCCTTCCTGGGCATG 
GAATCCTGTGGCATCCATGAAAGTACCTTCAACTCCATCATGAAGTGTGATATGGACA 
TCCCCAAAGACCTGTACGCCAACACAGTGCTGTCTGGCGTCACCACCATGTACCCTGG 
CATCCCCAATAGGATGCAGAAGGAGATCACTGCCCTGGCATCCAGCACCATGAAGATC 
AAGATATCGTGCCCCATCGTGCCCCCAGAGTGCAAGTACTTTGTGTGGATCGGTGGCT 
CCATCCTGGCCTCACTGTCCACCTTCCAGCAGATGTGGATTAGCAAGCAGGAGTACAA 
CGAGTCGGGCCCCTCCATCATCCACCGCAAATGGACTGCGAGCAGATGCATAGCATTT 
GCTGCATGGGTTAATTCAGAAGTATAAATTTGCCCCTGGCAAATGCATATACCTCATG 


[ ;CC1 iCGAl 




ORF Start: ATG at oloRF Stop: TAA at 1 1 85 




SEQ ID NO: 310 (393 aa |MW at 44147.5kD 


NOVllOa, 

CG596 19-01 Protein Sequence 


MEEEIPALFIDNGSGMWKAALLGDNALRAIFPSIIGHPRHQGVMVGMGQKDSYVGDQA 
QS KCG ILTLKYP I KHGI VTNWDDMEK I WHHVFYNELCVALEEQWLIiTEAPLNPRANR 
EKMTQIMFKTFNTQAMYVAIQAVLTLHSSGCTTGIVMDSGDGVTHTVPIYERHTLPHT 
ILHLDLAGQDLTDYLMKIPTYRSYSFNTMAKWKIVRNIKEKLCYVALDFEEEMATAAS 
SSSLEKSYELPDSQAIIISNERFRCPEALFQPSFLGMESCGIHESTFNSIMKCDMDIP 
KDLYANTVLSGVTTMYPGIPNRMQKEITALASSTMKIKrSCPIVPPECKYFVWIGGSI 
LASLSTFQQMWISKQEYNESGPSIIHRKWTASRCIAFAAWVNSEV 



Further analysis of the NOV 1 10a protein yielded the following properties shown in 
Table HOB. 





Table 1 10B. Protein Sequence Properties NOVllOa 


PSort 
analysis: 


0.4500 probability located in cytoplasm; 0.1547 probability located in 
microbody (peroxisome); 0.1000 probability located in mitochondrial matrix 
space; 0.1000 probability located in lysosome (lumen) 


SignalP 
analysis: 


No Known Signal Sequence Predicted 



A search of the NOV 1 10a protein against the Geneseq database, a proprietary 
database that contains sequences published in patents and patent publication, yielded 
several homologous proteins shown in Table 1 IOC. 
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Table HOC. Geneseq Results for NOVHOa 


i Geneseq 
Identifier 


Protein/Organism/Length [Patent 
#, Date] 


NOVHOa 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Region 


Expect 
Value 


AAU32060 


Novel human secreted protein 
#2551 - Homo sapiens, 399 aa. 
[WO200179449-A2, 25-OCT-2001] 


1..376 
25..397 


3 1 5/376 (83%) 
336/376 (88%) 


e-180 


AAB43991 


Human cancer associated protein 
sequence SEQ ID NO: 1436 - Homo 
sapiens, 413 aa. [WO200055350- 
Al,21-SEP-2000] 


1..376 
39..411 


311/376 (82%) 
336/376 (88%) 


e-179 


AAP61532 


Sequence of beta-actin - Homo 
sapiens, 375 aa. [EP174608-A, 19- 
MAR-1986] 


1..376 
1..373 


311/376 (82%) 
335/376 (88%) 


e-179 


| AAB12985 


Human beta-actin protein sequence 
- Homo sapiens, 374 aa. 
[US6087398-A, ll-JUL-2000] 


2..376 
1..372 


310/375 (82%) 
334/375 (88%) 


e-178 


•■ AAR50328 

j 


Drug resistant structural protein - 
Homo sapiens, 375 aa. 
[JP06038773-A. 15-1 !■;[}- 1994] 


1..376 
I. .373 


309/376 (82%) 
335/376 (88%) 

,,,,,„,,„„,„ 


e-178 



In a BLAST search of public sequence databases, the NOV 1 10a protein was found 
to have homology to the proteins shown in the BLASTP data in Table 1 10D. 



Table HOD. Public BLASTP Results for NOVHOa 


Protein 
Accession 
Number 


Protein/O rgan ism/Length 


NOVHOa 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Portion 


Expect 
Value 


P02571 


Actin, cytoplasmic 2 (Gamma-actin) 
- Homo sapiens (Human).. 375 aa. 


1..376 
1..373 


315/376 (83%) 
336/376 (88%) 


e-179 


ATBOG 


actin gamma (tentative sequence) - 
bovine, 374 aa. 


2..376 
1..372 


314/375 (83%) 
335/375 (88%) 


e-179 


P53505 


Actin, cytoplasmic type 5 - Xenopus 
laevis (African clawed frog), 376 aa. 


2..376 
3.J74 


313/375 (83%) 
335/375 (88%) 


e-178 


P29751 


Actin, cytoplasmic 1 (Beta-actin) - 
Oryctolagus cuniculus (Rabbit), 375 
aa. 


1..376 
1..373 


311/376 (82%) 
337/376 (88%) 




e-178 



093400 


ACTIN, CYTOPLASMIC 1 


1..376 


311/376(82%) 


e-178 




(BETA-ACTIN) (CYTOPLASMIC 


1..373 


336/376 (88%) 






BETA ACTIN) - Xenopus laevis 










(African clawed frog), 375 aa. 









PFam analysis predicts that the NOV1 10a protein contains the domains shown in 
the Table 11 0E. 



Table 110E. Domain Analysis of NOVllOa 


Pfam Domain 


NOVllOa Match Region 


Identities/ 
Similarities 
for the Matched Region 


Expect Value 


actin: domain 1 of 1 


1..378 


284/382 (74%) 
336/382 (88%) 


2.2e-227 



Example 111. 

The NOV1 1 1 clone was analyzed, and the nucleotide and predicted polypeptide 
sequences are shown in Table 1 1 1 A. 



Table 111A. NOV111 Sequence Analysis 




SEQ ID NO: 311 


1197 bp 


NOVllla, 

CG59621-01 DNA Sequence 


AACCATGTCTAAGCGGGAGTCCTTTAACCTGGAAAGTTATGAATTGGACAAAAGCTTC 
TGGCTAACCAGATTCACTGAACTGAAGGGCACAGGTTGCAAAGTGCCCCAAGATGTCT 
TGCAAAAATTGCTGGAATCTTTACAGGAGAACCACTTCCAAGAAGATGAGCAGTTTCT 
GGGAGCCGTTATGCCAAGGCTTCGCATTGGAATGGATACTTGTGCCATTTCTTTGAGG 
CATGGTGGGCTTTCCTTGGTTCAAACCACAGATTACATTTACCCGATCGTAGACGACC 
CTTACATGATGGGCAGGATAGCATGTGCCAATGTCCTCAGTGACCTCTATGCAATGGG 
GGTCACAGAATGTGACAATATGCTGATGCTCCTTGGAGTCAGTAATAAAATGACCGAC 
AGGGAAAGGGATAAAGTGATGCCTCTGATTATCCAAAGTTTTAAAGATGCAGCTGAGG 
AAGCAGGAATGTCTGTAATGGTCAGCCAAACAGTACTAAATCCCTGGATTGTCCTGGG 
AGGAGTCACTACCACTGTCTTCCAGCCCAATGAATTTATCATGCCAGACAATGCAGTG 
CCAGGGGACGTGCTGGTGTTGACAAAACCCCTGGGGACACAGGTGGCAGTGGCTGTGC 
ACCAGTGGCTGGATATTCCTTTGAAATGGAATAAGATTAAGCTAGTGGTCACCGAAGA 
TGTAGAGCTGGCCAACCAGGAGGCGATGATGAACATGGTGAGGCTCAACAGGACAGCT 
GCAGGACTCATGCACACGTTCAATGCCCACATGGCCACTGACATCACGGGCTTCGGGA 
TTTTGGGCCACGTGCAGAACCTAGCCAAGCAGCAGAGGAACGAGGTGTCGTTTGTAAT 
TCACAACCTCCTGGTGCTGGCCAAGATGGCTGCGGTGAGCAAGGCCTGCGGAAACATG 
TTCAGCCTCATGCATGGGACCTGCCCGGAGACCTCAGGCGGCCTTCTGATCTGTTTAC 
CATGTCAGCAAGCAGCTCGGTTCTGTGCAGAGATAAAGTCCCCCAAATATAGTGAAGG 
CCACCAAGCATGGATTATTGGGATTGTAGAGAAGGGCAACCACACAGCCAGAATCATA 
GACAAACCCCAGATCATCAAGGTTGCACCACAAGTGGCCACTCAAAATGTGAATCTCA 
CTAA i 11 




ORF Start: ATG at 5 


ORF Stop: TAA at 1178 




SEQ ID NO: 312 


391 aa MW at 43 193.9kD 


NOVllla, 

CG59621-01 Protein Sequence 


MSKRESFNLESYELDKSFWLTRFTELKGTGCKVPQDVLQKLLESLQENHFQEDEQFLG 
AVMPRLRIGMDTCAISLRHGGLSLVQTTDYIYPIVDDPYMMGRIACANVLSDLYAMGV 
TECDNMLMLLGVSNKMTDRERDKVMPLIIQSFKDAAEEAGMSVMVSQTVLNPWIVLGG 
VTTTVFQPNEFIMPDNAVPGDVLVLTKPLGTQVAVAVHQWLDIPLKWNKIKLWTEDV 
ELANQEAMMNMVRLNRTAAGLMHTFNAHMATDITGFGILGHVQNLAKQQRNEVSFVIH 
NLLVLAKMAAVSKACGNMFSLMHGTCPETSGGLLICLPCQQAARFCAEIKSPKYSEGH 
1 _ t 11 T T £■« I , E^ATS 
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Further analysis of the NOV1 1 la protein yielded the following properties shown in 
Table 1 1 IB. 



Table 11 IB. Protein Sequence Properties NOVllla 


PSort 
analysis: 


0.8500 probability located in endoplasmic reticulum (membrane); 0.4400 
probability located in plasma membrane; 0.1000 probability located in 
mitochondrial inner membrane; 0.1000 probability located in Golgi body 


SignalP 
analysis: 


No Known Signal Sequence Predicted 



A search of the NOV1 1 la protein against the Geneseq database, a proprietary 
database that contains sequences published in patents and patent publication, yielded 
several homologous proteins shown in Table 1 1 1C. 



Table 1 11C. Geneseq Results for NOV11 la 


Geneseq 
Identifier 


Protein/Organism/Length [Patent 
#, Date] 


NOVllla 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Region 


Expect 
Value 


AAB58174 


Lung cancer associated polypeptide 
sequence SEQ ID 512 - Homo 
sapiens, 250 aa. [WO200055180- 
A2. 21-SEP-2000] 


166..391 
20..243 


168/227 (74%) 
189/227 (83%) 


2e-88 


AAO01161 


Human polypeptide SEQ ID NO 
15053 - Homo sapiens, 122 aa. 
[WO200164835-A2, 07-SEP-2001J 


147..264 
1..118 

42..99 
...58 


81/119(68%) 
92/119(77%) 


2e-36 


AAB53700 

- - 


Human colon cancer antigen protein 
sequence SEQ ID NO: 1240 - Homo 
sapiens, 106 aa. [WO200055351- 
A1.21-SEP-2000] 


53/58 (91%) 
54/58 (92%) 


le-24 



In a BLAST search of public sequence databases, the NOV1 1 la protein was found 
to have homology to the proteins shown in the BLASTP data in Table 1 1 ID. 
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T:il»lc 1111). Public BLASTP Results lot NOV1 1 la 


Protein 
Accession 
Number 


Protein/Organism/Length 


NOVllla 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Portion 


Expect 
Value 


Q9BVT4 \ 


SELENOPHOSPHATE 
SYNTHETASE , HUMAN 
SELENIUM DONOR PROTEIN - 
Homo sapiens (Human), 392 aa. 


1..391 
1..392 


364/392 (92%) 
367/392 (92%) 


0.0 1 


P49903 


Selenide,water dikinase 1 (EC 
2.7.9.3) (Selenophosphate 
synthetase 1) (Selenium donor 
protein 1) - Homo sapiens 
(Human), 383 aa. 


1..375 
1..376 


348/376 (92%) 
351/376 (92%) 


0.0 


AAC50958 


SELENOPHOSPHATE 

S Y M 1 HE 1 ASJb I - Homo sapiens 

(Human), 448 aa. 


2..391 
33. .441 


272/411 (66%) 
J1J/41 1 (75%) 


e-147 


Q9961 1 


Selenide, water dikinase 2 (EC 
2.7.9.3) (Selenophosphate 
synthetase 2) (Selenium donor 
protein 2) - Homo sapiens 
(Human), 448 aa. 


2..391 
33..441 


272 411 (66%) 
313/411 (75%) 


. -J 


AAC53024 


SELENOPHOSPHATE 
SYNTHETASE 2 - Mus musculus 
(Mouse), 452 aa. 


2..387 
36..441 


267/407 (65%) 
307/407 (74%) 


e-146 



PFam analysis predicts that the NOV1 1 1 a protein contains the domains shown in 
the Table 11 IE. 



Table 11 IE. Domain Analysis of NOVllla 



Pfam Domain 


NOVllla Match 
Region 


Identities/ 
Similarities 
for the Matched 
Region 


Expect 
Value 


AIRS: domain 1 of 1 


32..188 


29/180(16%) 
113/180 (63%) 


3e-18 


AIRS C: domain 1 of 
1 


191..367 


34/197(17%) 
125/197 (63%) 


l.le-20 



Example 1 12. 



The NOV1 12 clone was analyzed, and the nucleotide and predicted polypeptide 
sequences are shown in Table 1 12A. 
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Table 112 A. NOV112 Sequence Analysis 




SEQ ID NO: 313 |T544 bp 


NOV 112a, 

CG59625-01 DNA Sequence 


CGATGGGACACAGACAGGTCACCCCAGCTCTGATCTTTGCCATCACAGTTGCTACAAT 
CGGCTCTTTCCAGTTTGGCTACAACACTGGGGTCATCAATGCTCCTGAGACGGTGCAG 
ATCATAAAGGAATTTATCAATAAAACTTTGACGGACAAGGCAAATGCCCCTCCCTCTG 

GATCGGCTCCTTTTCCGTCGGACTCTTTGTTAACCGCTTTGGCAGGAGGCGCAATTCA 
ATGCTGATTGTCAACCTGTTGGCTGCCACTGGTGGCTGCCTTATGGGACTGTGTAAAA 
TAGCTGAGTCAGTTGAAATGCTGATCCTGGGCCGCTTGGTTATTGGCCTCTTCTGCGG 
ACTCTGCACAGGTTTTGTGCCCATGTACATTGGAGAGATCTCGCCTACTGCCCTGAGG 
GGTGCCTTTGGCACTCTCAACCAGCTGGGCATAGTTATTGGAATTCTGGTGGCCCAGG 
TAATCTTTGGTCTGGAACTCATCCTTGGGTCTGAAGAGCTATGGCCGGTGCTATTAGG 
CTTTACCATCCTTCCAGCTATCCTGCAAAGTGCAGCCCTTCCATGTTGCCCTGAAAGT 
CCCAGATTTTTGCTCATTAACAGAAAAAAAGAGGAGAATGCTACGCGGGTCCTCCAGC 
GGTTGTGGGGCACCCAGGATGTATCCCAAGACATCCAGGAGATGAAAGATGAGAGTGC 
AAGGATGTCACAAGAAAAGCAAGTCACCGTGCTGGAGCTCTTTAGAGTGTCCAGCTAC 
CGACAGCCCATCATCATTTCCATTGTGCTCCAGCTCTCTCAGCAGCTCTCTGGGATCA 
ATGCTGTGGTGTTCTATTACTCAACAGGAATCTTCAAGGATGCAGGTGTTCAACAGCC 
CATCTATGCCACCATCAGCGCGGGTGTGGTTAATACTATCTTCACTTTACTTTCTGTA 
GTAGCTCAGATGCTGTTTTCATGGAAAGGAAAACTGAAGTTTCATGTCATAACTGTTT 
CTTTGTTATTAAAGCTGGGTTACACTGTCTTTAAATTTAATCTTCTGTGTTCCTTCCT 
CTTACAGAATCACTATAATGGGATGAGCTTTGTCTGTATTGGGGCTATCTTGGTCTTT 
GTGGCCTGTTTTGAAATTGGACCAGGCCCCATTCCCTGGTTTATTGTGGCCGAACTCT 
TCAGCCAGGGCCCCCGCCCAGCTGCGATGGCAGTGGCCGGCTGCTCCAACTGGACCTC 
CAACTTCCTAGTCGGATTGCTCTTCCCCTCTGCTGCTTACTATTTAGGAGCCTACGTT 
TTTATTATCTTCACCGGCTTCCTCATTACCTTCTTGGCCTTTACCTTCTTCAAAGTCC 
CTGAGACCCGTGGCAGGACTTTTGAGGATATCACACGGGCCTTTGAAGGGCAGGCACA 
CGGTGCAGATAGATCTGGGAAGGACGGCGTCATGGGGATGAACAGCATCGAGCCTGCT 
] I 




ORf Start: ATG at 3 ORF Stop: 1 A A at 1530 




SEQ ID NO i 14 i509 aa ,NTw at 55571 7kD 


NOV 1 12a. 

CG59625-01 Protein Sequence 


MGHRQVTPALIFAITVATIGSFQFGYWTGVINAPETVQIIKEFINKTLTDKANAPPSE 
VLLTNLWSLSVAIFSVGGMIGSFSVGLFVNRFGRRRNSMLIVNLLAATGGCLMGLCKI 
AESVEMLILGRLVIGLFCGLCTGFVPMYIGEISPTALRGAFGTLNQLGIVIGILVAQV 
IFGLELILGSEELWPVLLGFTILPAILQSAALPCCPESPRFLLINRKKEENATRVLQR 
LWGTQDVSQDI QEMKDE SARMSQEKQVTVLELFRVSSYRQP 1 1 ISIVLQLSQQLSGIN 
AWFYYSTGIFKDAGVQQPIYATISAGWKTIFTLLSWAQMLFSWKGKLKFHVITVS 
LLLKLGYTVFKFNLLCSFLLQNHYNGMSFVCIGAILVFVACFEIGPGPIPWFIVAELF 
SQGPRPAAMAVAGCSNWTSNFLVGLLFPSAAYYLGAYVFIIFTGFLITFLAFTFFKVP 
ETF 3RTFEDITRAFEGQAHGADRSGKDGVMGMNSIEPAKETTTNV 



Further analysis of the NOV 1 12a protein yielded the following properties shown in 
Table 112B. 



Table 112B. Protein Sequence Prop* i ties \1 »\ { [2a 


Psort 
analysis: 


0.6400 probability located in plasma membrane; 0.4600 probability located in 
Golgi body; 0.3700 probability located in endoplasmic reticulum (membrane); 
0.1000 probability located in endoplasmic reticulum (lumen) 


SignalP 
analysis: 


Likely cleavage site between residues 22 and 23 



A search of the NOV1 12a protein against the Geneseq database, a proprietary 
database that contains sequences published in patents and patent publication, yielded 
several homologous proteins shown in Table 1 12C. 
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Table 112C. Geneseq Results for NOV112a 


Geneseq 
Identifier 


Protein/Organism/Length 
[Patent #, Date] 


NO VI 12a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Region 


Expect 
Value 


AAY27289 


Glucose transporter protein 
GLUT3 - Homo sapiens, 494 aa. 
[US5942398-A, 24-AUG-1999] 


1..505 
1..492 


389/505 (77%) 
431/505 (85%) 


0.0 


AAR11360 


Glucose Transporter Protein from 
CHO cells - Cricetulus sp, 492 aa. 
[ W( )9 1 03554-A. 2 1 -MAR- 1 99 1 J 


4..491 
6.. 481 


289/489 (59%) 
364/489 (74%) 


e-156 


AAW17835 


Human glucose transporter GLUT- 
1 - Homo sapiens, 492 aa. 
[W09715668-A2, 01 -MAY- 1997] 


4.. 491 
6. .481 


287/489 (58%) 
362/489 (73%) 


e-155 


AAW93000 


Human GLUT1 protein - Homo 
sapiens, 492 aa. [W09618957-A1, 
20-JUN-1996] 


4.. 49 1 
6..481 


284/489 (58%) 
360/489 (73%) 


e-153 


AAB30522 


Amino acid sequence of a 
consensus GLUT polypeptide - 
Synthetic, 493 aa. [US6136547-A, 
24-OCT-2000] 


6..501 
10..490 


289/496 (58%) 
357/496 (71%) 




e-151 



In a BLAST search of public sequence databases, the NOV1 12a protein was found 
to have homology to the proteins shown in the BLASTP data in Table 1 12D. 



Table 112D. Public BLASTP Results for NOV112a 



Protein 
Accession 
Number 


Protein/Organism/Length 


NO VI 12a 
Residues/ 

Match 
Residues 


Identities/ j 
Similarities for Expect 
the Matched j Value 
Portion J 


P11169 


Solute carrier family 2, facilitated 
glucose transporter, member 3 
(Glucose transporter type 3, brain) - 
Homo sapiens (Human), 496 aa. 


1..509 
1..496 


446/510(87%) | 0.0 
468/510(91%) | 


P47842 


Solute carrier family 2, facilitated 
glucose transporter, member 3 
(Glucose transporter type 3, brain) - 
Canis familiaris (Dog), 495 aa. 


1..507 
1..494 


400/507(78%) j 0.0 
446/507(87%) 1 

1 


P47843 


Solute carrier family 2, facilitated 
glucose transporter, member 3 
(Glucose transporter type 3, brain) - 
Ovis aries (Sheep), 494 aa. 


1..505 
1..492 


389/505 (77%) I 0.0 
431/505 (85%) J 
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P58352 


Solute carrier family 2, facilitated 
glucose transporter, member 3 
(Glucose transporter type 3, brain) - 
Bos taurus (Bovine), 494 aa. 


1.505 
1..492 


390/505 (77%) 
431/505 (85%) 


0.0 


Q07647 


Solute carrier family 2, facilitated 
glucose transporter, member 3 
(Glucose transporter type 3, brain) - 
Rattus norvegicus (Rat), 493 aa. 


1..508 
1..492 


380/508 (74%) 
422/508 (82%) 


0.0 



PFam analysis predicts that the NOV 1 12a protein contains the domains shown in 
the Table 112E. 



Table 112E. Domain Analysis of NO VI 12a 


Pfam Domain 


NO VI 12a Match 
Region 


Identities/ 
Similarities 
for the Matched 
Region 


Expect 
Value 


Herpes glycop: domain 1 
of 1 


1..249 


40/417(10%) 
171/417(41%) 


7.2 


GntP_permease: domain 1 
of 1 


65.329 


70/478 (15%) 
185/478 (39%) 


2.5 


sugar_tr: domain 1 of 1 


12..478 


188/503 (37%) 
410/503 (82%) 


2.2e-158 



Example 113. 

The NOV1 13 clone was analyzed, and the nucleotide and predicted polypeptide 
sequences are shown in Table 1 13 A. 



Table 1 13 A. NOV113 Sequence Analysis 




SEQIDNO:315 1 1731 bp 


NOV113a, 

CG59887-01 DNA Sequence 


A-rTA-TTTOS-vvi "A t.". -.-v,.". ">t . :■ i AAATG- '■■ ■ -h 
GCTCGTTCACCTCGTTTGCCCTGGCCTTTTCCATGGTCTCGATCAACACCGGCGTGGT 
CACGCTGTTCGCCGACCCGTTCAACCGCGTCGGGGGCATCGGCATCCTCCTGTGGCTG 
TTGGTGATCCCGCTGGTGTGCTGCATCGTCATGGTCTACTGCCACCTGGCCGGGCGCA 
TTCCGCTCACCGGCTACGCCTACCAATGGTCCAGCCGATTGGCGGGCAATCACTTCGG 






GCGGCCATCGGTACGGTGTTCGCACCGGAGATCTGGGCCAACCCGACACAGGGTCAGA 
TCCAGGGCCTGAGCATCGGCGCCACGCTGGTGGTGGGCTTGCTGAATATCTGCGGGAT 
TCGCCTGGCCACCCGGATCAACGACATCGGCGCGATCATCGAAATCATCGGCACGGTA 
CTGCTGGCGATTGCGTTGTTCTTCGGGGTGTTTTTCTTCTTTGAGCACACCCAGGGCG 
TGGCGATCCTGACCTCCGCGCAACCAGTGAGCGGCGGCACGCTCAGCTTCACCACCAT 
CGCCCTCGCCACCTTGCTGCCGGTCTCGGTGCTGCTGGGTTGGGAAGGCGCCGCCGAC 
CTGTCCGAGGAAACCAAGGACCCACGCCGCGCCGCGCCCCGGGCGATGATTCGTGCGG 
TGCTGGTGTCCAGCGTATTGGGCTTCGTGGTGTTCGCCTTGCTGAGCATCGCGATCCC 
GGGCTCGGTCAGCGAACTGCTCAGCCACAGCGAAAACCCGGTGATCAATATCGTGCGC 
CTGCAACTGGGCAATGCCGCCGGCGTGGGCATGATCGTGATCGCTTTCGCCTCGATCC 
TCGCCTGCCTGATCGCCAACATGGCGGTGGCCACGCGCATGACCTTCGCCCTGTCCCG 
GGACAACATGCTGCCGGGCTCCAAGGTGCTGGCGAAGATCAACCCGCACTTCGGCACG 



434 



... 

p 





GCGGGTTTGTCACGGCGATCTACTCGATGGTCGGCCTGACCTACTACTGCACTTACCT 
GCTGACGCTGATTGCCGCGTACCTGGCCTATAAAAACGGCCGGATGCCGGGGGCGCCT 
GCGGGCGTGTTCAGCCTGGGCCGCTGGTTGCTGCCGATGATTATCCTCGGCGGCCTGT 
GGGCCATCGCGGTGATCCTGACCCTGAGCGTGCCGGAAGAAAGCCACACTGGCGCTAT 
CACCACCGGGGTTACACTCGGCGTGGGCGTGTTGTGGTGGTTGTTTTCACTGCGCACG 
CGCCTTAACAATGGCACCGCCGGGCCGAGCGGCAAATTGCTCGACCACTAGCCGCTGA 
TTGCAGCCAAAAGACAAAACCCCGAACACCGGGGTTTTGTCTTGTCACCTCCAAGGAG 


CTTCCCGATGTTTGAACAGGCCAGCTGGCTCAATCAACCCCAGCATTGGCGCCGAGAA 


GGCGAGCGACTCAAGGTCCGCACCGATGCCAGTACCGATTTCTGGCGTGAAACCCACT 


ATGGTTTTGTACGCGACAACGGGCATTTCCTGTTTGTTGAAACCGACGGCGACTTTAC 


- 3TCA j T 3TGAGTTTACCCACCT6TATGACCTTCGC 




ORF Start: ATG at 43 


ORF Stop: TAG at 1441 




SEQ ID NO: 316 


466 aa |MW at 49070.4kD 


NOV113a, 

CG59887-01 Protein Sequence 


MHRTMSSFTSFALAFSMVSINTGWTLFADPFNRVGGIGILLWLLVIPLVCCIVMVYC 
HLAGRrPLTGYAYQWSSRLAGNHFGWFTGWAFTSFVAGTAATSAAIGTVFAPEIWAN 
PTQGQIQGLSIGATLWGLLNICGIRLATRINDIGAIIEIIGTVLLAIALFFGVFFFF 
EHTQGVAILTSAQPVSGGTLSFTTIALATLLPVSVLLGWEGAADLSEETKDPRRAAPR 
AMIRAVLVSSVLGFWFALLSIAIPGSVSELLSHSENPVINIVRLQLGNAAGVGMIVI 
AFASILACLIANMAVATRMTFALSRDNMLPGSKVLAKINPHFGTPVAAIVLITAIAVL 
LNLASGGFVTAIYSMVGLTYYCTYLLTLIAAYLAYKNGRMPGAPAGVFSLGRWLLPMI 
ILGGLWAIAVILTLSVPEESHTGAITTGVTLGVGVLWWLFSLRTRLNNGTAGPSGKLL 
DH 




SEQ ID NO: 317 1433 bp 


NOVIBb, 

CG59887-02 DNA Sequence 


AAAAATGCACCGCACCATGAGCTCGTTCACCTCGTTTGCCCTGGCCTTTTCCATGGTC 
TCGATCAACACCGGCGTGGTCACGCTGTTCGCCGACCCGTTCAACCGCGTCGGGGGCA 
TCGGCATCCTCCTGTGGCTGTTGGTGATCCCGCTGGTGTGCTGCATCGTCATGGTCTA 
CTGCCACCTGGCCGGGCGCATTCCGCTCACCGGCTACGCCTACCAATGGTCCAGCCGA 
TTGGCGGGCAATCACTTCGGCTGGTTTACCGGCTGGGTGGCGTTCACCTCGTTTGTCG 
CCGGTACAGCCGCCACCTCGGCGGCCATCGGTACGGTGTTCGCACCGGAGATCTGGGC 
CAACCCGACACAGGGTCAGATCCAGGGCCTGAGCATCGGCGCCACGCTGGTGGTGGGC 
TTGCTGAATATCTGCGGGATTCGCCTGGCCACCCGGATCAACGACATCGGCGCGATCA 
TCGAAATCATCGGCACGGTACTGCTGGCGATTGCGTTGTTCTTCGGGGTGTTTTTCTT 
CTTTGAGCACACCCAGGGCGTGGCGATCCTGACCTCCGCGCAACCAGTGAGCGGCGGC 
ACGCTCAGCTTCACCACCATCGCCCTCGCCACCTTGCTGCCGGTCTCGGTGCTGCTGG 
GTTGGGAAGGCGCCGCCGACCTGTCCGAGGAAACCAAGGACCCACGCCGCGCCGCGCC 
CCGGGCGATGATTCGTGCGGTGCTGGTGTCCAGCGTATTGGGCTTCGTGGTGTTCGCC 
TTGCTGAGCATCGCGATCCCGGGCTCGGTCAGCGAACTGCTCAGCCGCAGCGAAAACC 
CGGTGATCAATATCGTGCGCCTGCAACTGGGCAATGCCGCCGGCGTGGGCATGGTCGT 
GATCGCTTTCGCCTCGATCCTCGCCTGCCTGATCGCCAACATGGCGGTGGCCACGCGC 
ATGACCTTCGCCCTGTCCCGGGACAACATGCTGCCGGGCTCCAAGGTGCTGGCGAAGA 
TCAACCCGCACTTCGGCACGCCGGTCGCCGCCATCGTGCTGATCACCGCCATCGCCGT 
GCTGCTGAACCTGGCGAGTGGCGGGTTTGTCACGGCGATCTACTCGATGGTCGGCCTG 
ACCTACTACTGCACTTACCTGCTGACGCTGATTGCCGCGTACCTGGCCTATAAAAACG 
GCCGGATGCCGGGGGCGCCTGCGGGCGTGTTCAGCCTGGGCCGCTGGTTGCTGCCGAT 
GATTATCCTCGGCGGCCTGTGGGCCATCGCGGTGATCCTGACCCTGAGCGTGCCGGAA 
GAAAGCCACACTGGCGCTATCACCACCGGGGTTACACTCGGCGTGGGCGTGTTGTGGT 
GGTTGTTTTCACTGCGCACGCGCCTTAACAATGGCACCGCCGGGCCGAGCGGCAAATT 
GCTCGACCACTAGCCGCTGATTGCAGCCAAAAGACAAAACC 




ORF Start: ATG at 5 


ORF Stop: TAG at 1403 




SEQ ID NO: 318 


466 aa MW at 49075.4kD 


NOV 113b, 

CG59887-02 Protein Sequence 


MHRTMSSFTSFALAFSMVSINTGWTLFADPFNRVGGIGILLWLLVIPLVCCIVMVYC 
HLAGRIPLTGYAYQWSSRLAGNHFGWFTGWVAFTSFVAGTAATSAAIGTVFAPEIWAN 
PTQGQ I QGLS IGATLWGLLNI CGI RLATRINDIGAI I EI IGTVLLAIALFFGVFFFF 
EHTQGVAILTSAQPVSGGTLSFTTIALATLLPVSVLLGWEGAADLSEETKDPRRAAPR 
AMIRAVLVSSVLGFWFALLSIAIPGSVSEL.LSRSENPVINIVRLQLGNAAGVGMWI 
AFASILACLIANMAVATRMTFALSRDNMLPGSKVIAKINPHFGTPVAAIVLITAIAVL 
LNLASGGFVTAIYSMVGLTYYCTYLLTLIAAYLAYKNGRMPGAPAGVFSLGRWLLPMI 
ILGGLWAIAVILTLSVPEESHTGAITTGVTLGVGVLWWLFSLRTRLNNGTAGPSGKLL 
DH 



Sequence comparison of the above protein sequences yields the following sequence 
relationships shown in Table 1 13B. 



435 



Table 113B. Comparison of NOV 1 13a against NOVU3b. 


Protein Sequence 


NOV113a Residues/ 
Match Residues 


Identities/ 
Similarities for the Matched Region 


NOV 113b 


1..466 
1..466 


343/466 (73%) 
344/466 (73%) 



Further analysis of the NOV 1 13a protein yielded the following properties shown in 
Table 11 3C. 



Table 113C. Protein Sequence Properties NOV113a 


PSort 
analysis: 


0.6400 probability located in plasma membrane; 0.4600 probability located in 
Golgi body; 0.3700 probability located in endoplasmic reticulum (membrane); 
0.1000 probability located in endoplasmic reticulum (lumen) 


SignalP 
analysis: 


Likely cleavage site between residues 59 and 60 



A search of the NOV 1 13a protein against the Geneseq database, a proprietary 
database that contains sequences published in patents and patent publication, yielded 
several homologous proteins shown in Table 1 13D. 



Table 113D. Geneseq Results for NO VI 13a 



Geneseq 
Identifier 


Protein/Organism/Length [Patent 
#, Date] 


NOV113a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Region 


Expect 
Value 


AAG49885 


Arabidopsis thaliana protein 
fragment SEQ ID NO: 63155 - 
Arabidopsis thaliana, 504 aa. 
| HP 1 033405-A2, 06-SEP-2000] 


1..449 
17.. 492 


122/486(25%) 
217/486(44%) 


3e-31 


AAG49884 


Arabidopsis thaliana protein 
fragment SEQ ID NO: 63 154 - 
Arabidopsis thaliana, 5 1 6 aa. 
[EP1033405-A2, 06-SEP-2000] 


1..449 
29..504 


122/486 (25%) 
217/486(44%) 


3e-31 


AAG20282 


Arabidopsis thaliana protein 
fragment SEQ ID NO: 22407 - 
Arabidopsis thaliana, 504 aa. 
[EP1033405-A2, 06-SEP-2000] 


1 ..449 
17..492 


122/486(25%) 
217/486 (44%) 


3e-31 


AAG20281 


Arabidopsis thaliana protein 
fragment SEQ ID NO: 22406 - 
Arabidopsis thaliana, 5 1 6 aa. 
[EP1033405-A2, 06-SEP-2000] 


1..449 
29..504 


122/486 (25%) 
217/486(44%) 


3e-31 
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AAG20280 


Arabidopsis thaliana protein 
fragment SEQ ID NO: 22405 - 
Arabidopsis thaliana, 528 aa. 
[EP1033405-A2, 06-SEP-2000] 


I 1..449 
41. .516 

1 _ _ _ 


122/486 (25%) 
217/486(44%) 


3e-31 


In a BLAST search of public sequence databases, the NOV1 1 3a protein was found 
to have homology to the proteins shown in the BLASTP data in Table 1 13E. 


Table 113E. Public BLASTP Results for NO VI 13a 


Protein 
Accession 
Number 


Protein/Organism/Length 


NO VI 13a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Portion 


Expect 
Value 


Q9KZF1 


PROBABLE AMINO 
ACID/METABOLITE 
PERMEASE - Streptomyces 
coelicolor, 504 aa. 


3..450 
27..481 


139/469(29%) 
214/469 (44%) 


2e-41 j 


Q98H14 


AMINO ACID/METABOLITE 
PERMEASE - Rhizobium loti 
(Mesorhizobium loti), 518 aa. 


1..446 
27..485 


118/466 (25%) 
209/466 (44%) 




le-36 j 


Q92NI8 


PUTATIVE AMINO-ACID 
PERMEASE PROTEIN - 
Rhizobium meliloti (Sinorhizobium 
meliloti), 515 aa. 


1..449 
25..487 


122/475 (25%) 
204/475 (42%) 


le-32 


022509 


PUTATIVE A\1IN< > \< 'ID OR 
GABA PERMEASE - Arabidopsis 
thaliana (Mouse-ear cress), 5 1 6 aa. j 


1..449 
29..504 


122/486 (25%) 
217/486 (44%) 


le-30 


Q9ZU50 


PIJTATIVC AMINO ACID 
PERMEASE - Arabidopsis thaliana 
(Mouse-ear cress), 5 1 7 aa. 


1..449 

29..505 


120/487 (24%) 
216/487 (43%) 


2e-28 



PFam analysis predicts that the NOV1 13a protein contains the domains shown in 
the Table 11 3F. 



Table 113F. Domain Analysis of NOV113a 


Pfam Domain 


NO VI 13a Match 
Region 


Identities/ 
Similarities 
for the Matched 
Region 


Expect 
Value 


oxidored_q3: domain 1 of 1 


162..307 


28/182 (15%) 
91/182 (50%) 


3.7 
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ISKChannel: domain 1 of 1 


196.326 


32/136 (24%) 

If. fAC\0/\ 

DDI 130 (4Uto) 


8.8 


ABC2 membrane: domain 1 
of 1 


122..377 


46/273 (17%) 
UHy z. 1 j \D\j /oj 


8.3 


SSF: domain 1 of 1 


7..394 


77/470(16%) 
222/470 (47%) 


7.8 


Aajxans: domain 1 of 1 


29..417 


67/483 (14%) 
236/483 (49%) 


9.7 


aa_permeases: domain 1 of 1 


1..451 


86/516(17%) 
287/516(56%) 


l.le-05 



Example 114. 



O The NOV1 14 clone was analyzed, and the nucleotide and predicted polypeptide 

Jj; sequences are shown in Table 1 14A. 



Table 114A. NOV114 Sequence Analysis 


|SEQ ID NO: 319 j 876 bp 


NOV114a, 

CG59861-01 DNA Sequence 


AACTTGCTTTTGGGAGCCAGCGGTATGGCGTCGGGCTGCAAGATTGGCCCGTCCATCC 
TCAACAGCGACCTGGCCAATTTAGGGGCCGAGTGCTCCCGGATGCTAGACTCTGGGGC 
CGATTATCTGCACCTGGACGTAATGGACGGGCATTTTGTTCCCAACATCACCTTTGGT 
CACCCTGTGGTGGAAAGCCTTCGAAAGCAGCTAGGCCAGGACCCTTTCTTTGACATGC 
ACATGATGGTGTCCAAGCCAGAACAGTGGGTAAAGCCAATGGCTGTAGCAGGAGCCAA 
TCAGTACACCTTTCATCTCGAGGCTACTGAGAACCCAGGGGCTTTGATTAAAGACATT 
CGGGAGAATGGGATGAAGGTTGGCCTTGCCATCAAACCAGGAACCTCAGTTGAGTATT 
TGGCACCATGGGCTAATCAGATAGATATGGCCTTGGTTATGACAGTGGAACCGGGGTT 
TGGAGGGCAGAAATTCATGGAAGATATGATGCCAAAGGTTCACTGGTTGAGGACCCAG 
TTCCCATCTTTGGATATAGAGGTCGATGGTGGAGTAGGTCCTGACACTGTCCATAAAT 
GTGCAGAGGCAGGAGCTAACATGATTGTGTCTGGCAGTGCTATTATGAGGAGTGAAGA 
CCCCAGATCTGTGATCAATCTATTAAGAAATGTTTGCTCAGAAGCTGCTCAGAAACGT 
TCTCTTGATCGGTGAAACCATAAGGAGCCCAGTGTTCCTGTTCATGAAATCTCCCTTT 


TACTGGAAAACAGGAATATTGACTACCAAATCACAATGCAATTGAAGCCGTACTGCTT 


TTTTGAGCAGTTATTCATTCCAGTGATTAAAACTGATTGTGCAGAATAAAAAAAAAAA 




AAAAAA 




ORF Start: ATG at 25 


ORF Stop: TGA at 709 




SEQ ID NO: 320 


228 aa !MW at 24901. 4kD 


NOV1 14a, 

CG59861-01 Protein Sequence 


MASGCKIGPSILNSDLAKLGAECSRMLDSGADYLHLDVMDGHFVPNITFGHPWESLR 
KQLGQDPFFDMHMMVSKPEQWVKPMAVAGANQYTFHLEATENPGALIKDIRENGMKVG 
LAIKPGTSVEYLAPWANQIDMALVMTVEPGFGGQKFMEDMMPKVHWLRTQFPSLDIEV 
' r_ KECAHAGANMIV SAIE 3EDPI ILL = Q SLOB 




SEQ ID NO: 321 (730 bp 


NOV1 14b, 

CG59861-02 DNA Sequence 


^ T ~, , -t-TT - --, : - :• :~AT<3- " '■ • " - - -.UTViC- T -T ~< 
TCAACAGCGACCTGGCCAATTTAGGGGCCGAGTGCCTCCGGATGCTAGACTCTGGGGC 
CGATTATCTGCACCTGGACGTAATGGACGGGCATTTTGTTCCCAACATCACCTTTGGT 
CACCCTGTGGTAGAAAGCCTTCGAAAGCAGCTAGGCCAGGACCCTTTCTTTGACATGC 
ACATGATGGTGTCCAAGCCAGAACAGTGGGTAAAGCCAATGGCTGTAGCAGGAGCCAA 
TCAGTACACCTTTCATCTCGAGGCTACTGAGAACCCAGGGGCTTTGATTAAAGACATT 
CGGGAGAATGGGATGAAGGTTGGCCTTGCCATCAAACCAGGAACCTCAGTTGAGTATT 
TGGCACCATGGGCTAATCAGATAGATATGGCCTTGGTTATGACAGTGGAACCGGGGTT 
TGGAGGGCAGAAATTCATGGAAGATATGATGCCAAAGGTTCACTGGTTGAGGACCCAG 
TTCCCATCTTTGGATATAGAGGTCGATGGTGGAGTAGGTCCTGACACTGTCCATAAAT 
GTGCAGAGGCAGGAGCTAACATGATTGTGTCTGGCAGTGCTATTATGAGGAGTGAAGA 
CCCCAGATCTGTGATCAATCTATTAAGAAATGTTTGCTCAGAAGCTGCTCAGAAACGT 
TCTCTTGATCGGTGAAACCATAAGGAGCCCAGTG 




ORF Start: ATG at 25 


ORF Stop: TGA at 709 




SEQ ID NO: 322 


228 aa |MW at 24927.5kD 
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NOV 114b, 

CG59861-02 Protein Sequence 



MASGCKIGPSILNSDLANLGAECLRMLDSGADYLHLDVMDGHFVPNITFGHPWESLR 
KQLGQDPFFDMHMMVSKPEQWKPMAVAGANQYTFHLEATENPGALIKDIRENGMKVG 
LAIKPGTSVEYLAPWANQIDMALVMTVEPGFGGQKFMEDMMPKVHWLRTQFPSLDIEV 
■rjGVGPl)7VHKCAKAGaN'yXVSGSAIMRSEDPRSVINLLRNVCSHaACIC:<-;LI.'R 



Sequence comparison of the above protein sequences yields the following sequence 
relationships shown in Table 1 14B. 



Table 114B. Comparison of NOV114a against NOV114b. 


Protein Sequence 


NOV1 14a Residues/ 
Match Residues 


Identities/ 
Similarities for the Matched Region 


NOV! 14b 


1..228 
1..228 


227/228 (99%) 
227/228 (99%) 



Further analysis of the NOV1 14a protein yielded the following properties shown in 
Table 114C. 



Table 114C. Protein Sequence Properties NOV114a 


PSort 
analysis: 


0.6500 probability located in cytoplasm; 0.1753 probability located in 
lysosome (lumen); 0.1000 probability located in mitochondrial matrix space; 
0.0000 probability located in endoplasmic reticulum (membrane) 


SignalP 
analysis: 


No Known Signal Sequence Predicted 



A search of the NOV1 14a protein against the Geneseq database, a proprietary 
database that contains sequences published in patents and patent publication, yielded 
several homologous proteins shown in Table 1 14D. 



Table 1 141). Geneseq Results for NOV! 14a 


Geneseq 
Identifier 


Protein/Organism/Length 

[Patent #, 
Protein/Organism/Length 
[Patent #, Date] 


NO VI 14a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Region 


Expect 
Value 


AAM41358 


Human polypeptide SEQ ID NO 
6289 - Homo sapiens, 247 aa. 
[WO200153312-A1, 26-JUL- 
2001] 


1..228 
20..247 


227/228 (99%) 
227/228 (99%) 


e-132 


AAM41357 


Human polypeptide SEQ ID NO 
6288 - Homo sapiens, 247 aa. 
[WO200153312-A1, 26-JUL- 
2001] 


1..228 
20..247 


227/228 (99%) 
227/228 (99%) 


e-132 



439 



AAM39571 


Human polypeptide SEQ ID NO 
2716 - Homo sapiens, 228 aa. 
[WO200153312-A1, 26-JUL- 
2001] 


1..228 
1..228 


227/228 (99%) 
227/228 (99%) 


e-132 


AAB71912 


Human ISOM-4 - Homo sapiens, 
228 aa. [WO2001 12790-A2, 22- 
FEB-2001] 


1..228 
1..228 


227/228 (99%) 
227/228 (99%) 


e-132 


AAM39572 


Human polypeptide SEQ ID NO 
2717 - Homo sapiens, 246 aa. 
[WO200153312-Al,26-JUL- 
2001] 


1..228 
1..246 


227/246 (92%) 
227/246 (92%) 


e-129 



In a BLAST search of public sequence databases, the NOV1 14a protein was found 
to have homology to the proteins shown in the BLASTP data in Table 1 14E. 



Table 114E. Public BLASTP Results for NOV114a 



Protein 
Accession 
Number 


Protein/Organism/Length 


NO VI 14a 

Residues/ 

Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Portion 


Expect 
Value 


Q96AT9 


HYPOTHETICAL 24.9 KDA 
PROTEIN - Homo sapiens 
(Human), 228 aa. 


1..228 
1..228 


227/228 (99%) 
227/228 (99%) 


e-131 


Q9BSB5 


HYPOTHETICAL 25.3 KDA 
PROTEIN - Homo sapiens 
(Human), 232 aa (fragment). 


1..228 
5..232 


227/228 (99%) 
227/228 (99%) 


e-131 


AAH19126 


HYPOTHETICAL 24.9 KDA 
PROTEIN - Mus musculus 
(Mouse), 228 aa. 


1..228 
1..228 


221/228 (96%) 
226/228 (98%) 


e-129 


043767 


RIBUL< >SE-5-PHOSPHATE- 
EPIMERASE - Homo sapiens 
(Human), 174 aa (fragment). 


55..228 
1..174 


174/174(100%) 
174/174(100%) 


2e-98 


Q96N34 


■ DNA1 LJ31466 MS. CLONE 
NT2NE2001372, HIGHLY 
SIMILAR TO HOMO SAPIENS 
PUTATIVE RIBULOSE-5- 
PHOSPHATE-EPIMERASE - 
Homo sapiens (Human), 1 78 aa. 


69..228 
1..178 


160/178 (89%) 
160/178 (89%) 


2e-86 



PFam analysis predicts that the NOV1 14a protein contains the domains shown in 
the Table 11 4E 
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Table 1I4K. Domain Analysis of NOV! 14a 


Pfam Domain 


NOV114a Match 
Region 


Identities/ 
Similarities 
for the Matched 
Region 


Expect 
Value 


Ribul P 3 epim: domain 1 
of 1 


6..204 


95/209 (45%) 
174/209 (83%) 


1.9e-105 


IGPS: domain 1 of 1 


179..213 


15/35 (43%) 
27/35 (77%) 


0.02 


trpsyntA: domain 1 of 1 


34..222 


45/273 (16%) 
124/273 (45%) 


2.9 



Example 115. 



The NOV1 15 clone was analyzed, and the nucleotide and predicted polypeptide 
sequences are shown in Table 1 15 A. 



Table 115A. NOV115 Sequence Analysis 




SEQ ID NO: 323 


1761 bp 


]NOV115a, 

CG59857-01 DNA Sequence 


AGTGTGGTACCT^ '1 n TG&AGGGGTTGACAAGGGAAAGGGCACCGGGG 


GGCACAGAGATGCAGGACAGATTGCACATCC7GGAGGACCTGAATATGCTCTACATTC 
GGCAGATGGCACTCAGCCTGGAGGACACGGAGTTGCAGAGGAAGCTAGACCATGAGAT 
CCGGATGAGGGAAGGGGCCTGTAAGCTGCTGGCAGCCTGCTCCCAGCGAGAGCAGGCT 
CTGGAGGCCACCAAGAGCCTGCTAGTGTGCAACAGCCGCATCCTCAGCTACATGGGCG 
AGCTGCAGCGGCGCAAGGAGGCGCAGGTGCTGGGGAAGACAAGCCGGCGGCCTTCTGA 
CAGTGGCCCGCCCGCTGAGCGCTCCCCCTGCCGCGGCCGGGTCTGCATCTCTGACCTC 
CGGATTCCACTCATGTGGAAGGACACAGAATATTTCAAGAACAAAGACTTGCACCGCT 
GGGCTGTGTTCCTGCTGCTGCAGCTGGGGGAACACATCCAGGACACAGAGATGATCCT 
AGTGGACAGGACCCTCACAGACATCTCCTTTCAGAGCAATGTGCTCTTCGCTGAGGCG 
GGGCCAGACTTTGAACTGCGGTTAGAGCTGTATGGGGCCTGTGTGGAAGAAGAGGGGG 
CCCTGACTGGCGGCCCCAAGAGGCTTGCCACCAAACTCAGCAGCTCCCTGGGCCGCTC 
CTCAGGGAGGCGTGTCCGGGCATCGCTGGACAGTGCTGGGGGTTCAGGGAGCAGTCCC 
ATCTTGCTCCCCACCCCAGTTGTTGGTGGTCCTCGTTACCACCTCTTGGCTCACACCA 
CACTCACCCTGGCAGCAGTGCAAGATGGATTCCGCACACATGACCTCACCCTTGCCAG 
TCATGAGGAGAACCCTGCCTGGCTGCCCCTTTATGGTAGCGTGTGTTGCCGTCTGGCA 
GCTCAGCCTCTCTGCATGACTCAGCCCACTGCAAGTGGTACCCTCAGGGTGCAGCAAG 
CTGGGGAGATGCAGAACTGGGCACAAGTGCATGGAGTTCTGAAAGGCACAAACCTCTT 
CTGTTACCGGCAACCTGAGGATGCAGACACTGGGGAAGAGCCGCTGCTTACTATTGCT 
GTCAACAAGGAGACTCGAGTCCGGGCAGGGGAGCTGGACCAGGCTCTAGGACGGCCCT 
TCACCCTAAGCATCAGTAACCAGTATGGGGATGATGAGGTGACACACACCCTTCAGAC 
AGAAAGTCGGGAAGCACTGCAGAGCTGGATGGAGGCTCTGTGGCAGCTTTTCTTTGAC 
ATGAGCCAATGGAAGCAGTGCTGTGATGAAATCATGAAAATTGAAACTCCTGCTCCCC 
GGAAACCACCCCAAGCACTGGCAAAGCAGGGGTCCTTGTACCATGAGATGGCTATTGA 
GCCGCTGGATGACATCGCAGCGGTGACAGACATCCTGACCCAGCGGGAGGGCGCAAGG 
CTGGAGACACCCCCACCCTGGCTGGCAATGTTTACAGACCAGCCTGCCCTGCCTAACC 
CCTGCTCGCCTGCCTCAGTGGCCCCAGCCCCAGACTGGACCCACCCCCTGCCCTGGGG 
GAGACCCCGAACCTTTTCCCTGGATGCTGTCCCCCCAGACCACTCCCCTAGGGCTCGC 
TCGGTTGCCCCCCTCCCACCTCAGCGATCCCCACGGACCAGAGGCCTCTGCAGCAAAG 
GCCAACCTCGCACTTGGCTCCAGTCACCAGTGTGAGAGAGAAAGGTGCTGGCATAGGA 
TCTGCCCAGAAGAGAAAATGA 


[ORF Start: ATG at 68 


ORF Stop: TGA at 1715 


|SEQ ID NO: 324 


549 aa |MW at 61 171.0kD 


NOV115a, 

CG59857-01 Protein Sequence 


MQDRLHILEDLNMLYIRQMALSLEDTELQRKLDHEIRMRKGACKLLAACSQREQALEA 
TKSLLVCNSR I LSYMGELQRRKEAQVLGKTSRRPSDSGPPAERS PCRGRVCI SDLRI P 
LMWKDTEYFKNKDLHRWAVFLLLQLGEHIQDTEMILVDRTLTDISFQSNVLFAEAGPD 
FELRLELYGACVEEEGALTGGPKRLATKLSSSLGRSSGRRVRASLDSAGGSGSSPILL 
PTPWGGPRYHLLAHTTLTLAAVQDGFRTHDLTLASHEENPAWLPLYGSVCCRLAAQP 
LCMTQPTASGTLRVQQAGEMQNWAQVHGVLKGTNLFCYRQPEDADTGEEPLLTIAVNK 
E ! TE il 1 £ 
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WKQCCDEIMKIETPAPRKPPQALAKQGSLYHEMAIEPLDDIAAVTDILTQREGARLET 
PPPWLAMFTDQPALPNPCSPASVAPAPDWTHPLPWGRPRTFSLDAVPPDHSPRARSVA 
PLPPQRS PRTRGLCSKGQPRTWLQSPV 

Further analysis of the NOV1 15a protein yielded the following properties shown in 
Table 115B. 



Table 115B. Protein Sequence Properties NOV115a 


PSort 
analysis: 


0.4500 probability located in cytoplasm; 0.3000 probability located in 
microbody (peroxisome); 0.1707 probability located in lysosome (lumen); 
0.1000 probability located in mitochondrial matrix space 


SignalP 
analysis: 


No Known Signal Sequence Predicted 



A search of the NOV 1 15a protein against the Geneseq database, a proprietary 
database that contains sequences published in patents and patent publication, yielded 
several homologous proteins shown in Table 1 15C. 



Table 1 15C. Geneseq Results for NOV1 15a 


| Geneseq 
Identifier 


Protein/Organism/Length [Patent 
#, Date] 


NOV115a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Region 


Expect 
Value 


AAB35241 


Human rhotekin - Homo sapiens, 
563 aa. [US6183990-B1, 06-FEB- 
2001] 


24..549 
37..S63 


526/527 (99%) 
526/527 (99%) 


0.0 


AAY44559 


Human Rhotekin protein - Homo 
sapiens, 563 aa. [W09958667-A1, 
18-NOV-1999] 


24..549 
37..563 


526/527 (99%) 
526/527 (99%) 


0.0 


AAB35242 


Human rhotekin EST-derived 
protein - Homo sapiens, 527 aa. 
[US6183990-B1, 06-FEB-2001] 


24..549 
1..527 


522/527 (99%) 
523/527 (99%) 


0.0 


AAY44560 

I 


Human Rhotekin variant protein - 
Homo sapiens, 527 aa. 
[W09958667-A1, 18-NOV-1999] 


24..549 
1..527 


522/527 (99%) 
523/527 (99%) 


0.0 


AAB26790 


Human Ras correlative GTP 
binding kinase protein sequence - 
Homo sapiens, 544 aa. 
[CM 257924-A, 28-JUN-2000] 


24..549 
18..544 


518/527(98%) 
519/527(98%) 


0.0 



In a BLAST search of public sequence databases, the NOV 1 15a protein was found 
to have homology to the proteins shown in the BLASTP data in Table 1 15D. 
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Table 115D. Public BLASTP Results for NO VI 15a 


Protein 

Accession 
Number 


Protein/Organism/Len gth 


NOV115a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Portion 


Expect 
Value 


AAH 17727 


SIMILAR TO RHOTEKIN - 
Homo sapiens (Human), 550 aa. 


1..549 
1..550 


549/550 (99%) 
549/550(99%) 


0.0 


Q9BST9 


SIMILAR TO RH( H EK1N - 
Homo sapiens (Human), 587 aa 
(fragment). 


24..549 
61..587 


526/527 (99%) 
526/527 (99%) 


0.0 


Q96PT6 


RTKN - Homo sapiens (Human), 
544 aa. 


24..549 
18..544 


518/527 (98%) 
519/527 (98%) 


0.0 


Q9HB05 


RHOTEKIN - Homo sapiens 
(Human), 567 aa (fragment). 


24..549 
41..567 


505/527 (95%) 
513/527 (96%) 


0.0 


Q61 l l >2 


RHOTEKIN - Mus musculus 
(Mouse), 551 aa. 


1..549 
1..551 


477/551 (86%) 
500/551 (90%) 


0.0 



PFam analysis predicts that the NOV1 15a protein contains the domains shown in 
the Table 115E. 



Table 115E. Domain Analysis of NO VI 15a 



Pfam Domain 


NO VI 15a Match Region 


Identities/ 
Similarities 
for the Matched Region 


Expect Value 


HR1: domain 1 of 1 


23..95 


17/87 (20%) 
54/87 (62%) 


0.27 


PH: domain 1 of 1 


296.-397 


19/102(19%) 
72/102 (71%) 


le-06 



Example 116. 

The NOV1 16 clone was analyzed, and the nucleotide and predicted polypeptide 
sequences are shown in Table 1 16A. 



Table 116A. NO VI 16 Sequence Analysis 




SEQ ID NO: 325 |450 bp 


NOV 116a, 

CG59855-01 DNA Sequence 


CTGGGAGACTGAAAAAATGCAGACCACCGGGGTATTACTCATTTCTCCAGCTCTGATC 
TGCTGTTGTACCAGGGGTCTAATCAGGCCTGTGTCTGCCTTCTCCTTGAATAGCCCAG 
AGAATTCATCTAAACAGCCTTCCTACAGCAGCTCCCCACTCCAGGTGGCCAGACGGGA 
GTTCCAGACCAGTGTTGTCTCCCGGGACACTGACACAGCCGCCAAGTTTATTGGTGCT 
GGGTCAGCCACAGTTGGTGTGGCTGATTCAGGGGCTGGCATTGGAGCGGTGTTTGGCA 
GCTTGATTATTGTCTATGCCAGGAAGCTGTCTCTCAAGCAGCAACTCCTCTTCTATGC 
CATTCTGGGCTTTGCCCTGTCTGAGGCCATGGGGCTCTTCTGTTTGATGATCTCCTTC 
TTCATCCTGT" rGAJ AGGG1 11 
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ORF Start: ATG at 17 


ORF Stop: TGA at 425 




SEQ II) NO: 326 


136 aa |M\V at !4384.6kD 


NOV 116a, 

CG59855-01 Protein Sequence 


MQTTGVLL I SPALI CCCTRGLI RPVSAFSLNSPENSS KQPS YSSSPLQVARRE FQTSV 
VSRDTDTAAKFIGAGSATVGVADSGAGIGAVFGSLIIVYARKLSLKQQLLFYAILGFA 
LSEAMGLFCLMISFFILFAM 




SEQ ID NO: 327 


434 bp 


NOV116b, 

CG59855-02 DNA Sequence 


ATGCAGACCACCGGGGTATTACTCATTTCTCCAGCTCTGATCT 1 1 

GTCTAATCAGGCCTGTGTCTGCCTTCTCCTTGAATAGCCCAGAGAATTCATCTAAACA 

GCCTTCCTACAGCAGCTCCCCACTCCAGGTGGCCAGACGGGAGTTCCAGACCAGTGTT 

GTCTCCCGGGACACTGACACAGCCGCCAAGTTTATTGGTGCTGGGTCAGCCACAGTTG 

GTGTGGCTGATTCAGAGGCTGGCATTGGAGCGGTGTTTGGCAGCTTGATTATTGTCTA 

TGCCAGGAAGCTGTCTCTCAAGCAGCAACTCCTCTTCTATGCCATTCTGGGCTTTGCC 




TGTGAGGCTCCGTGAGGGTCACCTGCCT 




ORF Start: ATG at 1 


ORF Stop: TGA at 409 




SEQ ID NO: 328 


136 aa <MW at 14456.7kD 


NOV1 16b, 

CG59855-02 Protein Sequence 


MQTTGVLL I SPALI CCCTRGLI RPVSAFSLNSPENSS KQPS YSSSPLQVARREFQTSV 
VSRDTDTAAKFI GAGSATVGVADSEAGI GAVFGSLI I VYARKLSLKQQLLFYAI LGFA 
LFC) FFILFAM 



Sequence comparison of the above protein sequences yields the following sequence 
relationships shown in Table 1 16B. 



Table 1 16B. Comparison of NOV 1 16a against NOV 11 6b. 


Protein Sequence 


NOV116a Residues/ 
Match Residues 


Identities/ 
Similarities for the Matched Region 


NOV116b 


1..136 
1..136 


120/136(88%) 
120/136(88%) 



Further analysis of the NOV 1 16a protein yielded the following properties shown in 
Table 11 6C. 



Table 116C. Protein Sequence Properties NOV116a 


iPSort 
analysis: 


0.9190 probability located in plasma membrane; 0.3000 probability located in 
lysosome (membrane); 0.1888 probability located in microbody (peroxisome); 
0.1000 probability located in endoplasmic reticulum (membrane) 


SignalP 
analysis: 


Likely cleavage site between residues 28 and 29 



A search of the NOV1 16a protein against the Geneseq database, a proprietary 
database that contains sequences published in patents and patent publication, yielded 
several homologous proteins shown in Table 1 16D. 
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Table 1 16D. Geneseq Results for NOV116a 


Geneseq 
Identifier 


Protein/Organism/Length [Patent 
#, Date] 


NO VI 16a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Region 


Expect 
Value 


AAG75142 


Human colon cancer antigen protein 
SEQ ID NO.5906 - Homo sapiens, 
142 aa. [WO200122920-A2, 05- 
APR-2001] 


1..136 
7.. 142 


115/136 (84%) 
119/136(86%) 




2e-57 


AAB43866 


Human cancer associated protein 
sequence SEQ ID NO: 13 1 1 - Homo 
sapiens, 142 aa. [WO200055350- 
Al,21-SEP-2000] 


1..136 
7 142 


115/136(84%) 

1 10/1^6 (R&o/ n \ 
l lyl JOO \o\j /o) 


2e-57 

__J 


AAU69713 


Cell death protective sequence CNI- 
00730, protein #1 - Homo sapiens, 
142 aa. [WO2001 76532- A2, 18- 
OCT-2001] 


7..136 
7..142 


85/136 (62%) 
98/136 (71%) 


2e-36 

I 


ABB12016 


Human ATP synthase subunit 
homologue, SEQ ID NO:2386 - 
Homo sapiens, 1 87 aa. 
[WO200157188-A2, 09-AUG- 
2001] 


7..136 
52.. 187 


85/136 (62%) 
98/136 (71%) 


2e-36 


AAB53428 


Human colon cancer antigen protein 
sequence SEQ ID NO:968 - Homo 
sapiens, 212 aa. [WO200055351- 
A1,21-SEP-2000] 


7..136 
77..212 


85/136(62%) 
98/136(71%) 


2e-36 



In a BLAST search of public sequence databases, the NOV 1 16a protein was found 
to have homology to the proteins shown in the BLASTP data in Table 1 1 6E. 



Table 116E. Public BLASTP Results for NO VI 16a 



Protein 
Accession 
Number 


Pro tei n/O rganism/Len gth 


NOV116a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Portion 


Expect 
Value 


P05496 


ATP synthase lipid-binding protein, 
mitochondrial precursor (EC 
3.6.1.34) (ATP synthase proteolipid 
PI) (ATPase protein 9) (ATPase 
subunit C) - Homo sapiens (Human), 
136 aa. 


1..136 
1..136 


115/136(84%) 
119/136(86%) 


9e-57 
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P32876 


ATP synthase lipid-binding protein, 
mitochondrial precursor (EC 
3.6.1.34) (ATP synthase proteolipid 
PI) (ATPase protein 9) (ATPase 
subunit C) - Bos taurus (Bovine), 136 
aa. 


1..136 
1..136 


1 13 136(83%) 
117/136(85%) 


le-54 


P17605 


ATP synthase lipid-binding protein, 
mitochondrial precursor (EC 
3.6.1.34) (ATP synthase proteolipid 
PI) (ATPase protein 9) (ATPase 
subunit C) - Ovis aries (Sheep), 136 
aa. 


1..136 
1..136 


113/136(83%) 
117/136(85%) 


2e-54 


Q9CR84 


ATP SYNTHASE C CHAIN 
ISOFOPvM 1 (EC 3.6.1.34) (LIPID- 
BINDING PROTEIN) (SUBUNIT C) 
- Mus musculus (Mouse), 136 aa. 


1..136 
1..136 


112/136(82%) ! 
117/136(85%) | 


le-53 


P48202 


ATP synthase lipid-binding protein, 
mitochondrial precursor (EC 
3.6.1.34) (ATP synthase proteolipid 
PI) (ATPase protein 9) (ATPase 
subunit C) - Mus musculus (Mouse), 
136 aa. 


1..136 
1.-136 


112/136 (82%) 
117/136(85%) 


le-53 



PFam analysis predicts that the NOV1 16a protein contains the domains shown in 
the Table 11 6F. 



Table 116F. Domain Analysis of NO VI 16a 


Pfam Domain 


NOV116a Match 
Region 


Identities/ 
Similarities 
for the Matched 
Region 


Expect 
Value 


ATP-synt C: domain 1 
of 1 


67..135 


31/70 (44%) 
57/70 (81%) 


2.3e-18 



Example 117. 

The NOV1 17 clone was analyzed, and the nucleotide and predicted polypeptide 
sequences are shown in Table 1 1 7A. 



Table 117A. NOV117 Sequence Analysis 




SEQIDNO:329 |1 769 bp 


NOV117a, 

CG59807-01 DNA Sequence 


GAGGTGATGCTGGAGACCTGCGGACTTCTCATGTCTCTGGGCTGTCCTTTGTTCAAAC 
CAGAGCTGATCTACCAGTTGGATCACAGACAGGAGCTATGGATGGCTACAAAAGACCT 
CTCCCAAAGCTCCTATCCAGGTGACAACACAAAACCCAAGACCACAGAGCCTACCTTT 
TCTCACCTGGCCTTGCCTGAGGAAGTCTTACTCCAGGAACAACTGACACAAGGAGCCT 
CAAAGAACTCCCAATTAGGGCAATCCAAGGATCAGGATGGGCCATCTGAAATGCAAGA 

r % „ - t • ^ - . _ i 
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TCTGAACGTGATGGTTTGGGGTCAGATGATGGTGTATGTACAAAGATTACACAGAAAC 
AAGTTTCAACAGAAGGTGATCTCTATGAATGTGATTCACATGGACCAGTTACAGATGC 
CTTGATTCGCGAAGAGAAAAATTCCTATAAATGTGAGGAATGCGGGAAAGTGTTTAAA 
AAGAATGCCCTCCTTGTTCAGCATGAACGGATTCACACTCAAGTGAAGCCCTATGAAT 
GCACAGAGTGTGGGAAAACCTTTAGCAAGAGCACTCATCTTCTTCAGCACCTCATCAT 
CCACACTGGGGAGAAGCCCTATAAGTGCATGGAGTGTGGGAAGGCTTTTAACCGCAGG 
TCACACCTCACACGGCACCAGCGGATTCACAGTGGAGAGAAGCCTTATAAGTGCAGTG 
AATGTGGAAAGGCCTTCACCCACCGCTCCACTTTTGTCTTGCATCACAGGAGCCACAC 
TGGAGAAAAACCCTTTGTGTGCAAAGAGTGTGGCAAAGCCTTTCGAGATAGGCCAGGT 
TTCATTCGACACTACATCATCCACACGGGAGAGAAGCCCTATGAGTGCATTGAGTGTG 
GGAAGGCCTTCAACCGCCGGTCATACCTCACGTGGCACCAACAGATTCACACTGGAGT 
GAAACCCTTTGAATGCAACGAGTGTGGAAAAGCTTTTTGCGAGAGTGCAGACCTCATT 
CAACACTACATTATCCACACTGGGGAGAAGCCCTATAAGTGCATGGAGTGTGGGAAGG 
CGTTCAACCGTAGGTCACACCTCAAGCAGCATCAACGGATTCACACTGGGGAGAAGCC 
TTATGAATGCAGTGAATGTGGAAAGGCCTTCACCCACTGCTCCACTTTTGTCTTGCAT 
AAAAGGACCCACACAGGAGAAAAACCCTATGAATGCAAAGAATGTGGAAAAGCCTTTA 
GTGATAGGGCAGACCTCATTCGCCACTTCAGCATCCACACTGGAGAGAAACCCTATGA 
GTGCGTGGAGTGTGGAAAGGCCTTCAACCGCAGCTCACACCTCACGAGGCACCAACAG 
ATTCACACTGGAGAGAAACCCTATGAATGCATCCAGTGTGGGAAAGCCTTTTGCCGGA 
GCGCAAACCTTATTCGACACTCCATCATTCACACTGGAGAGAAGCCGTATGAATGCAG 
TGAGTGTGGAAAGGCTTTTAATCGCGGCTCATCCCTCACACATCATCAAAGGATTCAT 
ACTGGGAGAAACCCTACCATTGTAACAGATGTGGGAAGACCTTTTATGACTGCACAGA 
CTTCAGTCAACATCCAGGAACTTTTATTAGGGAAAGAGTTTTTGAATATCACCACTGA 
AGAAAATCTGTGGTGAAAGGGAACATCTTACCATCTGGCCATTCACACTGAAGAGAAA 


CTTCATAAGCATC CTCTCTTTGAGAAAAC 




ORF Start: ATG at 7 


ORF Stop: TGA at 1696 




SEQ ID NO: 330 


563 aa |MW at 64300.6kD 


NOV117a, 

CG59807-01 Protein Sequence 


MLETCGLLMSLG-l ,] 1 l T E HRQELWMATKDLSQSSYPGDNTKPKTTEPTFSH 
LALPEEVLLQEQLTQGASKNSQLGQSKDQDGPSEMQEVHLKIGIGPQRGKLLEKMSSE 
RDGLGSDDGVCTKITQKQVSTEGDLYECDSHGPVTDALIREEKNSYKCEECGKVFKKN 
ALLVQHERIHTQVKPYECTECGKTFSKSTHLLQHLIIHTGEKPYKCMECGKAFNRRSH 
LTRHQRIHSGEKPYKCSECGKAFTHRSTFVLHHRSHTGEKPFVCKECGKAFRDRPGFI 
RHYIIHTGEKPYECIECGKAFNRRSYLTWHQQIHTGVKPFECNECGKAFCESADLIQH 
YIIHTGEKPYKCMECGKAFNRRSHLKQHQRIHTGEKPYECSECGKAFTHCSTFVLHKR 
THTGEKPYECKECGKAFSDRADLIRHFSIHTGEKPYECVECGKAFNRSSHLTRHQQIH 
TGEKPYECIQCGKAFCRSANLIRHSIIHTGEKPYECSECGKAFNRGSSLTHHQRIHTG 
T ^^FtFIT, 1 tMl :_LL ,1 TFI IIIfEEMLW 



Further analysis of the NOV 1 17a protein yielded the following properties shown in 
Table 117B. 



Table 117B. Protein Sequence Properties NOV117a 


Psort 
analysis: 


0.4500 probability located in cytoplasm; 0.3000 probability located in 
microbody (peroxisome); 0.1000 probability located in mitochondrial matrix 
space; 0.1000 probability located in lysosome (lumen) 


SignalP 
analysis: 


Likely cleavage site between residues 19 and 20 



A search of the NOV 1 17a protein against the Geneseq database, a proprietary 
database that contains sequences published in patents and patent publication, yielded 
several homologous proteins shown in Table 1 17C. 
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Table 117C. Geneseq Results for NOV! 17a 


Geneseq 
Identifier 


Protein/Organism/Length [Patent 
#, Date] 


NOV117a 
Residues/ 

Match 
Residues 


Similarities for 
the Matched 
Region 


Expect 
Value 


AAM79549 


Human protein SEQ ID NO 3 1 95 - 
Homo ScipicnSj 603 £ta. 
[WO200157190-A2, 09-AUG- 
2001] 


1..563 
38. .603 


563 566(99%) 


0.0 


AAM78565 


Human protein SEQ ID NO 1227 - 
Homo sapiens, 603 aa. 
[WO200157190-A2, 09-AUG- 
2001] 


— - 

1..563 
38..603 


^ — 

563/566 (99%) 
563/566 (99%) 




0.0 


ABB21767 


Protein #3766 encoded by probe for 
measuring heart cell gene 
expression - Homo sapiens, 55 1 aa. 
[WO200157274-A2, 09-AUG- 
2001] 


44..562 
10..527 


375/519 (72%) 
437/519(83%) 


0.0 


AAM69575 


Human bone marrow expressed 
probe encoded protein SEQ ID NO: 
29881 - Homo sapiens, 551 aa. 
[WO200157276-A2, 09-AUG- 
2001] 


44..562 
10..527 


375/519(72%) 
437/519 (83%) 


0.0 


AAM57172 


Human brain expressed single exon 
probe encoded protein SEQ ID NO: 
29277 - Homo sapiens, 551 aa. 
[WO200157275-A2, 09-AUG- 
2001] 


44..562 
10..527 


375/519(72%) 
437/519(83%) 


0.0 



In a BLAST search of public sequence databases, the NOV 1 17a protein was found 
to have homology to the proteins shown in the BLASTP data in Table 1 17D. 



1 Table 1 17D. Public BLASTP Results for NOV117a 


| 

Protein 
Accession 
Number 


Protein/Organism/Length 


NO VI 17a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Portion 


Expect 
Value 


| 043296 


Zinc finger protein 264 - Homo 
sapiens (Human), 627 aa. 


1..562 
43..603 


401,562 (71%) 
468/562 (82%) 


0.0 


Q96NL3 


CDNA FLJ30663 FIS, CLONE 


1..535 
38..572 


299/535 (55%) 
369/535 (68%) 


0.0 
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SIMILAR TO ZINC FINGER 
PROTEIN 84 - Homo sapiens 
(Human), 588 aa. 








Q99676 


Zinc finger protein 184 - Homo 
sapiens (Human), 75 1 aa. 


2..535 
5S..595 


261/542 (48%) 
355/542 (65%) 


e-151 


Q96SE7 


ZINC FINGER 1 1 1 1 - Homo 
sapiens (Human), 839 aa. 


151..541 
306..694 


233/391 (59%) 
281/391 (71%) 


e-148 , 


Q03923 


Zinc finger protein 85 (Zinc finger 
protein HPF4) (HTF1) - Homo 
sapiens (Human), 595 aa. 


1..535 
33..547 


266/544 (48%) 
328/544 (59%) 




e-148 | 



La PFam analysis predicts that the NOV1 1 7a protein contains the domains shown in 

g the Table 11 7E. 



Table 117E. Domain Analysis of NO VI 17a 


Pfam Domain 


NOV117a Match 
Region 


Identities/ 
Similarities 
for the Matched 
Region 


Expect 
Value 


KRAB: domain 1 of 1 


1..34 


14/66 (21%) 
24/66 (36%) 


0. 1 5 


zf-C2H2: domain 1 of 13 


162.. 184 


1 1/24 (46%) 
19/24 (79%) 


3.6e-06 


zf-C2H2: domain 2 of 13 


190..212 


1 1/24 (46%) 
19/24 (79%) 


7.1e-06 


zf-C2H2: domain 3 of 13 


218..240 


14/24 (58%) 
22/24 (92%) 


2.3e-07 


zf-BED: domain 1 of 3 


203..241 


13/52 (25%) 
25/52 (48%) 




zf-C2H2: domain 4 of 13 


246..268 


1 1/24 (46%) 
20/24 (83%) 


4.6e-05 


LIM: domain 1 of 1 


220..284 


16/72 (22%) 
50/72 (69%) 


0.69 


zf-C2H2: domain 5 of 13 


274..296 


8/24 (33%) 
18/24 (75%) 


7.6e-05 


zf-C2H2: domain 6 of 13 


302..324 


1 1/24 (46%) 
20/24 (83%) 


8.4e-05 


Zn carbOpept: domain 1 
of 1 


312..330 


5/19(26%) 
17 19(89%) 


1.2 
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zf-C2H2: domain 7 of 13 


330..352 


8/24 (33%) 

1 Q/94 flQ%\ 


9.7e-05 


zf-C2H2: domain 8 of 13 


358..380 


14/24 (58%) 

99/94 (Q1°AC\ 
ZZ/Z4 \ yL /to) 


5.3e-07 


zf-BED: domain 2 of 3 


343..381 


12/52 (23%) 
zo/jz \p\) /o) 


1.3 


zf-C2H2: domain 9 of 13 


386..408 


11/24(46%) 

ZU/ZH- ^oj /a) 


9.4e-05 


zf-C2H2: domain 10 of 13 


■ - — 

414..436 


1 1/24 (46%) 

9H/94 

IXS! Z4 (do /o) 


5e-06 


zf-C2H2: domain 11 of 13 


442..464 


12/24 (50%) 

99/94 fQ9°/A 


3e-07 


zf-BED: domain 3 of 3 


427..465 


14/52 (27%) 
27/52 (52%) 


0.38 


zf-C2H2: domain 12 of 13 


470..492 


12/24 (50%) 
19/24 (79%) 


0.00044 


zf-C2H2: domain 13 of 13 


498..520 


12/24 (50%) 
22/24 (92%) 


9.8e-07 



Example 118. 



£ The NOV1 1 8 clone was analyzed, and the nucleotide and predicted polypeptide 

jtj sequences are shown in Table 1 1 8A. 



Table 118A. NOV118 Sequence Analysis 




SEQIDNO:331 1 1899 bp 


NOV118a, 

CG59805-01 DNA Sequence 


CAAACTCTACTACCTCTATATGACATTTCAGGTGTCTGTGACCTTTGATGATGTGGCT 
GTGACTTTCACCCAGGAGGAGTGGGGCCAGCTGGACCTAGCTCAGCGGACCCTGTACC 
AGGAGGTGATGCTGGAAAACTGTGGGCTCCTGGTATCTCTGGGTGGGTGTCCTGTTCC 
CAGACCTGAGCTGATCTACCACCTAGAGCATGGGCAGGAGCCATGGACCAGGAAGGAA 
GACCTCTCCCAAGGCACCTGTCCAGGTGACAAAGGAAAACCCAAGAGCACAGAACCTA 
CCACCTGTGAGCTAGCCTTGTCTGAAGGAATCTCTTTTTGGGGACAACTAACACAAGG 
AGCTTCAGGGGACTCCCAGTTGGGGCAACCCAAGGATCAGGATGGGTTTTCAGAAATG 
CAGGGAGAACGCTTGAGACCAGGGTTAGATTCCCAAAAGGAGAAGCTTCCTGGAAAAA 




GGATCGAGTCTCCTTAGGAGATGATGTCCATGACTGTGACTCACATGGATCAGGTAAA 
AATCCAGTTATTCAGGAAGAGGAAAATATCTTTAAATGCAATGAATGTGAAAAAGTGT 
TTAACAAGAAACGCCTGCTTGCTCGGCATGAGAGGATTCACTCTGGAGTGAAGCCCTA 
TGAATGCACAGAGTGTGGAAAAACCTTTAGCAAGAGTACATACCTCCTGCAGCACCAC 
ATGGTCCACACTGGGGAGAAGCCCTATAAGTGCATGGAGTGTGGGAAGGCTTTTAATC 
GGAAGTCACACCTTACCCAGCACCAGCGGATTCACAGTGGAGAGAAGCCTTATAAGTG 
CAGTGAATGTGGAAAGGCCTTCACCCACCGCTCCACTTTTGTCTTGCATAACAGGAGC 
CACACTGGAGAAAAACCCTTTGTGTGCAAAGAGTGTGGCAAAGCCTTTCGAGATAGGC 
CAGGTTTCATTCGACACTACATCATCCACAGTGGTGAGAATCCCTACGAGTGCTTCGA 
ATGTGGCAAGGTCTTCAAACACAGATCATACCTCATGTGGCACCAGCAGACTCATACC 
GGGGAGAAGCCCTATGAGTGCAGTGAATGTGGGAAGGCCTTCTGTGAGAGCGCAGCGC 
TGATTCACCACTATGTCATCCACACTGGAGAGAAGCCCTTTGAGTGCCTCGAGTGTGG 
GAAGGCTTTCAACCACCGATCCTACCTCAAAAGGCACCAGCGGATTCACACTGGGGAG 
AAGCCATATGTGTGTAGTGAATGCGGAAAGGCCTTCACCCACTGCTCTACTTTCATCT 
TGCATAAAAGGGCCCACACTGGAGAAAAACCTTTCGAGTGCAAAGAGTGTGGGAAAGC 
CTTTAGCAATAGGGCAGACCTCATTCGCCACTTCAGCATCCACACTGGAGAGAAGCCC 
TATGAGTGCATGGAGTGTGGAAAGGCCTTCAACCGCAGGTCAGGCCTCACAAGGCACC 
AGCGGATTCATAGTGGAGAGAAGCCCTATGAATGCATCGAGTGTGGGAAAACATTTTG 
CTGGAGCACAAACCTCATTCGACACTCTATCATCCACACTGGAGAGAAGCCGTATGAG 
TGCAGTGAATGTGGAAAGGCCTTCAGTCGCAGCTCGTCCCTCACTCAGCATCAAAGGA 
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TGCATACTooGAoAAAT^L.T T 1 1"TTACAAGTGG 
GCAGACCTCAGTCAACATCCAAGAACTTTTATTGGGGAAAAACTTTTTGAATGTCACC 
ACTGAGGAAAATCTTTTGCAAGAGGAAGCATCTTACATGGCATCTGATCGTACATACC 
AAAGAGAAACCCCACAAGTGTCTTCACTGTGAGAAAACCTTCT 




OR I 7 Start: ATG at 20]ORF Stop: TGA at 1886 




SEQ ID NO: 332 J 622 aa |MW at 70677.2kD 


NOV118a, 

CG59805-01 Protein Sequence 


MTFQVSVTFDDVAVTFTQEEWGQLDLAQRTLYQE\7MLENCGLLVSLGGCPVPRPELIY 
HLEHGQEPWTRKEDLSQGTCPGDKGKPKSTEPTTCELALSEGISFWGQLTQGASGDSQ 
LGQPKDQDGFSEMQGERLRPGLDSQKEKLPGKMSPKHDGLGTADSVCSRI IQDRVSLG 
DDVHDCDSHGSGKNPVIQEEENIFKCNECEKVFNKKRLLARHERIHSGVKPYECTECG 
KTFSKSTYLLQHHMVHTGEKPYKCMECGKAFNRKSHLTQHQRIHSGEKPYKCSECGKA 
FTHRSTFVLHNRSHTGEKPFVCKECGKAFRDRPGFIRHYIIHSGENPYECFECGKVFK 
HRSYLMWHQQTHTGEKPYECSECGKAFCESAALIHHYVIHTGEKPFECLECGKAFNHR 
SYLKRHQRIHTGEKPYVCSECGKAFTHCSTFILHKRAHTGEKPFECKECGKAFSNRAD 
LIRHFSIHTGEKPYECMECGKAFNRRSGLTRHQRIHSGEKPYECIECGKTFCWSTNLI 
RHSIIHTGEKPYECSECGKAFSRSSSLTQHQRMHTGRNPISVTDVGRPFTSGQTSVNI 
fill- ;?LI rTESNLLQEEASYK DRTYQRETPCVSSL 



Further analysis of the NOV 1 18a protein yielded the following properties shown in 
Table 118B. 



Table 118B. Protein Sequence Properties NO VI 18a 


PSort 
analysis: 


0.4500 probability located in cytoplasm; 0.3796 probability located in 
microbody (peroxisome); 0.1000 probability located in mitochondrial matrix 
space; 0.1000 probability located in lysosome (lumen) 


SignalP 
analysis: 


No Known Signal Sequence Predicted 



A search of the NOV 1 18a protein against the Geneseq database, a proprietary 
database that contains sequences published in patents and patent publication, yielded 
several homologous proteins shown in Table 1 1 8C. 



Table 118C. Geneseq Results for NO VI 18a 



Geneseq 
Identifier 


Protein/Organism/Length 
[Patent #, Date] 


NOV118a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Region 


Expect 
Value 


ABB22693 


Protein #4692 encoded by probe 
for measuring heart cell gene 
expression - Homo sapiens, 468 aa. 
[WO200157274-A2, 09-AUG- 
2001] 


81..548 
1..468 


468/468 (100%) 
468/468 (100%) 


0.0 


AAM70526 


Human bone marrow expressed 
probe encoded protein SEQ ID 
NO: 30832 - Homo sapiens, 468 
aa. [WO200157276-A2, 09-AUG- 
2001] 


81. .548 
1..468 


468/468 (100%) 
468/468 (100%) 


0.0 
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AAM58080 


Human brain expressed single exon 
probe encoded protein SEQ ID 
iw. ovioj - riomo sapiens, 4oo 
aa. [WO200157275-A2, 09-AUG- 
2001] 


81..548 
1..468 


468/468(100%) 
468/468(100%) 


0.0 


AAM30843 


Peptide #4880 encoded by probe 
for measuring placental gene 
expression - Homo sspiens 468 3.£t 
[WO200157272-A2, 09-AUG- 
2001] 


81..548 
1..468 


468/468 (100%) 
468/468 (100%) 


0.0 


AAM18364 


Peptide #4798 encoded by probe 
for measuring cervical gene 
expression - Homo sapiens, 468 aa. 
[WO200157278-A2, 09-AUG- 
2001] 


81. .548 
1..468 


468/468 (100%) 
468/468 (100%) 


0.0 



In a BLAST search of public sequence databases, the NOV 1 18a protein was found 
to have homology to the proteins shown in the BLASTP data in Table 1 18D. 



I Table 118D. Public BLASTP Results for NOV118a 


Protein 
i Accession 
! Number 


Protein/Organism/Length 


NOV118a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Portion 


Expect 
Value 


| 043296 


Zinc finger protein 264 - Homo 
sapiens (Human), 627 aa. 


4..622 
11. .627 


530/619(85%) 
567/619(90%) 


0.0 


1 Q96NL3 


CDNA FLJ30663 FIS, CLONE 
FCBBF1000598, MODERATELY 
SIMILAR TO ZINC FINGER 
PROTEIN 84 - Homo sapiens 
(Human), 588 aa. 


7..572 
9..573 


334/566 (59%) 
403/566 (71%) 


0.0 


1 Q99676 


Zinc finger protein 1 84 - Homo 
sapiens (Human), 751 aa. 


2..571 
23..623 


280/604 (46%) 
377/604 (62%) 


e-160 


1 P51523 


Zinc finger protein 84 (Zinc finger 
protein HPF2) - Homo sapiens 
(Human), 738 aa. 


4..617 
5..626 


286/637 (44%) 
368/637 (56%) 


e-157 


Q9BX82 


EZFIT-RELATED PROTEIN 1 - 
Homo sapiens (Human), 626 aa. 


7..617 
14..626 


278/621 (44%) 
364/621 (57%) 


e-156 



PFam analysis predicts that the NOV1 1 8a protein contains the domains shown in 
the Table 11 8E. 
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Table 118E. Domain Analysis of NOV118a 


Pfam Domain 


NO VI 18a Match 
Region 


Identities/ 
Similarities 
for the Matched 
Region 


Expect 
Value 


KRAB: domain 1 of 1 


7..70 


41/66 (62%) 
54/66 (82%) 




2.2e-33 


zf-C2H2: domain 1 of 
13 


198..220 


1 1/24 (46%) 
17/24 (71%) 


3.9e-05 


BolA: domain 1 of 1 


161..238 


14/88(16%) 
49/88 (56%) 




3.4 


zf-C2H2: domain 2 of 
13 


226..248 


10/24(42%) 
18/24 (75%) 


6.2e-05 


zf-C2H2: domain 3 of 
13 


254..276 


14/24 (58%) 
22/24 (92%) 


5e-07 


TFIIS: domain 1 of 1 


257..292 


12/39(31%) 
21/39(54%) 


5.7 


zf-C2H2: domain 4 of 
13 


282..304 


1 i '24 (46%) 
20/24 (83%) 


3.7e-05 


LIM: domain 1 of 1 


256..320 


14/71 (20%) 
48/71 (68%) 


0.38 


zf-C2H2: domain 5 of 
13 


310..332 


8/24 (33%) 
18/24 (75%) 


7.6e-05 


zf-C2H2: domain 6 of 
13 


338..360 


1 1/24 (46%) 
19/24 (79%) 


l.le-05 


zf-C2H2: domain 7 of 
13 


366..388 


9/24 (38%) 
1 8/24 (75%) 


0.00027 


zf-C2H2: domain 8 of 
13 


394..416 


12/24 (50%) 
21/24 (88%) 


7.9e-07 


zf-C2H2: domain 9 of 
13 


422..444 


10/24(42%) 
1 9/24 (79%) 


0.00014 


zf-C2H2: domain 10 of 
13 


450..472 


10/24(42%) 
20/24 (83%) 


8.3e-06 


zf-C2H2: domain 1 1 of 
13 


478..500 


13/24 (54%) 
21/24 (88%) 


3e-07 


zf-BED: domain 1 of 1 


463..501 


14/52 (27%) 
29/52 (56%) 


0 1 


zf-C2H2: domain 12 of 
13 


506..528 


11/24(46%) 
17/24 (71%) 


0.0016 
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zf-C2H2: domain 13 of 


534..556 


13.24 (54%) 


7.2e-08 


13 




23/24 (96%) 





Example 119. 

The NOV1 19 clone was analyzed, and the nucleotide and predicted polypeptide 
sequences are shown in Table 1 19A. 



Table 119A. NO VI 19 Sequence Analysis 




CCA IT) >jrV m j ] Un 


NOV1 19a 

CG59928-01 DNA Sequence 


GCTCAGTAGGCGTCGGGCTGTGATGCCCCAACTGCTCCAGCGTCTGCAGGCGCGCGCG 


GGCGCGGTAGGCGTACTCGCTGGCCGGATAGCGCGTGATGATGAACTGGTAGGTCTGC 


GCCGCATCGACGAACAGGCTCTCGCGCTCCAGGCATTGACCGCGCAGCAGGGAAATCT 




CCGGCTGCAGGTAATTGCGTGAGCGGCTCTTGCGCTCGGCCTGCGACAGCTCCAGCGC 




AGCGAGACACGGGTGCAACCCGCAGCAACCAGGGCCACGGCCAGAATGATCAGGTTAC 


GCATGGGCAATTCCTCCAATGAGCAGTGTATCGACAGCCCAGGCAAAAACTGAACAGC 


GGCAAGCCGACGACGGTTTTTCTGGCGGCGCCTTGGCATGACGCCACTGCCTCTCATT 


TTATCAACGCCAGCGCCACGACCGCTCGTCCTCTCGAACCAGCGCTAAATCCCCTTCT 


GCGCTGACCCATATCAATGCCGTTCAGCGCAACAGGGTGTGTAATGTAGGTACAGACT 


CCAGGCGAGGACGCTGCCATGAAACTGCAACGACTGTTGGTCGTCATCGACGCCGAAC 
ACCAGCAACAACCCGCCCTGCAACGCGCAGCCGATGTGGCACGCAAGACCGGCGCCGA 
ATTGCACCTGTTGCAGATCGAATACCACCCAAGCCTGGAAAGCGGCCTGCTGGACAGC 
CATCTGCTCAACCGCGCCCGTGAAACCATCCTGCGACAGAGCCACGAGGCCCTGCGTG 
CCAGCGTCGCTCACCTGAGCGATGAAGGATTCAAGATCGCAGTGGACGTGCGCTGGGG 
CAAACGTCGTCATGAAGAAATCCTCGCCCGCGTCGCGGTGTTGCAACCGGACATCCTG 
TTCAAGTCGACTCATCCCAGCAGTGCGCTGCGCCGCCTGTTGTTCAGTGATACCAGTT 
GGCAGCTGATTCGCCGCAGCCCGGTGCCGCTGTGGCTGGTACACGACGCCGAGCCCCA 
TGGTCAGAGCCTGTGCGCTGCGCTCGACCCGCTGCACAGCGCGGACAAACCTGCCGCC 
CTCGATCATCAGTTGATTGATGCCAGCCAGACCCTGCAGGCCGAGCTCGGCTTACAGG 
CCCAATACCTGCATGCACAGGCGCCTCTGCCGCGGTCGCTGCTGTTCGACGCCGAGGT 
AGCGCAGGAATATGAAGACTACGTGACCCAGTGCAGCCGCGAGCACCGCGAAGCCTTC 
GACAAGCTGATCGCCCAGCACGCCATCGATAGAGCACAGGCCCACCTGTTGGACGGTT 
TTGCCGAGGAAGTCATCCCGCGTTTCGTGCGTGAGCACAATATAGGCCTGCTGGTGAT 
GGGCGCCATCGCCCGCGGCCATCTGGACAGCCTGCTGATCGGCCACACCGCAGAACGG 
GTGCTGGAACGTGTCGAGTGCGATCTGCTGGTGATCAAATCGCACGGCAAAGGGTAGT 
GCACAGGAACAATGACTACAGCCCGACGCCTACTGAGC 




ORF Start: ATG at 599 


ORF Stop: TAG at 1505 




SEQ ID NO: 334 


302 aa MW at 33922.3kD 


NOV 119a, 

CG59928-01 Protein Sequence 


MKLQRLLWIDAEHQQQPALQRAADVARKTGAELHLLQIEYHPSLESGLLDSHLLNRA 
RETILRQSHEALRASVAHLSDEGFKIAVDVRWGKRRHEEILARVAVLQPDILFKSTHP 
SSALRRLLFSDTSWQLIRRSPVPLWLVHDAEPHGQSLCAALDPLHSADKPAALDHQLI 
DASQTLQAELGLQAQYLHAQAPLPRSLLFDAEVAQEYEDYVTQCSREHREAFDKLIAQ 
HAIDRAQAHLLDGFAEEVIPRFVREHNIGLLVMGAIARGHLDSLLIGHTAERVLERVE 



Further analysis of the NOV1 19a protein yielded the following properties shown in 
Table 119B. 



Table 119B. Protein Sequence Properties NO VI 19a 


PSort 
analysis: 


0.3000 probability located in microbody (peroxisome); 0.3000 probability 
located in nucleus; 0.2014 probability located in lysosome (lumen); 0.1000 
probability located in mitochondrial matrix space 


SignalP 
analysis: 


No Known Signal Sequence Predicted 
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A search of the NOV 1 19a protein against the Geneseq database, a proprietary 
database that contains sequences published in patents and patent publication, yielded 
several homologous proteins shown in Table 1 19C. 



Table 119C. Geneseq Results for NOV119a 






NOV119a 


Identities/ 




Geneseq 


Protein/Organism/Length 


Residues/ 


Similarities for 


Expect 


Identifier 


[Patent #, Date] 


Match 


the Matched 


Value 






Residues 


Region 




No Significant Matches Found 



In a BLAST search of public sequence databases, the NOV1 19a protein was found 
to have homology to the proteins shown in the BLASTP data in Table 1 19D. 



Table 1 1<>I>. Public BLASTP Results for NOV! 19a 


| 

| Protein 
| Accession 
1 Number 


Protein/Organism/Length 


NO VI 19a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Portion 


Expect 
Value 


Q9HW73 


HYPOTHETICAL PROTEIN 
PA4328 - Pseudomonas 
aeruginosa, 304 aa. 


1..297 
1..299 


156/299 (52%) 
200/299 (66%) 




Q9KS28 


HYPOTHETICAL PROTEIN 
VC1433 - Vibrio cholerae, 315 aa. 


5..300 
6..304 


78/302 (25%) 
147/302 (47%) 


h— H 

4e-29 

-J 


CAC91106 


PUTATIVE STRESS PROTEIN - 
Yersinia pestis, 3 1 8 aa. 


2..300 
3..303 


93/310 (30%) 
137/310(44%) 


2e-28 


AAL20579 


PI IATI\ E I NIVERSAI 
STRESS PROTEIN - Salmonella 
typhimurium LT2, 3 1 5 aa. 


4..297 
5..300 


91/305 (29%) 
139/305 (44%) 


2e-28 


CAD01669 


CONSERVED HYPOTHETICAL 
PROTEIN - Salmonella enterica 
subsp. enterica serovar Typhi, 315 
aa. 


4..297 
5..300 


91/305 (29%) 
139/305 (44%) 


3e-28 



PFam analysis predicts that the NOV 1 19a protein contains the domains shown in 
the Table 11 9E. 
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Table 119E. Domain Analysis of NO VI 19a 


Pfam Domain 


NOV119a Match Region 


Identities/ 
Similarities 
for the Matched Region 


Expect Value 


Usp: domain 1 of 2 


2..144 


28/153 (18%) 
92/153 (60%) 


0.0014 


Usp: domain 2 of 2 


160..297 


28/153 (18%) 
88/153 (58%) 


0.013 



Example 120. 

The NOV 120 clone was analyzed, and the nucleotide and predicted polypeptide 
sequences are shown in Table 120A. 



Table 120A. NOV120 Sequence Analysis 




SEQ ID NO: 335 |2202 bp 


NOV 120a, 

CG59947-01 DNA Sequence 


CACCCTCCCGCCCCGCCCCCCGTCCAATGCTGAGCTCAGTCTGCGTCTCGTCCTTCCG 
CGGGCGCCAGGGGGCCAGCAAGCAGCAGCCGGCGCCACCGCCGCAGCCGCCCGAGGTC 
CCCGGTGGCGACAGCGGCAAGATCGTGATCAACGTGGGCGGCGTGCGCCATGAGACGT 
ACCGCTCGACGCTGCGCACCCTGCCGGGGACGCGGCTGGCCGGCCTGACGGAGCCCGA 

CCGGGAGTCTTCGCGTACGTGCTCAACTACTACCGCACCGGCAAGCTGCACTGCCCAG 
CCGACGTGTGCGGGCCCCTGTTTGAGGAGGAGCTCGGCTTCTGGGGCATCGACGAGAC 
CGACGTGGAGGCCTGCTGCTGGATGACCTACCGGCAGCATCGCGACGCTGAGGAGGCG 
CTCGACTCCTTCGAGGCGCCCGACCCCGCGGGCGCCGCCAACGCCGCCAACGCCGCAG 
GCGCCCACGACGGAGGCCTGGACGACGAGGCGGGCGCGGGCGGCGGCGGCCTGGACGG 
AGCGGGCGGCGAGCTCAAGCGCCTCTGCTTCCAGGACGCGGGCGGCGGCGCCGGGGGG 
CCGCCAGGGGGCGCGGGCGGCGCGGGCGGCACATGGTGGCGCCGCTGGCAGCCCCGCG 
TGTGGGCGCTCTTCGAGGACCCCTACTCGTCGCGGGCTGCCAGGTATGTGGCCTTCGC 
CTCCCTCTTCTTCATCCTCATCTCCATCACCACCTTCTGCCTGGAAACCCATGAGGGC 
TTCATCCATATTAGCAACAAGACGGTGACCCAGGCCTCCCCGATCCCCGGGGCACCTC 
CGGAGAACATCACCAACGTGGAGGTGGAGACGGAGCCCTTCCTGACCTACGTGGAGGG 
GGTGTGCGTGGTCTGGTTCACCTTCGAGTTCCTCATGCGCATCACCTTCTGCCCAGAC 
AAGGTGGAGTTTCTTAAAAGCAGCCTCAACATCATCGACTGTGTGGCCATCCTGCCCT 
TCTATCTCGAGGTGGGCCTCTCGGGCCTCAGCTCCAAGGCCGCCAAAGACGTGCTGGG 
CTTCCTGCGGGTGGTCCGCTTCGTCCGCATCCTGCGCATCTTCAAGCTGACCCGGCAC 
TTCGTGGGGCTGCGCGTGCTGGGACACACGCTCCGCGCCAGCACCAACGAGTTCCTGC 
TGCTCATCATCTTCCTGGCCCTGGGGGTGCTCATCTTCGCCACCATGATTTACTACGC 
TGAGCGCATTGGCGCCGACCCCGATGACATCCTGGGCTCCAACCACACCTACTTCAAG 
AACATCCCCATTGGCTTCTGGTGGGCTGTGGTCACCATGACGACCCTGGGCTATGGAG 
ACATGTACCCCAAGACGTGGTCGGGGATGCTGGTCGGGGCGCTGTGTGCCCTGGCGGG 
GGTGCTGACCATCGCCATGCCTGTGCCCGTCATTGTCAACAACTTTGGCATGTACTAT 
TCGCTGGCCATGGCCAAGCAGAAGCTGCCCAAGAAGAAGAACAAACACATCCCCCGGC 
CCCCGCAACCGGGCTCGCCCAACTACTGCAAGCCTGACCCACCCCCGCCACCCCCGCC 
CCACCCGCACCACGGCAGCGGGGGCATCAGCCCGCCGCCACCCATCACCCCACCCTCC 
ATGGGGGTGACTGTGGCCGGGGCCTACCCAGCGGGGCCCCACACGCACCCCGGGCTGC 
TCAGGGGGGGAGCGGGTGGGCTGGGGATCATGGGGCTGCCTCCTCTGCCAGCCCCCGG 
CGAGCCTTGCCCGTTGGCTCAGGAGGAGGTGATTGAGATCAACCGGGCAGATCCTCGC 
CCCAATGGGGATCCGGCAGCAGCTGCGCTTGCCCACGAGGACTGCCCAGCCATTGACC 
AGCCTGCCATGTCCCCGGAAGACAAGAGCCCCATCACGCCTGGAAGCCGTGGCCGCTA 
TAGCCGGGACCGAGCCTGCTTCCTCCTCACCGACTATGCCCCTTCCCCTGATGGCTCC 
ATCCGAAAAGCCACTGGTGCTCCCCCACTGCCCCCCCAAGACTGGCGTAAGCCAGGCC 
CCCCAAGCTTCTTGCCCGACCTCAACGCCAACGCCGCGGCCTGGATATCCCCCTAGTG 
GACGAACCCCCT"" Z CCi i 5' fCTTGTCACCGCCTGAGACCT 3 Ai A :i 




ORF Start: ATG at 27jORF Stop: TAG at 2142 




SEQ ID NO: 336 |705 aa |MW at 75590.5kD 


NOV 120a, 


MLSSVCVSSFRGRQGASKQQPAPPPQPPEVPGGDSGKIVINVGGVRHETYRSTLRTLP 
GTRLAGLTEPEAAARFDYDPGADEFFFDRHPGVFAYVLNYYRTGKLHCPADVCGPLFE 
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CG59947-01 Protein Sequence 



EELGFWGIDETDVEACCWMTYRQHRDAEEALDSFEAPDPAGAANAANAAGAHDGGLDD 
EAGAGGGGIJ3GAGGELKRLCFQDAGGGAGGPPGGAGGAGGTWWRRWQPRVWALFEDPY 
SSRAARYVAFASLFFILISITTFCLETHEGFIHISNKTVTQASPIPGAPPENITNVEV 
ETEPFLTYVEGVCWWFTFEFLMRITFCPDKVEFLKSSLNIIDCVAILPFYLEVGLSG 
LSSKAAKDVLGFLRWRFVRILRIFKLTRHFVGLRVLGHTLRASTNEFLLLIIFLALG 
VLIFATMIYYAERIGADPDDILGSNHTYFKNIPIGFWWAWTMTTLGYGDMYPKTWSG 
MLVGALCALAGVLTIAMPVPVIVNNFGMYYSLAMAKQKLPKKKNKHIPRPPQPGSPNY 
CKPDPPPPPPPHPHHGSGGISPPPPITPPSMGVTVAGAYPAGPHTHPGLLRGGAGGLG 
IMGLPPLPAPGEPCPLAQEEVIEINRADPRPNGDPAAAALAHEDCPAIDQPAMSPEDK 
SPITPGSRGRYSRDRACFLLTDYAPSPDGSIRKATGAPPLPPQDWRKPGPPSFLPDLN 
ANAAAWISP 



Further analysis of the NOV 120a protein yielded the following properties shown in 
Table 120B. 



Table 120B. Protein Sequence Properties NOV120a I 


PSort 
analysis: 


0.6000 probability located in plasma membrane; 0.5071 probability located in 
mitochondrial inner membrane; 0.4000 probability located in Golgi body; 
0.3000 probability located in endoplasmic reticulum (membrane) 


SignalP 
analysis: 


No Known Signal Sequence Predicted 



A search of the NOV 120a protein against the Geneseq database, a proprietary 
database that contains sequences published in patents and patent publication, yielded 
several homologous proteins shown in Table 120C. 



Table 120C. Geneseq Results for NOV120a 





Geneseq 
Identifier 


Protein/Organism/Length 
[Patent #, Date] 


NOV120a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Region 


Expect 
Value 


AAY34120 


Human potassium channel 
K+Hnov4 - Homo sapiens, 601 aa. 
[W09943696-A1, 02-SEP-1999] 


32..526 
4.. 476 


371/510(72%) 
399/510(77%) 


0.0 


AAY32016 


Caenorhabditis elegans cation 
channel protein - Caenorhabditis 
elegans, 556 aa. [W09947923-A2, 
23-SEP-1999] 


33..512 
27..465 


217/486 (44%) 
300/486 (61%) 


e-113 


AAB86319 


Human Kv4.2 protein - Homo 
sapiens, 629 aa. [DE19963612-A1, 
12-JUL-2001] 


16..521 
22..441 


173/511 (33%) 
256/511 (49%) 


5e-69 


AAY13523 


Amino acid sequence of KV4.2FL 
ion channel protein - Mammalia, 
630 aa. [WO9923880-A1, 20- 
MAY-1999] 


16..521 
23. .442 


173/511 (33%) 
257/511 (49%) 


8e-68 
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AAW42996 


Putative mature potassium channel 
2 protein - Homo sapiens, 494 aa. 
[US5710019-A, 20-JAN-1998] 


17..510 1 171/503 (33%) 
4..42S | 240/503 (46%) 


2e-66 


In a BLAST search of public sequence databases, the NOV 120a protein was found 
to have homology to the proteins shown in the BLASTP data in Table 120D. 


Table 120D. Public BLASTP Results for NOV120a 


Accession 
Number 


Protein/Organism/Length 


NOV120a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Portion 


Expect 
Value 


Q 14003 


Voltage-gated potassium channel 
protein Kv3.3 (KSHIIID) - Homo 
sapiens (Human), 757 aa. 


1..705 
1..757 


704/757 (92%) 
704/757 (92%) 


0.0 


Q01956 


Voltage-gated potassium channel 
protein Kv3.3 (KSHIIID) - Rattus 
norvegicus (Rat), 889 aa. 


1..693 
1..756 


663/757 (87%) 
668/757 (87%) 


0.0 


Q63959 


Voltage-gated potassium channel 
protein Kv3.3 (KSHIIID) - Mus 
musculus (Mouse), 769 aa. 


1..671 
1..724 


650 725 (89%) 
653/725 (89%) 


0.0 


A42073 


potassium channel protein Kv3.3 - 
mouse, 679 aa. 


32..607 
8..581 


557/576 (96%) 
559/576 (96%) 


0.0 


Q9PVD1 


KV3.1 POTASSIUM CHANNEL 
- Xenopus laevis (African clawed 
frog), 592 aa. 


34..671 
6..547 


441/640 (68%) 
479/640 (73%) 


0.0 



PFam analysis predicts that the NOV 120a protein contains the domains shown in 
the Table 120E. 



Table 120E. Domain Analysis of NOV120a 



Pfam Domain 


NOV120a Match 
Region 


Identities/ 
Similarities 
for the Matched 
Region 


Expect 
Value 


K tetra: domain 1 of 1 


36..137 


50/112(45%) 
86/112(77%) 


1.6e-47 


thaumatin: domain 1 of 
1 


314..319 


4/6 (67%) 
6/6(100%) 


0.7 
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ion trans: domain 1 of 


295..486 


51/231 (22%) 


1 




155/231 (67%) j 



Example 121. 

The NOV121 clone was analyzed, and the nucleotide and predicted polypeptide 
sequences are shown in Table 121A. 



Table 121A. NOV121 Sequence Analysis 




SEQIDNO:337 [ 1943 bp 


NOV121a, 

[CG5 993 8-01 DNA Sequence 


AGATCCACGTGATCTCCAAAGACCCCTGTTGTGTTGTGTTGGGAGGTGGATCCTGAAT 


^AAATCCCTGCTTGCTTTCCAGGAGACCCTTGG 


TCTTCATGTCTTTGGTGTGTGCACTCTTGAACACATGCCAGGCACACAGGGTGCATGA 
CGACAAGCCTAATATTGTCCTAATCATGGTTGATGACCTGGGTATTGGAGATCTGGGC 
TGCTACGGCAATGACACCATGAGGACGCCTCACATCGACCGCCTTGCCAGGGAAGGCG 

CTTGACGGGAAGATACCCCATCCGATCAGGTATGGTTTCTAGTGGTAATAGACGTGTC 
ATCCAAAATCTTGCAGTCCCCGCAGGCCTCCCTCTTAATGAGACAACACTTGCAGCCT 
TGCTAAAGAAGCAAGGATACAGCACGGGGCTTATAGGTAAGTTAGGCAAATGGCACCT 
GGGTTTGAGCTGCGCCTCTCGGAATGATCACTGTTACCACCCGCTCAACCATGGTTTT 
CACTACTTTTACGGGGTGCCTTTTGGACTTTTAAGCGACTGCCAGGCATCCAAGACAC 
CAGAACTGCACCGCTGGCTCAGGATCAAACTGTGGATCTCCACGGTAGCCCTTGCCCT 
GGTTCCTTTTCTGCTTCTCATTCCCAAGTTCGCCCGCTGGTTCTCAGTGCCATGGAAG 
GTCATCTTTGTCTTTGCTCTCCTCGCCTTTCTGTTTTTCACTTCCTGGTACTCTAGTT 
ATGGATTTACTCGACGTTGGAATTGCATCCTTATGAGGAACCATGAAATTATCCAGCA 
GCCAATGAAAGAGGAGAAAGTAGCTTCCCTCATGCTGAAGGAGGCACTTGCTTTCATT 
GAAAGGTACAAAAGGGAACCTTTTCTCCTCTTTTTTTCCTTCCTGCACGTACATACTC 
CACTCATCTCCAAAAAGAAGTTTGTTGGGCGCAGTAAATATGGCAGGTATGGGGACAA 
TGTAGAAGAAATGGATTGGATGGTGGGTGGTAAAATCCTGGATGCCCTGGACCAGGAG 
CGCCTGGCCAACCACACCTTGGTGTACTTCACCTCTGACAACGGGGGCCACCTGGAGC 
CCCTGGACGGGGCTGTTCAGCTGGGTGGCTGGAACGGGATCTACAAAGGTGGCAAAGG 

TTGGAGGCTGGGAGAGTGATCAATGAGCCCACCAGCTTAATGGACATCTATCCGACGC 
TGTCTTATATAGGCGGAGGGATCTTGTCCCAGGACAGAGTGATTGACGGCCAGAACCT 
AATGCCCCTGCTGGAAGGAAGGGCGTCCCACTCCGACCACGAGTTCCTCTTCCACTAC 
TGTGGGGTCTATCTGCACACGGTCAGGTGGCATCAGAAGGACACTGTGTGGAAAGCTC 
ATTATGTGACTCCTAAATTCTACCCTGAAGGAACAGGTGCCTGCTATGGGAGTGGAAT 
ATGTTCATGTTCGGGGGATGTAACCTACCACGACCCACCACTCCTCTTTGACATCTCA 
AGAGACCCTTCAGAAGCCCTTCCACTGAACCCTGACAATGAGCCATTATTTGACTCCG 
TGATCAAAAAGATGGAGGCAGCCATAAGAGAGCATCGTAGGACACTAACACCTGTCCC 
ACAGCAGTTCTCTGTGTTCAACACAATTTGGAAACCATGGCTGCAGCCTTGCTGTGGG 
ACCTTCCCCTTCTGTGGGTGTGACAAGGAAGATGACATCCTTCCCATGGCTCCCTGAG 
ACCATGCGGACCACGTGTTACCCACCACAAACTTACTGTTACAATGGTCATAGGAGCA 






( >RF Start: A K i at 1 22 ( )RF Stop: TGA at 1853 




SEQ ID NO: 338 1 577 aa f MW at 65099.5kD 


NOV121a, 

CG59938-01 Protein Sequence 


MSLVCALLNTCQAHRVHDDKPNIVLIMVDDLGIGDLGCYGNDTMRTPHIDRLAREGVR 
LTQHI SAASLCSPSRSAFLTGRYPI RSGMVSSGNRRVI QNLAVPAGLPLNETTLAALL 
KKQGYSTGLIGKLGKWHLGLSCASRNDHCYHPLNHGFHYFYGVPFGLLSDCQASKTPE 
LHRWLRIKLWISTVALALVPFLLLIPKFARWFSVPWKVIFVFALLAFLFFTSWYSSYG 
FTRRWNCILMRNHEIIQQPMKEEKVASLMLKEALAFIERYKREPFLLFFSFLHVHTPL 
ISKKKFVGRSKYGRYGDNVEEMDWTWGGKILDALDQERLANHTLVYFTSDNGGHLEPL 
DGAVQLGGWNGIYKGGKGMGGWEGGIRVPGIFRWPSVLEAGRVINEPTSLMDIYPTLS 
YIGGGILSQDRVIDGQNLMPLLEGRASHSDHEFLFHYCGVYLHTVRWHQKDTVWKAHY 
VTPKFYPEGTGACYGSGICSCSGDVTYHDPPLLFDISRDPSEALPLNPDNEPLFDSVI 
KKMEAAIREHRRTLTPVPQQFSVFNTIWKPWLQPCCGTFPFCGCDKEDDILPMAP 



Further analysis of the NOV121a protein yielded the following properties shown in 
Table 12 IB. 
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Table 121B. Protein Sequence Properties NOV121a 


PSort 
analysis: 


0.6400 probability located in plasma membrane; 0.4600 probability located in 
Golgi body; 0.3700 probability located in endoplasmic reticulum (membrane); 
0.1000 probability located in endoplasmic reticulum (lumen) 


SignalP 
analysis: 


No Known Signal Sequence Predicted 



A search of the NOV121a protein against the Geneseq database, a proprietary 
database that contains sequences published in patents and patent publication, yielded 
several homologous proteins shown in Table 121C. 



Table 121C. Geneseq Results for NOV121a 


Geneseq j Protein/Organism/Length 
Identifier [Patent #, Date] 


NOV121a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Region 


Expect 
Value 


AAM78688 


Human protein SEQ ID NO 1350 - 
Homo sapiens, 590 aa. 
[WO200157190-A2, 09-AUG- 
2001] 


1..572 
10..580 


388/576 (67%) 
449/576 (77%) 


0.0 


AAM39343 


Human polypeptide SEQ ID NO 
2488 - Homo sapiens, 589 aa. 
[WO200153312-A1, 26-JUL- 
2001] 


20 .571 
37..587 


331/555 (59%) 1 0.0 
404/555 (72%) 

j 


AAM41129 


Human polypeptide SEQ ID NO 
6060 - Homo sapiens, 646 aa. 
[WO200153312-A1, 26-JUL- 
2001] 


20..571 
94..644 


331/555 (59%) 
404/555 (72%) 


0.0 


AAY39920 


Human steroid sulphatase protein 
sequence - Homo sapiens, 583 aa. 
[WO9950453-A1, 07-OCT-1999] 


20..569 
26..575 


295/559 (52%) 
374/559 (66%) 


e-166 


AAB51185 


Human sulfatase protein C SEQ ID 
NO: 14 - Homo sapiens, 583 aa. 
[US6153188-A, 28-NOV-2000] 


20..569 
26..575 


294/559 (52%) 
372/559 (65%) 


e-165 



In a BLAST search of public sequence databases, the NOV121a protein was found 
to have homology to the proteins shown in the BLASTP data in Table 121D. 
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Table 121D. Public BLASTP Results for NOV121a 


Protein 
Accession 
Number 


Protein/Organism/Length 


NOV121a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Portion 


Expect 
Value 


P54793 


Arylsulfatase F precursor (EC 
3.1.6.-) (ASF) - Homo sapiens 
(Human), 591 aa. 


1..572 
10..581 


379/577 (65%) 
441/577 (75%) 


0.0 


AAH20229 


HYPOTHETICAL 64.9 KDA 
PROTEIN - Homo sapiens 
(Human), 593 aa. 


4..574 
24..593 


358/574 (62%) 
440/574 (76%) 


0.0 | 


P51689 


Arylsulfatase D precursor (EC 
3.1.6.-) (ASD) - Homo sapiens 
(Human), 593 aa. 


4..574 
24.. 593 


349/574 (60%) 
429/574 (73%) 


0.0 i 

1 


P51690 


Arylsulfatase E precursor (EC 
3.1.6.-) (ASE) - Homo sapiens 
(Human), 589 aa. 


20..571 
37..587 


334/555 (60%) 
405/555 (72%) 


0.0 


P08842 


Steryl-sulfatase precursor (EC 
3.1.6.2) (Steroid sulfatase) (Steryl- 
sulfate sulfohydrolase) 
(Arylsulfatase C) (ASC) - Homo 
sapiens (Human), 583 aa. 


20..569 
26..575 


295/559 (52%) 
374/559 (66%) 


e-166 



PFam analysis predicts that the NOV121a protein contains the domains shown in 
the Table 121E. 



Table 121 E. Domain Analysis of NOV1 21a 


Pfam Domain 


NOV121a Match 
Region 


Identities/ 
Similarities 
for the Matched 
Region 


Expect 
Value 


Sulfatase: domain 1 of 
1 


21. .504 


231/530(44%) 
410/530 (77%) 


le-187 



Example 122. 



The NOV 122 clone was analyzed, and the nucleotide and predicted polypeptide 
sequences are shown in Table 122A. 
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Table 122A. NOV122 Sequence Analysis 




SEQIDNO:339 (3005 bp 


NOV122a, 

CG5 9746-01 DNA Sequence 


ATTTCTTTGGTGTTGTCTTCACAGCTGAACTTGCAAAACAGATTGGAACTTCAAGATT 


ATCAATAATCGGAGATACGTATATTTTATTTGTAAAGAAAACATGGCTGCCCTATTCC 


TACGTGGTTTTGTCCAAATAGGGAACTGCAAGACTGGGATATCTAAGTCAAAAGAAGC 
ATTCATTGAAGCAGTGGAAAGAAAGAAGAAAGATAGACTGGTGCTGTATTTCAAAAGT 
GGAAAATATAGCACTTTTCGGCTAAGTGATAATATTCAAAATGTAGTCCTTAAATCCT 
ATAGAGGAAACCAAAATCACCTGCATTTAACTTTACAAAATAATAATGGCTTGTTTAT 
TGAAGGATTATCCTCCACAGATGCTGAACAATTGAAGATATTCTTGGACAGAGTTCAT 

CAACACAGAAGGAAATCAACAAAACTTCATTCCACAAAGTTGATGAGAAATCAAGTAG 
CAAATCTTTTGAGATAGCAAAAGGAAGTGGGACAGGTGTCCTTCAGAGGATGCCTTTG 
CTTACATCAAAATTGACACTTACTTGCGGAGAGTTATCAGAAAATCAGCACAAGAAGA 
GGAAAAGAATGCTCTCATCTAGCTCAGAGATGAATGAGGAATTCTTGAAAGAAAATAA 
TTCTGTAGAATACAAGAAATCCAAGGCAGATTGTTCGAGGTGTGTAAGCTATAATCGA 
GAGAAACAATTGAAGTTAAAAGAGTTAGAAGAGAATAAGAAATTGGAATGTGAATCTT 
CATGCATCATGAACGCCACTGGAAATCCTTACCTAGATGACATTGGTCTTCTCCAAGC 
TCTCACTGAGAAAATGGTTTTGGTATTTCTGTTACAACAAGGGTATAGTGACGGTTAC 
ACAAAGTGGGATAAATTAAAACTATTTTTTGAATTATTTCCAGAGAAAATATGCCACG 
GCCTCCCCAATTTGGGAAACACCTGTTATATGAATGCAGTGTTACAGTCTCTACTTTC 
AATCCCATCGTTTGCTGATGATTTACTTAATCAGAGTTTCCCATGGGGTAAAATTCCC 
CTTAATGCTCTTACCATGTGCTTGGCACGGCTACTTTTTTTTAAAGATACCTATAATA 
TAGAAATCAAGGAGATGTTACTCTTGAATCTTAAAAAGGCCATTTCAGCAGCTGCAGA 
GATATTCCATGGCAATGCACAGAACGATGCTCATGAGTTTTTAGCTCACTGTTTAGAT 
CAACTGAAAGATAACATGGAAAAACTCAACACAATTTGGAAGCCTAAAAGTGAATTTG 
GGGAAGATAATTTTCCTAAACAGGTTTTTGCTGATGATCCTGACACCAGTGGGTTTTC 
TTGCCCTGTCATTACTAATTTTGAGTTAGAGTTGTTGCACTCCATTGCTTGTAAAGCT 
TGTGGTCAGGTTATTCTCAAGACAGAACTGAATAATTACCTCTCCATCAACCTTCCCC 
AAAGAATAAAAGCACATCCTTCATCTATTCAGTCTACTTTTGATCTTTTTTTTGGAGC 
AGAAGAGCTTGAGTATAAATGTGCAAAATGTGAGCACAAGACTTCCGTTGGAGTGCAC 
TCATTCAGTAGGCTACCTAGAATCCTTATTGTTCACCTCAAACGCTATAGCTTGAATG 
AGTTTTGTGCATTAAAGAAGAATGACCAGGAAGTCATCATTTCCAAATATTTAAAGGT 
GTCTTCTCATTGCAATGAAGGCACCAGACCACCTCTTCCCTTGAGTGAGGATGGAGAA 
ATTACAGATTTCCAATTATTAAAAGTTATTCGAAAGATGACTTCTGGAAACATCAGTG 
TATCATGGCCTGCAACAAAGGAATCCAAAGATATCCTGGCTCCACACATTGGATCAGA 
TAAGGAGTCTGAACAAAAAAAAGGCCAGACAGTCTTTAAAGGGGCAAGCAGAAGACAG 
CAGCAAAAGTACCTTGGAAAAAATTCTAAACCAAATGAGCTAGAATCTGTATACTCAG 
GAGATCGAGCATTCATTGAAAAAGAACCGTTAGCTCACTTAATGACGTATCTGGAAGA 
TACCTCACTTTGTCAGTTCCACAAAGCTGGAGGTAAACCTGCCAGCAGCCCAGGCACA 
CCTCTCTCAAAAGTTGACTTTCAAACAGTGCCCGAAAATCCAAAACGAAAGAAATATG 
TGAAAACCAGTAAGTTTGTAGCTTTTGATAGGATTATCAATCCTACTAAAGATTTGTA 
TGAAGATAAAAATATCAGAATTCCAGAAAGATTCCAAAAAGTGTCTGAACAGACTCAG 
CAGTGTGACGGTATGAGAATCTGTGAACAAGCCCCTCAGCAGGCACTGCCTCAAAGCT 
TTCCAAAGCCAGGCACCCAGGGGCACACAAAGAACCTCCTAAGACCTACAAAATTAAA 
TCTACAGAAGTCTAACAGGAATTCCCTACTTGCACTGGGTTCCAATAAGAATCCAAGA 
AACAAAGACATTTTAGATAAGATAAAATCTAAAGCCAAGGAAACAAAAAGAAATGATG 
ATAAGGGAGATCATACCTACCGGCTCATTAGTGTTGTCAGCCATCTTGGGAAGACTCT 
AAAGTCAGGCCATTATATCTGTGATGCCTATGACTTTGAGAAACAGATCTGGTTCACT 
TACGATGATATGCGGGTGTTAGGTATCCAGGAGGCCCAGATGCAGGAGGATAGGCGTT 
GCACTGGGTACATCTTCTTTTACATGCATAATGAGATCTTTGAAGAGATGTTGAAAAG 
AGAAGAGAATGCCCAGCTTAATAGCAAGGAGGTAGAGGAGACCCTTCAGAAGGAATAA 
GAGGAACGTACTCCTCCTTGTACAGATCTGCCTGACTGTCTCACTCGATACCACTTCC 


TCCATGGAAGGAAAACTGTGAACTTTATCCAGAGATGAAAATGCAATTAGTCTAGGAC 
i e 31 :m ■ i * - AA ' - i - a 1 1 4 fGi 1 : ~ AT 




ORF Start: ATG at 
101 


ORF Stop: TAA at 2840 




SEQ ID NO: 340 


913 aa |MW at 104046.0kD 


NOV122a, 
CG59746-01 Protein 
Sequence 


MAALFLRGFVQIGNCKTG ISKSKSAFI EAVERKKKDRLVLYFKSGKYSTFRLSDNI QN 
WLKSYRGNQNHLHLTLQNNNGLFIEGLSSTDAEQLKIFLDRVHQNEVQPPVRPGKGG 
SVFSSTTQKEINKTSFHKVDEKSSSKSFEIAKGSGTGVLQRMPLLTSKLTLTCGELSE 
NQHKKRKRMLSSSSEMNEEFLKENNSVEYKKSKADCSRCVSYNREKQLKLKELEENKK 
LECESSCIMNATGNPYLDDIGLLQALTEKMVLVFLLQQGYSDGYTKWDKLKLFFELFP 
EKICHGLPNLGNTCYMNAVLQSLLSIPSFADDLLNQSFPWGKIPLNALTMCLARLLFF 
KDTYNI E I KEMLLLNLKKAI SAAAE I FHGNAQNDAHEFLAHCLDQLKDHMEKLNT I WK 
PKSEFGEDNFPKQVFADDPDTSGFSCPVITNFELELLHSIACKACGQVILKTELNNYL 
S INLPQRIKAHPSSI QSTFDLFFGAEELEYKCAKCEHKTSVGVHSFSRLPRI LIVKLK 
RYSLNEFCALKKNDQEVIISKYLKVSSHCNEGTRPPLPLSEDGEITDFQLLKVIRKMT 
ill )I F 1 l 
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ESVYSGDRAFIEKEPLAHLMTYLEDTSLCQFHKAGGKPASSPGTPLSKVDFQTVPENP 
KRKKWKTSKFVAFDRIINPTKDLYEDKNIRIPERFQKVSEQTQQCDGMRICEQAPQQ 
ALPQSFPKPGTQGHTKNLLRPTKLNLQKSNRNSLLALGSNKNPRNKDILDKIKSKAKE 
TKRNDDKGDHTYRLISWSHLGKTLKSGHYICDAYDFEKQIWFTYDDMRVLGIQEAQM 
1 t EE 1LI EEN 1 EVEET] 



Further analysis of the NOV 122a protein yielded the following properties shown in 
Table 122B. 



Table 122B. Protein Sequence Properties NOV122a 

PSort 0.7000 probability located in nucleus; 0.4270 probability located in 
analysis: mitochondrial matrix space; 0.3000 probability located in microbody 

(peroxisome); 0.1047 probability located in mitochondrial inner membrane 

SignalP Likely cleavage site between residues 1 6 and 1 7 
analysis: 



A search of the NOV122a protein against the Geneseq database, a proprietary 
database that contains sequences published in patents and patent publication, yielded 
several homologous proteins shown in Table 122C. 



Table 122C. Geneseq Results for NOV122a 


Geneseq 
Identifier 


Protein/Organism/Length 
[Patent #, Date] 


NOV122a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Region 


Expect 
Value 


AAU07888 


Polypeptide sequence for human 
hspG25 - Homo sapiens, 913 aa. 
[WO200166752-A2, 13-SEP-2001] 


1..913 
1..913 


913/913 (100%) 
913/913 (100%) 


0.0 


AAB75607 


Human cancer associated antigen 
precursor HOM-TES-84/6 SEQ ID 
NO: 6 - Homo sapiens, 912 aa. 
[WO200100874-A2, 04-JAN-2001] 


1..905 
1..904 


429/920 (46%) 
566/920 (60%) 


0.0 


AAU07869 


Polypeptide sequence for 
mammalian Spg25 - Mammalia, 
835 aa. [WO200166752-A2, 13- 
SEP-2001] 


1..904 
1..834 


335/921 (36%) 
504/921 (54%) 


e-147 


AAG75460 


Human colon cancer antigen 
protein SEQ ID NO: 6224 - Homo 
sapiens, 109 aa. [WO200 122920- 
A2, 05-APR-2001] 


810..912 
3..107 


61/105 (58%) 
79/105 (75%) 


3e-28 


AAB39364 


Gene 8 human secreted protein 
homologous amino acid sequence 
#113 - Bos taurus, 64 aa. 
[WO200057903-A2, 05-OCT- 
2000] 


810..871 
1..64 


39/64 (60%) 
48/64 (74%) 


5e-15 
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In a BLAST search of public sequence databases, the NOV 122a protein was found 
to have homology to the proteins shown in the BLASTP data in Table 122D. 



Table 122D. Public BLASTP Results for NOV122a 


Protein 
Accession 
Number 


Protein/Organism/Length 


NOV122a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Portion 


Expect 
Value 


Q9BXU7 


Ubiquitin carboxyl-terminal 
hydrolase 26 (EC 3.1.2.15) 
(Ubiquitin thiolesterase 26) 
(Ubiquitin-specific processing 
protease 26) (Deubiquitinating 
enzyme 26) - Homo sapiens 
(Human), 913 aa. 


1..913 
1..913 


913/913(100%) 
913/913 (100%) 


0.0 


Q9HBJ7 1 UBIQUITIN-SPECIFIC 

j PROCESSING PROTEASE - Homo 
sapiens (Human), 922 aa. 


1..905 
1..904 


429/920 (46%) 
566/920 (60%) 


0.0 


Q9HCH8 


KIAA1594 PROTEIN - Homo 
sapiens (Human), 93 1 aa (fragment). 


50..912 
3..929 


393/932(42%) 
535/932(57%) 




e-171 


Q99MX1 


Ubiquitin carboxyl-terminal 
hydrolase 26 (EC 3.1.2.15) 
(Ubiquitin thiolesterase 26) 
(Ubiquitin-specific processing 
protease 26) (Deubiquitinating 
enzyme 26) - Mus musculus 
(Mouse), 835 aa. 


1..904 
1..834 


335/921 (36%) 
504/921 (54%) 


e-147 


Q9ES63 


UBIQUITIN-SPECIFIC 
PROCESSING PROTEASE - Mus 
musculus (Mouse), 869 aa. 


1..908 
1..848 


341/933 (36%) 
480/933 (50%) 


e-131 , 



PFam analysis predicts that the NOV 122a protein contains the domains shown in 
the Table 122E. 



Table 122E. Domain Analysis of NOV122a 


Pfam Domain 


NOV122a Match 
Region 


Identities/ 
Similarities 
for the Matched 
Region 


Expect 
Value 


UCH-1: domain 1 of 
1 


295.326 


21/32 (66%) 
29/32 (91%) 


8.8e-12 


UCH-2: domain 1 of 
1 


820..885 


20/72 (28%) 
47/72 (65%) 


2.2e-ll 
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Example 123. 

The NOV 123 clone was analyzed, and the nucleotide and predicted polypeptide 
sequences are shown in Table 123 A. 



Table 123A. NOV 123 Sequence Analysis 




SEQIDNO:341 |2146bp 


NOV123a, 

CG88613-01 DNA Sequence 


GAAGGAGCGGGCATGAGGCGCTGCCCGTGCCGTGGGAGCCTGAACGAGGCGGAGGCCG 

GCAGCCGGGACAGCAGCGACCTGGGCCCGGCGCAGGGGCCCCGGCGGGGCGGCCGGAG 
GGGGGCGGGCCCTGGGCCCGGACAGAGGGGTCCAGCCTCCACAGCGAGCCTGAGAGGG 
CCGGCCTCGGGCCTGCGCCGGGGACAGAGAGTCCGCAGGCAGAATTCTGGACAGACGG 
ACAGACTGAGCCCGCGGCAGCTGGCCTTGGAGTAGAGACCGAGAGGCCCAAGCAAAAG 
ACGGAGCCAGACAGGTCCAGCCTCCGGACGCATCTAGAATGGAGCTGGTCAGAGCTGG 
AGACGACTTGTCTTTGGACGGAGACCGGGACAGATGGCCTTTGGACTGATCCGCACAG 
GTCCGACCTCCAGTTTCAGCCCGAGGAGGCCAGCCCCTGGACACAGCCAGGGGTTCAT 
GGGCCCTGGACAGAGCTGGAAACGCATGGGTCACAGACTCAGCCAGAGAGGGTCAAGT 
CCTGGGCTGATAACCTCTGGACCCACCAGAACAGTTCCAGCCTCCAGACTCACCCAGA 
AGGAGCCTGTCCCTCAAAAGAGCCAAGTGCTGATGGCTCCTGGAAAGAATTGTATACT 
GATGGCTCCAGGACACAACAGGATATTGAAGGTCCCTGGACAGAGCCATATACTGATG 
GCTCCCAGAAAAAACAGGATACTGAAGCAGCCAGGAAACAGCCTGGCACTGGTGGTTT 
CCAAATACAACAGGATACTGATGGCTCCTGGACACAACCTAGCACTGACGGTTCCCAG 
ACAGCACCTGGGACAGACTGCCTCTTGGGAGAGCCTGAGGATGGCCCATTAGAGGAAC 
CAGAGCCTGGAGAATTGCTGACTCACCTGTACTCTCACCTGAAGTGTAGCCCCCTGTG 
CCCTGTGCCCCGCCTCATCATTACCCCTGAGACCCCTGAGCCTGAGGCCCAGCCAGTG 




ACGAGTCTGAGGATGACGTGGTGGCCGGGGGCGGAGGTGCCAGCGATCCCGAGGACAG 
GTCTGGGAGCAAACCCTGGAAGAAGCTGAAGACAGTTCTGAAGTATTCACCCTTTGTG 
GTCTCCTTCCGAAAACACTACCCTTGGGTCCAGCTTTCTGGACATGCTGGGAACTTCC 
AGGCAGGAGAGGATGGTCGGATTCTGAAACGTTTCTGTCAGTGTGAGCAGCGCAGCCT 

CTGCAGGATGGCCAGACCTTCAACCAGATGGAAGACCTCCTGGCTGACTTTGAGGGCC 
CCTCCATTATGGACTGCAAGATGGGCAGCAGGACCTATCTGGAAGAGGAGCTAGTGAA 
GGCACGGGAACGTCCCCGTCCCCGGAAGGACATGTATGAGAAGATGGTGGCTGTGGAC 
CCTGGGGCCCCTACCCCTGAGGAGCATGCCCAGGGTGCAGTCACCAAGCCCCGCTACA 
TGCAGTGGAGGGAAACCATGAGCTCCACCTCTACCCTGGGCTTCCGGATCGAGGGCAT 

GTGACAAAAGTGCTGGAGGACTTCGTGGATGGAGACCACGTCATCCTGCAAAAGTACG 
TGGCATGCCTAGAAGAACTTCGTGAAGCTCTGGAGATCTCCCCCTTCTTCAAGACCCA 
CGAGGTGGTAGGCAGCTCCCTCCTCTTCGTGCACGACCACACCGGCCTGGCCAAGGTC 
TGGATGATAGACTTCGGCAAGACGGTGGCCTTGCCCGACCACCAGACGCTCAGCCACA 
GGCTGCCCTGGGCTGAGGGCAACCGTGAGGACGGCTACCTCTGGGGCCTGGACAACAT 
GATCTGCCTCCTGCAGGGGCTGGCACAGAGCTGAGCTGCTCAGCCACCATCAGGTTAA 
n 3aT = " T i VI ftGGCCTGAGGTT 




ORF Start: ATG at 13 joRF Stop: TGA at 2062 




SEQ ID NO: 342 |683 aa |MW at 75206.8kD 


NOV123a, 

CG88613-01 Protein Sequence 


MRRCPCRGSLNEAEAGALPAAARMGLEAPRGGRRRQPGQQRPGPGAGAPAGRPEGGGP 
WARTEGSSLHSEPERAGLGPAPGTESPQAEFWTDGQTEPAAAGLGVETERPKQKTEPD 
RSSLRTHLEWSWSEDETTCLWTETGTDGLWTDPHRSDLQFQPEEASPWTQPGVHGPWT 
ELETHGSQTQPERVKSWADNLWTHQNSSSLQTHPEGACPSKEPSADGSWKELYTDGSR 
TQQDIEGPWTEPYTDGSQKKQDTEAARKQPGTGGFQIQQDTDGSWTQPSTDGSQTAPG 
TDCLLGEPEDGPLEEPEPGELLTHLYSHLKCSPLCPVPRLIITPETPEPEAQPVGPPS 
RVEGGSGGFSSASSFDESEDDWAGGGGASDPEDRSGSKPWKKLKTVLKYSPFWSFR 
KHYPWVQLSGHAGNFQAGEDGRILKRFCQCEQRSLEQLMKDPLRPFVPAYYGMVLQDG 
QTFNQMEDLLADFEGPSIMDCKMGSRTYLEEELVKARERPRPRKDMYEKMVAVDPGAP 
TPEEHAQGAVTKPRYMQWRETMSSTSTLGFRIEGIKKADGTCNTNFKKTQALEQVTKV 
LEDFVDGDHVILQKYVACLEELREALEISPFFKTHEWGSSLLFVHDHTGLAKVWMID 
FGKTVALPDHQTLSERLPViAEGNREDGYLWGLDKMICLLCGLAQS 



Further analysis of the NOV123a protein yielded the following properties shown in 
Table 123B. 
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Table 123B. Protein Sequence Properties NOV123a 


PSort 
analysis: 


0.5663 probability located in microbody (peroxisome); 0.3000 probability 
located in nucleus; 0.1000 probability located in mitochondrial matrix space; 
0.1000 probability located in lysosome (lumen) 


SignalP 
analysis: 


No Known Signal Sequence Predicted 



A search of the NOV 123a protein against the Geneseq database, a proprietary 
database that contains sequences published in patents and patent publication, yielded 
several homologous proteins shown in Table 123C. 



Table 123C. Geneseq Results for NOV123a 


Geneseq 
Identifier 


Protein/Organism/Length 
[Patent #, Date] 


NOV123a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Region 


Expect 
Value 


AAM41393 


Human polypeptide SEQ ID NO 
6324 - Homo sapiens, 687 aa. 
[WO200153312-A1, 26-JUL- 
2001] 


1..683 
5..687 


682/683 (99%) 
682/683 (99%) 


0.0 


AAM39607 


Human polypeptide SEQ ID NO 
2752 - Homo sapiens, 711 aa. 
[WO200153312-A1, 26-JUL- 
2001] 


12..683 
36..711 


642/680 (94%) 
643/680 (94%) 


0.0 


AAE04364 


Human kinase (PKIN)-5 - Homo 
sapiens, 798 aa. [WO2001 46397- 
A2, 28-JUN-2001] 


273..682 
380..793 


219/432 (50%) 
285/432 (65%) 


e-117 



In a BLAST search of public sequence databases, the NOV 123a protein was found 
to have homology to the proteins shown in the BLASTP data in Table 123D. 



Table 1231). Public BLASTP Results for \OV123a 


Protein 
Accession 
Number 


Protein/Organism/Length 


NOV123a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Portion 


Expect 
Value 


Q96DU7 


INOSITOL 1,4,5- 
TPJSPHOSPHATE 3-KINASE C 
- Homo sapiens (Human), 683 aa. 


1..683 
1..683 


683/683 (100%) 
683/683 (100%) 


0.0 
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TRISPHOSPHATE 3-KINASE 
ISOENZYME (EC 2.7.1.127) - 
Homo sapiens (Human), 604 aa 
(fragment). 


83..683 
4..604 


oOI/oOl (100%) 
601/601 (100%) 


0.0 


SI 7682 


lD-myo-inositol-trisphosphate 3- 
kinase (EC 2.7.1 .127) B - human, 
472 aa. 


273..682 
54..467 


219/432 (50%) 
285/432 (65%) 


e-117 


CAB65055 


INOSITOL 1,4,5- 
TRISPHOSPHATE 3-KINASE B 
- Homo sapiens (Human), 946 aa. 


273. .682 
528..941 


219/432 (50%) 
285/432 (65%) 


e-117 


Q96JS1 


INOSITOL 1,4,5- 
TRISPHOSPHATE 3-KINASE, 
ISOFORM B (EC 2.7.1.127) - 
Homo sapiens (Human), 946 aa. 


273 .682 
528..941 


219/432 (50%) 
285/432 (65%) 


e-117 

_ 



PFam analysis predicts that the NOV 123a protein contains the domains shown in 
the Table 123E. 



Table I23F. Domain Analysis of NOV 123a 






Identities/ 


Pfam Domain 


NOV123a Match Region 


Similarities 1 Expect Value 






for the Matched Region j 


No Significant Matches Found 



Example 124. 



The NOV 124 clone was analyzed, and the nucleotide and predicted polypeptide 
sequences are shown in Table 124A. 



Table 124 V. NOV124 Sequence Analysis 




SEQIDNO:343 1 1395 bp 


NOV124a, 

CG59993-01 DNA Sequence 


GGTAAGACGACCTCTGGATGCTCACCCTGCCCTCTTCACCTCTCGTCCCCAGCTGTTT 


CCTCTGCCACCATGAGGAACATTTTCAAGAGGAACCAGGAGCCTATTGTGGCTCCTGC 
CACCACCACCGCCACGATGCCCATTGGACCCGTGGACAACTCCACTGAGAGTGGGGGT 
GCTGGGGAGAGCCAGGAGGACATGTTTGCCAAACTGAAGGAGAAGTTATTCAATGAGA 
TAAACAAGATTCCCTTACCACCCTGGGCACTGATCGCCATTGCTGTGGTTGCTGGGCT 
CCTGCTTCTCACCTGCTGCTTCTGCATCTGCAAGAAATGCTGCTGCAAGAAGAAGAAG 
AACAAGAAGGAGAAGGGCAAAGGCATGAAGAATGCCATGAACATGAAGGACATGAAAG 
GGGGTCAGGATGACGACGACGCAGAGACAGGCCTGACTGAGGGGGAAGGTGAAGGGGA 
GGAGGAGAAAGAGCCAGAGAACCTGGGCAAACTGCAGTTTTCCCTGGACTATGATTTT 
CAGGCTAATCAGCTTACTGTGGGCGTTCTGCAGGCTGCTGAACTGCCTGCCCTGGACA 
TGGGAGGCACCTCAGACCCTTATGTCAAGGTCTTCCTCCTTCCTGACAAGAAGAAGAA 
ATATGAGACCAAAGTCCATCGGAAGACACTGAACCCTGCCTTCAATGAAACCTTCACC 
TTCAAGGTGCCATACCAGGAGCTTGGGGGCAAAACTCTGGTGATGGCCATCTATGACT 
TTGACCGCTTCTCCAAACATGACATCATTGGAGAGGTAAAGGTGCCTATGAACACAGT 
GGACCTCGGCCAGCCCATTGAGGAGTGGAGAGACCTGCAAGGCGGGGAAAAGGAGGAG 
CCGGAGAAGCTGGGCGACATCTGCACCTCCCTGCGCTATGTGCCCACGGCCGGGAAGC 
TCACTGTCTGCATCCTGGAGGCTAAGAACCTCAAGAAGATGGACGTGGGCGGCCTTTC 
AGACCCGTACGTGAAGATCCACCTGATGCAGAATGGCAAGAGGCTCAAGAAGAAGAAG 
ACAACCGTGAAGAAGAAGACCCTGAACCCATACTTCAACGAGTCCTTCAGCTTTGAGA 
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TCCCCTTCGAGCAGATTCAGAAAGTCCAGGTAGTGGTCACCGTGCTGGACTATGACAA 
GCTGGGCAAGAACGAAGCCATAGGCAAGATCTTCGTGGGCAGCAATGCCACGGGCACA 
GAGCTGCGGCACTGGTCCGACATGCTGGCCAACCCCCGGAGGCCCATCGCCCAGTGGC 
ACTCGCTCAAGCCTGAGGAGGAGGTGGATGCACTCCTGGGCAAGAACAAGTAGACAGC 
AGCGGCTGGGACCCCACACCTTTCACGGACACTGACAAGATCCAGAGCTATCAATACC 




— „_„.,„..,„■■„„„.,„_„_.,■..■ 


ORF Start: ATG at 70jORF Stop: TAG at 1327 




SEQ ID NO: 344 (419 aa |MW at 46871. 8kD 


NOV124a, 

CG59993-01 Protein Sequence 


MRNIFKRNQEPIVAPATTTATMPIGPVDNSTESGGAGESQEDMFAKLKEKLFKEINKI 
PLPPWALIAIAWAGLLLLTCCFCICKKCCCKKKKNKKEKGKGMKNAMNMKDMKGGQD 
DDDAETGLTEGEGEGEEEKEPENLGKLQFSLDYDFQANQLTVGVLQAAELPALDMGGT 
SDPYVKVFLLPDKKKKYETKVHRKTLNPAFNETFTFKVPYQELGGKTLVMAIYDFDRF 
SKHDIIGEVKVPMNTVDLGQPIEEWRDLQGGEKEEPEKLGDICTSLRYVPTAGKLTVC 
ILEAKNLKKMDVGGLSDPYVKIHLMQNGKRLKKKKTTVKKKTLNPYFNESFSFEIPFE 
QIQKVQVWTVLDYDKLGKNEAIGKIFVGSNATGTELRHWSDMLANPRRPIAQWHSLK 
EEEEVE'ALLGKNK 




SEQ ID NO: 345 1 1338 bp 


NOV124b, 

CG59993-02 DNA Sequence 


CCACCATGAGGAACATTTTCAAGAGGAACCAGGAGCCTATTGTGGCTCCTGCCACCAC 
CACCGCCACGATGCCCATTGGACCCGTGGACAACTCCACTGAGAGTGGGGGTGCTGGG 
GAGAGTCAGGAGGACATGTTTGCCAAACTGAAGGAGAAGTTATTCAATGAGATAAACA 
AGATTCCCTTACCACCCTGGGCACTGATCGCCATTGCTGTGGTTGCTGGGCTCCTGCT 
TCTCACCTGCTGCTTCTGCATCTGCAAGAAATGCTGCTGCAAGAAGAAGAAGAACAAG 
AAGGAGAAGGGCAAAGGTATGAAGAATGCCATGAACATGAAGGACATGAAAGGGGGTC 
AGGATGACGACGACGCAGAGACAGGCCTGACTGAGGGGGAAGGTGAAGGGGAGGAGGA 
GAAAGAGCCAGAGAACCTGGGCAAACTGCAGTTTTCCCTGGACTATGATTTTCAGGCT 
AATCAGCTTACTGTGGGCGTTCTGCAGGCTGCTGAACTGCCTGCCCTGGACATGGGAG 
GCACCTCAGACCCTTATGTCAAGGTCTTCCTCCTTCCTGACAAGAAGAAGAAATATGA 
GACCAAAGTCCATCGGAAGACACTGAACCCTGCCTTCAATGAAACCTTCACCTTCAAG 
GTGCCATACCAGGAGCTTGGGGGCAAAACTCTGGTGATGGCCATCTATGACTTTGACC 
GCTTCTCCAAACATGACATCATTGGAGAGGTAAAGGTGCCTATGAACACAGTGGACCT 
CGGCCAGCCCATTGAGGAGTGGAGAGACCTGCAAGGCGGGGAAAAGGAGGAGCCGGAG 
AAGCTGGGCGACATCTGCACCTCCCTGCGCTATGTGCCCACGGCCGGGAAGCTCACTG 
TCTGCATCCTGGAGGCTAAGAACCTCAAGAAGATGGACGTGGGCGGCCTTTCAGACCC 
GTACGTGAAGATCCACCTGATGCAGAATGGCAAGAGGCTCAAGAAGAAGAAGACAACC 
ATGAAGAAGAAGACCCTGAACCCATACTTCAACGAGTCCTTCAGCTTTGAGATCCCCT 
TCGAGCAGATTCAGAAAGTCCAGGTAGTGGTCACCGTGCTGGACTATGACAAGCTGGG 
CAAGAACGAAGCCATAGGCAAGATCTTCGTGGGCAGCAATGCCACGGGCACAGAGCTG 
CGGCACTGGTCCGACATGCTGGCCAACCCCCGGAGGCCCATCGCCCAGTGGCACTCGC 
TCAAGCCTGAGGAGGAGGTGGGTGCACTCCTGGGCAAGAACAAGTAGACAGCAGCGGC 
TGGGACCCCACACCTTTCACGGACACTGACAAGATCCAGAGCTATCAATAAGGTGTAG 






ORF Start: ATG at 6 


ORF Stop: TAG at 1263 




SEQ ID NO: 346 


419 aa MW at 46845. 9kD 


NOV124b, 

CG5 9993-02 Protein Sequence 


MRNIFKRNQEPIVAPATTTATMPIGPVDNSTESGGAGESQEDMFAKLKEKLFNEINKI 
PLPPWALIAIAWAGLLLLTCCFCICKKCCCKKKKNKKEKGKGMKNAMNMKDMKGGQD 
DDDAETGLTEGEGEGEEEKEPENLGKLQFSLDYDFQANQLTVGVLQAAELPALDMGGT 
SDPYVKVFLLPDKKKKYETKVHRKTLNPAFNETFTFKVPYQELGGKTLVMAIYDFDRF 
SKHDI IGEVKVPMNTVDLGQP I EEWRDLQGGEKEEPEKLGD ICTSLRYVPTAGKLTVC 
I LEAKNLKKMDVGGLSDPYVK I HLMQKTGKRLKKKKTTMKKKTLNP YFNES FSFE I PFE 
QIQKVQVWTVLDYDKLGKNEAIGKIFVGSNATGTELRHWSDMLANPRRPIAQWHSLK 
PEEEVGALLGKKK 



Sequence comparison of the above protein sequences yields the following sequence 
relationships shown in Table 124B. 



Table 124B. Comparison of NOV124a against NOV124b. 


Protein Sequence 


NOV124a Residues/ 
Match Residues 


Identities/ 
Similarities for the Matched Region 


NOV1 24b 


1..419 
1 419 


335/419(79%) 
335/419(79%) 
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Further analysis of the NOV 124a protein yielded the following properties shown in 
Table 124C. 



Table 124C. Protein Sequence Properties NOV124a 


PSort 
analysis: 


0.8202 probability located in mitochondrial inner membrane; 0.6000 
probability located in endoplasmic reticulum (membrane); 0.3500 probability 
located in nucleus; 0.3034 probability located in mitochondrial intermembrane 
space 


SignalP 
analysis: 


No Known Signal Sequence Predicted 



A search of the NOV 124a protein against the Geneseq database, a proprietary 
database that contains sequences published in patents and patent publication, yielded 
several homologous proteins shown in Table 124D. 



Table 1241). Geneseq Results for NOV124a 


Geneseq 
Identifier 


Protein/Organism/Length 
[Patent #, Date] 


NOV124a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Region 


Expect 
Value 


AAR97722 


Mouse inositol polyphosphate 
binding protein IP4-BP - Mus 
musculus, 422 aa. [JP08092290-A, 
09-APR-1996] 


1..419 
1..422 


412/422 (97%) 
414/422 (97%) 


0.0 


AAU19715 


Human novel extracellular matrix 
protein, Seq ID No 365 - Homo 
sapiens, 461 aa. [WO2001 55368- 
Al, 02-AUG-2001] 


128..405 
169..447 


141/280 (50%) 
201/280 (71%) 


2e-80 


AAU19714 


Human novel extracellular matrix 
protein, Seq ID No 364 - Homo 
sapiens, 295 aa. [WO200155368- 
A1.02-AUG-2001] 


141. .409 
11. .281 


140/273 (51%) 
193/273 (70%) 


3e-74 


AAW87702 


A human membrane fusion protein 
designated SYTAX2 - Homo 
sapiens, 375 aa. [W09856813-A2, 
17-DEC-1998] 


59.. 407 
31. .364 


146/352(41%) 
220/352 (62%) 


4e-73 I 


AAO05534 


Human polypeptide SEQ ID NO 

19426 - Homo sapiens, 149 aa. 

| WO200I 648^5- A2, 07-SHP-200 1 | 


33.. 164 
15.. 149 




127/135 (94%) 
131/135 (96%) 


5e-70 
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In a BLAST search of public sequence databases, the NOV 124a protein was found 
to have homology to the proteins shown in the BLASTP data in Table 124E. 





Table 124E. Public BLASTP Results for NOV124a 




Protein 
Accession 
Number 


Protein/Organism/Length 




NOV124a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Portion 


Expect 
Value 


P29101 


Synaptotagmin II (Sytll) - Rattus 
norvegicus (Rat), 422 aa. 


1..419 
1..422 


411/422 (97%) 
414/422 (97%) 


0.0 


A55417 


synaptotagmin II - mouse, 422 aa. 


1..419 
1..422 


412/422 (97%) 
414/422 (97%) 


0.0 


P46097 




Synaptotagmin II (Sytll) - Mus 
musculus (Mouse), 422 aa. 


1..419 
1..422 


411/422 (97%) 
413/422 (97%) 


0.0 


P24506 


Synaptotagmin B (Synaptic vesicle 
protein 0-P65-B) - Discopyge 
ommata (Electric ray), 439 aa. 


10..419 
27..439 


341/413 (82%) 
366/413 (88%) 


0.0 


P46096 


Synaptotagmin I (Sytl) (p65) - 
Mus musculus (Mouse), 421 aa. 


10..419 
8..42 1 


323/418(77%) 
353/418(84%) 


0.0 



PFam analysis predicts that the NOV 124a protein contains the domains shown in 
the Table 124F. 



Table 124F. Domain Analysis of NOV124a 



Pfam Domain 


NOV124a Match 
Region 


Identities/ 
Similarities 
for the Matched 
Region 


Expect 
Value 


Adeno E3 CR2: domain 1 
of 1 


62..108 


16/50(32%) 
26/50 (52%) 


6.5 


C2: domain 1 of 2 


156..242 


54/97 (56%) 
81/97 (84%) 


1.8e-42 


C2: domain 2 of 2 


287..375 




44/97 (45%) 
80/97 (82%) 


2.9e-39 



Example 125. 



The NOV 125 clone was analyzed, and the nucleotide and predicted polypeptide 
sequences are shown in Table 125 A. 
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Table 125 A. NOV125 Sequence Analysis 




SEQ ID NO: 347 


3226 bp 


NOV125a, 

CG59991-01 DNA Sequence 


GGACCACTTCTGATGCATCTCTGGGTCCCAACACTATCCACTGCAAGGCCTCGAAACA 


GGGGGGCCAGATGGGACCCCCATTTAGCACAAGAGAGACGTCCACACTCTGTGAGCCC 
AAAGGGAGAAGGCTCAGGCCACGGCAGAGACGGAACCAGGAAAACGTCACGAAAAACA 
GCCTCAAGTTGCCAGGTCCCTTGCAGGAACAGACAGGCCTGGGGCCGCCCCACCTGGG 
CTCAGAGCTTGGGCTGCATGGAGGTGACACATGGGACTACAAGAGTCACGTGATGACC 
AAATTCGCTGAGGAGGAGGATGTACGTCGTAGTTTTGAAAACACTGCTGCTGACTGGC 
CGGAAATGCAAACGTTGGCTGGTGCTTTTGATTCAGACCGGTGGGGCTTCCGGCCTCG 
CACGGTGGTTCTGCACGGAAAGTCAGGAATTGGGAAATCGGCTCTAGCCAGAAGGATC 
GTGCTGTGCTGGGCGCAAGGTGGACTCTACCAGGGAATGTTCTCCTACGTCTTCTTCC 
TCCCCGTTAGAGAGATGCAGCGGAAGAAGGAGAGCAGTGTCACAGAGTTCATCTCCAG 
GGAGTGGCCAGACTCCCAGGCTCCGGTGACGGAGATCATGTCCCGACCAGAAAGGCTG 
TTGTTCATCATTGACGGTTTCGATGACCTGGGCTCTGTCCTCAACAATGACACAAAGC 
TCTGCAAAGACTGGGCTGAGAAGCAGCCTCCGTTCACCCTCATACGCAGTCTGCTGAG 
GAAGGTCCTGCTCCCTGAGTCCTTCCTGATCGTCACCGTCAGAGACGTGGGCACAGAG 
AAGCTCAAGTCAGAGGTCGTGTCTCCCCGTTACCTGTTAGTTAGAGGAATCTCCGGGG 
AACAAAGAATCCACTTGCTCCTTGAGCGCGGGATTGGTGAGCATCAGAAGACACAAGG 
GTTGCGTGCGATCATGAACAACCGTGAGCTGCTCGACCAGTGCCAGGTGCCCGCCGTG 
GGCTCTCTCATCTGCGTGGCCCTGCAGCTGCAGGACGTGGTGGGGGAGAGCGTCGCCC 
CCTTCAACCAAACGCTCACAGGCCTGCACGCCGCTTTTGTGTTTCATCAGCTCACCCC 
TCGAGGCGTGGTCCGGCGCTGTCTCAATCTGGAGGAAAGAGTTGTCCTGAAGCGCTTC 
TGCCGTATGGCTGTGGAGGGAGTGTGGAATAGGAAGTCAGTGTTTGACGGTGACGACC 
TCATGGTTCAAGGACTCGGGGAGTCTGAGCTCCGTGCTCTGTTTCACATGAACATCCT 
TCTCCCAGACAGCCACTGTGAGGAGTACTACACCTTCTTCCACCTCAGTCTCCAGGAC 
TTCTGTGCCGCCTTGTACTACGTGTTAGAGGGCCTGGAAATCGAGCCAGCTCTCTGCC 
CTCTGTACGTTGAGAAGACAAAGAGGTCCATGGAGCTTAAACAGGCAGGCTTCCATAT 
CCACTCGCTTTGGATGAAGCGTTTCTTGTTTGGCCTCGTGAGCGAAGACGTAAGGAGG 
CCACTGGAGGTCCTGCTGGGCTGTCCCGTTCCCCTGGGGGTGAAGCAGAAGCTTCTGC 




CGCCTTCCACTGTCTTTTCGAGACTCAAGACAAAGAGTTTGTTCGCTTGGCATTAAAC 
AGCTTCCAAGAAGTGTGGCTTCCGATTAACCAGAACCTGGACTTGATAGCATCTTCCT 
TCTGCCTCCAGCACTGTCCGTATTTGCGGAAAATTCGGGTGGATGTCAAAGGGATCTT 
CCCAAGAGATGAGTCCGCTGAGGCATGTCCTGTGGTCCCTCTATGGATGCGGGATAAG 
ACCCTCATTGAGGAGCAGTGGGAAGATTTCTGCTCCATGCTTGGCACCCACCCACACC 
TGCGGCAGCTGGACCTGGGCAGCAGCATCCTGACAGAGCGGGCCATGAAGACCCTGTG 
TGCCAAGCTGAGGCATCCCACCTGCAAGATACAGACCCTGATGTTTAGAAATGCACAG 
ATTACCCCTGGTGTGCAGCACCTCTGGAGAATCGTCATGGCCAACCGTAACCTAAGAT 
CCCTCAACTTGGGAGGCACCCACCTGAAGGAAGAGGATGTAAGGATGGCGTGTGAAGC 
CTTAAAACACCCAAAATGTTTGTTGGAGTCTTTGAGGCTGGATTGCTGTGGATTGACC 
CATGCCTGTTACCTGAAGATCTCCCAAATCCTTACGACCTCCCCCAGCCTGAAATCTC 
TGAGCCTGGCAGGAAACAAGGTGACAGACCAGGGAGTAATGCCTCTCAGTGATGCCTT 
GAGAGTCTCCCAGTGCGCCCTGCAGAAGCTGATACTGGAGGACTGTGGCATCACAGCC 
ACGGGTTGCCAGAGTCTGGCCTCAGCCCTCGTCAGCAACCGGAGCTTGACACACCTGT 
GCCTATCCAACAACAGCCTGGGGAACGAAGGTGTAAATCTACTGTGTCGATCCATGAG 
GCTTCCCCACTGTAGTCTGCAGAGGCTGATGCTGAATCAGTGCCACCTGGACACGGCT 
GGCTGTGGTTTTCTTGCACTTGCGCTTATGGGTAACTCATGGCTGACGCACCTGAGCC 
TTAGCATGAACCCTGTGGAAGACAATGGCGTGAAGCTTCTGTGCGAGGTCATGAGAGA 
ACCATCTTGTCATCTCCAGGACCTGGAGTTGGTAAAGTGTCATCTCACCGCCGCGTGC 
TGTGAGAGTCTGTCCTGTGTGATCTCGAGGAGCAGACACCTGAAGAGCCTGGATCTCA 
CGGACAATGCCCTGGGTGACGGTGGGGTTGCTGCACTGTGCGAGGGACTGAAGCAAAA 
GAACAGTGTTCTGACGAGACTCGGGTTGAAGGCATGTGGACTGACTTCTGATTGCTGT 
GAGGCACTCTCCTTGGCCCTTTCCTGCAACCGGCATCTGACCAGTCTAAACCTGGTGC 
AGAATAACTTCAGTCCCAAAGGAATGATGAAGCTGTGTTCGGCCTTTGCCTGTCCCAC 
GTCTAACTTACAGATAATTGGGCTGTGGAAATGGCAGTACCCTGTGCAAATAAGGAAG 
CTGCTGGAGGAAGTGCAGCTACTCAAGCCCCGAGTCGTAATTGACGGTAGTTGGCATT 
CTTTTGATGAAGATGACCGGTACTGGTGGAAAAACTGAAGATACGGAAACCTGCCCCA 
CTCACACCCATCTGATGGAGGAACTTTAAACGCTGT 




ORF Start: ATG at 
69 


ORF Stop: TGA at 3168 




SEQ (D NO: >48 


1033 aa MW a1 I 1 63 I0.7RD 


NOV 125a, 
CG59991-01 Protein 
Sequence 


MGPPFSTRETSTLCEPKGRRLRPRQRRNQENVTKNSLKLPGPLQEQTGLGPPHLGSEL 
GLHGGDTWDYKSHVMTKFAEEEDVRRSFENTAADWPEMQTLAGAFDSDRWGFRPRTW 
LHGKSGIGKSALARRIVLCWAQGGLYQGMFSYVFFLPVREMQRKKESSVTEFISREWP 
DSQAPVTEIMSRPERLLFIIDGFDDLGSVLNNDTKLCKDWAEKQPPFTLIRSLLRKVL 
LPESFLIVTVRDVGTEKLKSEWSPRYLLVRGISGEQRIHLLLERGIGEHQKTQGLRA 
IMNNRELLDQCQVPAVGSLICVALQLQDWGESVAPFNQTLTGLHAAFVFHQLTPRGV 
VRRCLNLEERWLKRFCRMAVEGVWNRKSVFDGDDLMVQGLGESELRALFHMNILLPD 
SHCEEYYTFFHLSLQDFCAALYYVLEGLEIEPALCPLYVEKTKRSMELKQAGFHIHSL 
WMKRFLFGLVSEDVRRPLEVLLGCPVPLGVKQKLLHWSLLGQQPNATTPGDTLDAFH 
CLFETQDKEFVRLALNSFQEVWLPINQNLDLIASSFCLQHCPYLRKIRVDVKGIFPRD 
~ - It F f L L TT 1 j I J 



471 



RHPTCKIQTLMFRNAQITPGVQHLWRIVMANRNLRSLNLGGTHLKEEDVRMACEALKH 
PKCLLESLRLDCCGLTHACYLKISQILTTSPSLKSLSLAGNKVTDQGVMPLSDALRVS 
QCALQKLILEDCGITATGCQSLASALVSNRSLTHLCLSKNSLGNEGVNLLCRSMRLPH 
CSLQRLMLNQCHLDTAGCGFLALALMGNSWLTHLSLSMNPVEDNGVKLLCEVMREPSC 
HLQDLELVKCHLTAACCESLSCVISRSRHLKSLDLTDNALGDGGVAALCEGLKQKNSV 
LTRLGLKACGLTSDCCEALSIALSCNRHLTSLNLVQNNFSPKGMMKLCSAFACPTSNL 
OH E -'J' K \ WHSFDEDDRYWWKI 

Further analysis of the NOV 125a protein yielded the following properties shown in 
Table 125B. 



Table 125B. Protein Sequence Properties NOV125a 


PSort 
analysis: 


0.7600 probability located in nucleus; 0.3000 probability located in 
microbody (peroxisome); 0.1000 probability located in mitochondrial matrix 
space; 0.1000 probability located in lysosome (lumen) 


SignalP 
analysis: 


No Known Signal Sequence Predicted 



A search of the NOV 125a protein against the Geneseq database, a proprietary 
database that contains sequences published in patents and patent publication, yielded 
several homologous proteins shown in Table 125C. 



Table 125C. Geneseq Results for NOVT25a 


Geneseq 
Identifier 


Protein/Organism/Length [Patent 
#, Date] 


NOV125a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Region 


Expect 
Value 


AAE07514 


Human PYRIN- 1 protein - Homo 
sapiens, 1034 aa. [WO200161005- 
A2, 23-AUG-2001] 


103..934 
207..1003 


276/843 (32%) 
445/843 (52%) 


e-126 


AAE07513 


Human nucleotide binding site 1 
(NBS-1) protein - Homo sapiens, 
1033 aa. [WO200161005-A2, 23- 
AUG- >001] 


114..935 
180..990 


281/839 (33%) 
431/839(50%) 


e-120 


AAU07878 

— - 


Polypeptide sequence for 
mammalian Spg65 - Mammalia, 
748 aa. [WO200166752-A2, 13- 
SEP-2001] 


207..963 
9..748 


218/766 (28%) 
380/766 (49%) 


7e-95 


AAE06758 
AAB62571 


Human G-protein coupled receptor- 
8 (GCREC-8) protein - Homo 
sapiens, 1473 aa. [WO200 157085- 
A2, 09-AUG-2001] 


21. .764 
219..959 


235/772 (30%) 
380/772 (48%) 


3e-88 


Human CARD-7 polypeptide - 


21. .764 
219..959 


235/772 (30%) 
380/772 (48%) 


3e-88 
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I [WO200130813-A1, 03-MAY- 
i 2001 ] 



In a BLAST search of public sequence databases, the NOV125a protein was found 
to have homology to the proteins shown in the BLASTP data in Table 125D. 



Table 125D. Public BLASTP Results for NOV125a 



Protein 
Accession 
Number 


Protein/Organism/Length 


Residues/ 

Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Portion 


j 

Expect 
Value 


Q9JLR2 


MATERNAL- ANTIGEN-THAT- 
EMBRYOS-REQUIRE PROTEIN - 
Mus musculus (Mouse), 1111 aa. 


24..1033 
104..1111 


548/1019 (53%) 
716/1019 (69%) 




0.0 

■ \ 


Q9R1M5 


MATER PROTEIN - Mus musculus 
(Mouse), 1111 aa. 


24..1033 
104..1111 


547/1019(53%) 
716/1019(69%) 


0.0 


AAL35293 


NALP4 - Homo sapiens (Human), 
994 aa. 


63..958 
94..981 


291/907 (32%) 
473/907 (52%) 


e-133 I 


Q96MN2 


CDNA FLJ32126 FIS, CLONE 
PEBLM20001 12, WEAKLY 
SIMILAR TO HOMO SAPIENS 
NUCLEOTIDE -BINDING SITE 
PROTEIN 1 MRNA - Homo 
sapiens (Human), 919 aa. 


63. .958 
19..906 


291/907 (32%) 
473/907 (52%) 


e-133 

1 

J 


AAL 12497 


CRYOPYRIN - Homo sapiens 
(Human), 1034 aa. 


103..934 
207.. 1003 


276/843 (32%) 
445/843 (52%) 


e-125 j 



PFam analysis predicts that the NOV125a protein contains the domains shown in 
the Table 125E. 



Table 125E. Domain Analysis of NOV125a 


Pfam Domain 


NOV125a Match Region 


Identities/ 
Similarities 
for the Matched Region 


Expect Value 


LRR: domain 1 of 6 


671. .695 


6/25 (24%) 
16/25 (64%) 


1.6e+02 


LRR: domain 2 of 6 


728..752 


7/27 (26%) 
17/27(63%) 


2.3e+02 
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LRR: domain 3 of 6 


785..809 


7/26 (27%) 
1 9/26 (73%) 


1.6e+02 


LRR: domain 4 of 6 


814..836 


6/25 (24%) 
14/25 (56%) 


4.3e+02 


LRR: domain 5 of 6 


899..923 


8/26 (31%) 
20/26 (77%) 


27 


LRR: domain 6 of 6 


956..977 




7/25 (28%) 
16/25 (64%) 


2.9e+02 



Example 126. 

The NOV 126 clone was analyzed, and the nucleotide and predicted polypeptide 
sequences are shown in Table 126A. 



Table 126A. NOV126 Sequence Analysis 




SEQIDNO:349 { 2310bp 


NOV 126a, 

CG59987-01 DNA Sequence 


CCGCGCCTCAGTCCGCCGTCCGCCCTCCGCGCCCGCGCCGCTAGCATGACCGACGCGC 


TGTTGCCCGCGGCCCCCCAGCCGCTGGAGAAGGAGAACGACGGCTACTTTCGGAAGGG 
CTGTAATCCCCTTGCACAAACCGGCCGGAGTAAATTGCAGAATCAAAGAGCTGCTTTG 
AATCAGCAGATCCTGAAAGCCGTGCGGATGAGGACCGGAGCGGAAAACCTTCTGAAAG 
TGGCCACAAACTCAAAGGTGCGGGAGCAAGTGCGGCTGGAGCTGAGCTTCGTCAACTC 
AGACCTGCAGATGCTCAAGGAAGAGCTGGAGGGGCTGAACATCTCGGTGGGCGTCTAT 
CAGAACACAGAGGAGGCATTTACGATTCCCCTGATTCCTCTTGGCCTGAAGGAAACGA 
AAGACGTCGACTTTGCAGTCGTCCTCAAGGATTTTATCCTGGAACATTACAGTGAAGA 
TGGCTATTTATATGAAGATGAAATTGCAGATCTTATGGATCTGAGACAAGCTTGTCGG 
ACGCCTAGCCGGGATGAGGCCGGGGTGGAACTGCTGATGACATACTTCATCCAGCTGG 
GCTTTGTCGAGAGTCGATTCTTCCCGCCCACACGGCAGATGGGACTCCTGTTCACCTG 
GTATGACTCTCTCACCGGGGTTCCGGTCAGCCAGCAGAACCTGCTGCTGGAGAAGGCC 
AGTGTCCTGTTCAACACTGGGGCCCTCTACACCCAGATTGGGACCCGGTGTGATCGGC 
AGACGCAGGCTGGGCTGGAGAGTGCCATAGATGCCTTTCAGAGAGCCGCAGGGGTTTT 
AAATTACCTGAAAGACACATTTACCCATACTCCAAGTTACGACATGAGCCCTGCCATG 
CTCAGCGTGCTCGTCAAAATGATGCTTGCACAAGCCCAAGAAAGCGTGTTTGAGAAAA 
TCAGCCTTCCTGGGATCCGGAATGAATTCTTCATGCTGGTGAAGGTGGCTCAGGAGGC 
TGCTAAGGTGGGAGAGGTCTACCAACAGCTACACGCAGCCATGAGCCAGGCGCCGGTG 
AAAGAGAACATCCCCTACTCCTGGGCCAGCTTAGCCTGCGTGAAGGCCCACCACTACG 
CGGCCCTGGCCCACTACTTCACTGCCATCCTCCTCATCGACCACCAGGTGAAGCCAGG 
CACGGATCTGGACCACCAGGAGAAGTGCCTGTCCCAGCTCTACGACCACATGCCAGAG 
GGGCTGACACCCTTGGCCACACTGAAGAATGATCAGCAGCGCCGACAGCTGGGGAAGT 
CCCACTTGCGCAGAGCCATGGCTCATCACGAGGAGTCGGTGCGGGAGGCCAGCCTCTG 
CAAGAAGCTGCGGAGCATTGAGGTGCTACAGAAGGTGCTGTGTGCCGCACAGGAACGC 
TCCCGGCTCACGTACGCCCAGCACCAGGAGGAGGATGACCTGCTGAACCTGATCGACG 
CCCCCAGAGTGTTGTTGCTAAAACTGAGCAAGAGGTTGACATTATATTGCCCCAGTTC 
TCCAGCTGACAGTCACGGACTTCTTCCAGAAGCTGGGCCCTTATCTGTGCTGTCGGCT 
AACAAGCGGTGGACGCCTCCTCGAAGCATCCGCTTCACTGCAGAAGAAGGGGACTTGG 
GGTTCACCTTGAGAGGGAACGCCCCCGTTCAGGTTCACTTCCTGGATCCTTACTGCTC 
TGCCTCGGTGGCAGGAGCCCGGGAAGGAGATTATATTGTCTCCATTCAGCTTGTGGAT 
TGTAAGTGGCTGACGCTGAGTGAGGTTATGAAGCTGCTGAAGAGCTTTGGCGAGGACG 
AGATCGAGATGAAAGTCGTGAGCCTCCTGGACTCCACATCATCCATGCATAATAAGAG 
TGCCACATACTCCGTGGGAATGCAGAAAACGTACTCCATGATCTGCTTAGCCATTGAT 
GATGACGACAAAACTGATAAAACCAAGAAAATCTCCAAGAAGCTTTCCTTCCTGAGTT 
GGGGCACCAACAAGAACAGACAGAAGTCAGCCAGCACCTTGTGCCTCCCATCGGTCGG 
GGCTGCACGGCCTCAGGTCAAGAAGAAGCTGCCCTCCCCTTTCAGCCTTCTCAACTCA 
GACAGTTCTTGGTACTAATGTGAGGAAACAAACATGTTCAGGCCCCGAACATTTCCGG 


TGCTGACTCGGCCTTAAACGTTTGTGCCATAATGGAAAATATCTATCTATCTGTTCTC 


AAATCCTGTTTTTCTCATAGTGTAAACTCACATTTGATGTGTTTTTATGAAGGAAAGT 
AACCAAGAAACC" 1 , - " 3 rT"l I 




ORF Start: ATG at 46|ORF Stop: TAA at 2104 




SEQ ID NO: 350 [686 aa jMW at 76812.3kD 


NOV 126a, 

CG59987-01 Protein Sequence 


MTDALLPAAPQPLEKENDGYFRKGCNPLAQTGRSKLQNQRAALNQQILKAVRMRTGAE 
NLLKVATNSKVREQVRLELSFVNSDLQMLKEELEGLNISVGVYQNTEEAFTIPLIPLG 
LKETKDVDFAWLKDFILEHYSEDGYLYEDEIADLMDLRQACRTPSRDEAGVELLMTY 
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FIQLGFVESRFFPPTRQMGLLFTWYDSLTGVPVSQQNLLLEKASVLFNTGALYTQIGT 
RCDRQTQAGLESAIDAFQRAAGVLNYLKDTFTHTPSYDMSPAMLSVLVKMMLAQAQES 
VFEKISLPGIRNEFFMLVKVAQEAAKVGEVYQQLHAAMSQAPVKENIPYSWASLACVK 
AHHYAALAHYFTAI LL I DHQVKPGTDLDHQEKCLSQLYDHMPEGLTPLATLKNDQQRR 
QLGKSHLRRAMAHHEESVREASLCKKLRSIEVLQKVLCAAQERSRLTYAQHQEEDDLL 
NLIDAPRVLLLKLSKRLTLYCPSSPADSHGLLPEAGPLSVLSANKRWTPPRS I RFTAE 
EGDLGFTLRGNAPVQVHFLDPYCSASVAGAREGDYIVSIQLVDCKWLTLSEVMKLLKS 
FGEDEIEMKWSLLDSTSSMHNKSATYSVGMQKTYSMICLAIDDDDKTDKTKKISKKL 
SFLSWGTNKNRQKSASTLCLPSVGAARPQVKKKLPS] 



SEQ ID NO: 351 



;2109 bp 



NOV 126b, 

CG59987-02 DNA Sequence 



CGCCGCTAGCA TGACCGACGCGCTGTTGCCCGCGGCCCCCCAGCCGCTGGAGAAGGAG 
AACGACGGCTACTTTCGGAAGGGCTGTAATCCCCTTGCACAAACCGGCCGGAGTAAAT 
TGCAGAATCAAAGAGCTGCTTTGAATCAGCAGATCCTGAAAGCCGTGCGGATGAGGAC 
CGGAGCGGAAAACCTTCTGAAAGTGGCCACAAACTCAAAGGTGCGGGAGCAAGTGCGG 
CTGGAGCTGAGCTTCGTCAACTCAGACCTGCAGATGCTCAAGGAAGAGCTGGAGGGGC 
TGAACATCTCGGTGGGCGTCTATCAGAACACAGAGGAGGCATTTACGATTCCCCTGAT 
TCCTCTTGGCCTGAAGGAAACGAAAGACGTCGACTTTGCAGTCGTCCTCAAGGATTTT 
ATCCTGGAACATTACAGTGAAGATGGCTATTTATATGAAGATGAAATTGCAGATCTTA 
TGGATCTGAGACAAGCTTGTCGGACGCCTAGCCGGGATGAGGCCGGGGTGGAACTGCT 
GATGACATACTTCATCCAGCTGGGCTTTGTCGAGAGTCGATTCTTCCCGCCCACACGG 
CAGATGGGACTCCTGTTCACCTGGTATGACTCTCTCACCGGGGTTCCGGTCAGCCAGC 
AGAACCTGCTGCTGGAGAAGGCCAGTGTCCTGTTCAACACTGGGGCCCTCTACACCCA 
GATTGGGACCCGGTGCGATCGGCAGACGCAGGCTGGGCTGGAGAGTGCCATAGATGCC 
TTTCAGAGAGCCGCAGGGGTTTTAAATTACCTGAAAGACACATTTACCCATACTCCAA 
GTTACGACATGAGCCCTGCCATGCTCAGCGTGCTCGTCAAAATGATGCTTGCACAAGC 
CCAAGAAAGCGTGTTTGAGAAAATCAGCCTTCCTGGGATCCGGAATGAATTCTTCATG 
CTGGTGAAGGTGGCTCAGGAGGCTGCTAAGGTGGGAGAGGTCTACCAACAGCTACACG 
CAGCCATGAGCCAGGCGCCGGTGAAAGAGAACATCCCCTACTCCTGGGCCAGCTTAGC 
CTGCGTGAAGGCCCACCACTACGCGGCCCTGGCCCACTACTTCACTGCCATCCTCCTC 
ATCGACCACCAGGTGAAGCCAGGCACGGATCTGGACCACCAGGAGAAGTGCCTGTCCC 
AGCTCTACGACCACATGCCAGAGGGGCTGACACCCTTGGCCACACTGAAGAATGATCA 
GCAGCGCCGACAGCTGGGGAAGTCCCACTTGCGCAGAGCCATGGCTCATCACGAGGAG 
TCGGTGCGGGAGGCAAGCCTCTGCAAGAAGCTGCGGAGCATTGAGGTGCTACAGAAGG 
TGCTGTGTGCCGCACAGGAACGCTCCCGGCTCACGTACGCCCAGCACCAGGAGGAGGA 
TGACCTGCTGAACCTGATCGACGCCCCCAGTGTTGTTGCTAAAACTGAGCAAGAGGTT 
GACATTATATTGCCCCAGTTCTCCAAGCTGACAGTCACGGACTTCTTCCAGAAGCTGG 
GCCCCTTATCTGTGTTTTCGGCTAACAAGCGGTGGACGCCTCCTCGAAGCATCCGCTT 
CACTGCAGAAGAAGGGGACTTGGGGTTCACCTTGAGAGGGAACGCCCCCGTTCAGGTT 

TTGTCTCCATTCAGCTTGTGGATTGTAAGTGGCTGACGCTGAGTGAGGTTATGAAGCT 
GCTGAAGAGCTTTGGCGAGGACGAGATCGAGATGAAAGTCGTGAGCCTCCTGGACTCC 
ACATCATCCATGCATAATAAGAGTGCCACATACTCCGTGGGAATGTA GAAAACGTACT 
CCATGATCTGCTTAGCCATTGATGATGACGACAAAACTGATAAAACCAAGAAAATCTC 



{ CAAGAAGCTTTCCTTCCTGAGTTGGGGCACCAACAAGAACAGACAGAAGTCAGCCAGC 

l ACCTTGTGCCTCCCATCCG"' f 'J>GAAGCTGCCCT 

j cCCCTTTCAGCCTTCTCAACTCAGACAGTTCTTGGTACTAATGTGAGGAAACAAACAT 
I GTTCAGGCCCCGAACATTTC 



|ORF Start: ATG at 11 


ORF Stop: TAG at 1844 




SEQ ID NO: 352 


611 aa IMW at 686 1 3 .9kD "™~~" 


NOV126b, 

CG59987-02 Protein Sequence 


MTDALLPAAPQPLEKENDGYFRKGCNPLAQTGRSKLQNQRAALNQQILKAVRMRTGAE 
NLLKVATNSKVREQVRLELSFVNSDLQMLKEELEGLNISVGVYQNTEEAFTIPLIPLG 
LKETKDVDFAWLKDFILEHYSEDGYLYEDEIADLMDLRQACRTPSRDEAGVELLMTY 
FIQLGFVESRFFPPTRQMGLLFTWYDSLTGVPVSQQNLLLEKASVLFNTGALYTQIGT 
RCDRQTQAGLESAIDAFQRAAGVLNYLKDTFTHTPSYDMSPAMLSVLVKMMLAQAQES 
VFEKISLPGIRNEFFMLVKVAQEAAKVGEVYQQLHAAMSQAPVKEWIPYSWASLACVK 
AHHYAALAHYFTAILLIDHQVKPGTDLDHQEKCLSQLYDHMPEGLTPLATLKNDQQRR 
QLGKSHLRRAMAHHEESVREASLCKKLRSIEVLQKVLCAAQERSRLTYAQHQEEDDLL 
KLIDAPS WAKTEQEVD 1 1 LPQFSKLTVTDFFQKLGPLSVFSANKRWTPPRS I RFTAE 
EGDLGFTLRGNAPVQVHFLDPYCSASVAGAREGDYIVSIQLVDCKWLTLSEVMKLLKS 
t EIEllr < LI [rt T 



Sequence comparison of the above protein sequences yields the following sequence 
relationships shown in Table 126B. 
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Table 126B. Comparison of NOV126a against NOV126b. 



Protein Sequence 


NOV126a Residues/ 
Match Residues 


Identities/ 
Similarities for the Matched Region 


NOV 126b 


1..611 
1..61 1 


585/612(95%) 
590/612(95%) 



Further analysis of the NOV 126a protein yielded the following properties shown in 
Table 126C. 



Table 126C. Protein Sequence Properties NOV126a 


PSort 
analysis: 


0.4500 probability located in cytoplasm; 0.3000 probability located in 
microbody (peroxisome); 0.1000 probability located in mitochondrial matrix 
space; 0.1000 probability located in lysosome (lumen) 


SignalP 
analysis: 


No Known Signal Sequence Predicted 

_ ... 



A search of the NOV 126a protein against the Geneseq database, a proprietary 
database that contains sequences published in patents and patent publication, yielded 
several homologous proteins shown in Table 126D. 



Table 126D. Geneseq Results for NOV 126a 


Geneseq 
Identifier 


Protein/Organism/Length [Patent 
#, Date] 


NOV126a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Region 


Expect 
Value 


AAU10192 


Human prostate specific protein 
PSL22 - Homo sapiens, 686 aa. 
|WO200172962-A2, 04-OCT-2001] 


1..686 
1..686 


660/687 (96%) 
665/687 (96%) 


0.0 


AAB68561 


Human GTP-binding associated 
protein #61 - Homo sapiens, 666 aa. 
[WO200105970-A2, 25-JAN-2001] 


27..686 
7..666 


626/661 (94%) 
633/661 (95%) 


0.0 


AAG64579 


Human transcription termination 
factor binding protein 54 - Homo 
sapiens, 488 aa. [CN1297918-A, 
06-JUN-2001] 


201. .686 
3..488 


458/487 (94%) 
464/487 (95%) 


0.0 


AAB29661 


Human histidine domain-protein 
tyrosine phosphatase, SEQ ID NO:2 


110..357 
7..253 


82/252 (32%) 
135/252 (53%) 


3e-28 
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[WO200063392-A1, 26-OCT-2000] 









AAU00869 


Human cancer related protein 5 - 
Homo sapiens, 257 aa. 
[WO200118014-A1, 15-MAR- 
2001] 


409..597 
8..196 


70/189 (37%) 
102/189 (53%) 


2e-27 



In a BLAST search of public sequence databases, the NOV 126a protein was found 
to have homology to the proteins shown in the BLASTP data in Table 126E. 





Table 126E. Public BLASTP Results for NOV126a 


ill 


Protein 
Accession 
Number 


Protein/Organism/Length 


NOV126a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Portion 


Expect 
Value 




Q96RU1 


RHOPHILIN-LIKE PROTEIN - 
Homo sapiens (Human), 685 aa. 


1..686 
1..685 


627/688 (91%) 
640/688 (92%) 


0.0 




Q9DBN2 


1300002E07RIK PROTEIN - 
Mus musculus (Mouse), 686 aa. 


1..686 
1..686 


573/687 (83%) 
616/687 (89%) 


0.0 




Q61085 


GTP-RHO binding protein 1 
(Rhophilin) - Mus musculus 
(Mouse), 643 aa. 


16..596 
20..580 


273/583 (46%) 
361/583 (61%) 


e-135 




Q9XYY9 


RHOPHILIN - Drosophila 
melanogaster (Fruit fly), 718 aa. 


21..615 
31. .674 


248/654 (37%) 
363/654 (54%) 


e-110 




Q96PV9 


KIAA1929 PROTEIN - Homo 
sapiens (Human), 410 aa 
(fragment). 


23..366 
17..362 


178/346 (51%) 
241/346 (69%) 


le-93 



PFam analysis predicts that the NOV 126a protein contains the domains shown in 
5 the Table 126F. 



Table 126F. Domain Analysis of NOV126a 



Pfam Domain 


NOV126a Match 
Region 


Identities/ 
Similarities 
for the Matched 
Region 


Expect 
Value 


HR1: domain 1 of 1 


38..110 


19/87(22%) 
53/87 (61%) 


1.2e-05 


BROl: domain 1 of 
1 


111. .263 


60/172 (35%) 
125/172 (73%) 


3.8e-56 
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PDZ: domain 1 of 1 


516..593 


20/84 (24%) 


1 0.46 






53/84 (63%) 





Example 127. 



The NOV127 clone was analyzed, and the nucleotide and predicted polypeptide 
sequences are shown in Table 127A. 



Table 127A. NOV127 Sequence Analysis 



SEQ ID NO: 353 



3351 bp 



NOV 127a, 

CG59971-01 DNA Sequence 



CGTCCCGTGGCC ATGACGACCGCTCAGAGGGACTCCCTGTTGTGGAAGCTCGCGGGGT 
TGCTGCGGGAGTCCGGTGATGTGGTCCTGTCTGGCTGTAGCACCCTGAGCCTGCTGAC 
TCCCACACTGCAACAGCTGAACCACGTATTTGAGCTGCACCTGGGGCCATGGGGCCCT 
GGCCAGACAGGCTTTGTGGCTCTGCCCTCCCATCCTGCCGACTCCCCTGTTATTCTTC 
AGCTTCAGTTTCTCTTCGATGTGCTGCAGAAAACACTTTCACTCAAGCTGGTCCATGT 
TGCTGGTCCTGGCCCCACAGGGCCCATCAAGATTTTCCCCTTCAAATCCCTTCGGCAC 
CTGGAGCTCCGAGGTGTTCCCCTCCACTGTCTGCATGGCCTCCGAGGCATCTACTCCC 
AGCTGGAGACCCTGATTTGCAGCAGGAGCCTCCAGGCATTAGAGGAGCTCCTCTCAGC 
CTGCGGCGGCGACTTCTGCTCTGCCCTCCCTTGGCTGGCTCTGCTTTCTGCCAACTTC 
AGCTACAATGCACTGACCGCCTTAGACAGCTCCCTGCGCCTCTTGTCAGCTCTGCGTT 
TCTTGAACCTAAGCCACAATCAAGTCCAGGACTGTCAGGGATTCCTGATGGATTTGTG 
TGAGCTCCACCATCTGGACATCTCCTATAATCGCCTGCATTTGGTGCCAAGAATGGGA 
CCCTCAGGGGCTGCTCTGGGGGTCCTGATACTGCGAGGCAATGAGCTTCGGAGCCTGC 
CAGGCCTAGAGCAGCTGAGGAATCTGCGGCACCTGGATTTGGCATACAACCTGCTGGA 
AGGACACCGGGAGCTGTCACCACTGTGGCTGCTGGCTGAGCTCCGCAAGCTCTACCTG 
GAGGGGAACCCTCTTTGGTTCCACCCTGAGCACCGAGCAGCCACTGCCCAGTACTTGT 
CACCCCGGGCCAGGGATGCTGCTACTGGCTTCCTTCTCGATGGCAAGGTCTTGTCACT 
GACAGATTTTCAGCAGACTCACACATCCTTGGGGCTCAGCCCCATGGGCCCACCTTTG 
CCCTGGCCAGTGGGGAGTACTCCTGAAACCTCAGGTGGCCCTGACCTGAGTGACAGCC 
TCTCCTCAGGGGGTGTTGTGACCCAGCCCCTGCTTCATAAGGTTAAGAGCCGAGTCCG 
TGTGAGGCGGGCAAGCATCTCTGAACCCAGTGATACGGACCCGGAGCCCCGAACTCTG 
AACCCCTCTCCGGCTGGTTGGTTCGTGCAGCAGCACCCGGAGCTGGAGCTCATGAGCA 
GCTTCCGGGAACGGTTCGGCCGCAACTGGCTGCAGTACAGGAGTCACCTGGAGCCCTC 
CGGAAACCCTCTGCCGGCCACCCCCACTACTTCTGCACCCAGTGCACCTCCAGCCAGC 
TCCCAGGGCCCCGACACTGCACCCAGACCTTCACCCCCGCAGGAGGAAGCCAGAGGCC 
CCCAGGAGTCACCACAGAAAATGTCAGAGGAGGTCAGGGCGGAGCCACAGGAGGAGGA 
AGAGGAGAAGGAGGGGAAGGAGGAGAAGGAGGAGGGGGAGATGGTGGAACAGGGAGAA 
GAGGAGGCAGGAGAGGAGGAAGAAGAGGAGCAGGACCAGAAGGAAGTGGAAGCGGAAC 
TCTGTCGCCCCTTGTTGGTGTGTCCCCTGGAGGGGCCTGAGGGCGTACGGGGCAGGGA 
ATGCTTTCTCAGGGTCACTTCTGCCCACCTGTTTGAGGTGGAACTCCAAGCAGCTCGC 
ACCTTGGAGCGACTGGAGCTCCAGAGTCTGGAGGCAGCTGAGATAGAGCCGGAGGCCC 
AGGCCCAGGGTCCCCCTCTTGCTGCGCAGGGCTCAGATCTGCTCCCTGGAGCCCCCAT 
CCTCAGTCTGCGCTTCTCCTACATCTGCCCTGACCGGCAGTTGCGTCGCTATTTGGTG 
CTGGAGCCTGATGCCCACGCAGCTGTCCAGGAGCTGCTTGCCGTGTTGACCCCAGTCA 
CCAATGTGGCTCGGGAACAGCTTGGGGAGGCCAGGGACCTCCTGCTGGGTAGATTCCA 
GTGTCTACGCTGTGGCCATGAGTTCAAGCCAGAGGAGCCCAGGATGGGATTAGACAGT 
GAGGAAGGCTGGAGGCCTCTGTTCCAAAAGACAGAATCTCCTGCTGTGTGTCCTAACT 



GAAACAGGGAGAGCAGTCTCTGGCTCCTTCTCCGTCTGCCAGCCCTGTCTGCCACCCT 
CCTGGCCATGGTGACCACCTTGACAGGGCCAAGAACAGCCCACCTCAGGCACCGAGCA 
CCCGTGACCATGGTAGTTGGAGCCTCAGTCCCGCCCCTGAGCGCTGTGGCCTCCGCTC 
TGTGGACCACCGACTCCGGCTCTTCCTGGATGTTGAGGTGTTCAGCGATGCCCAGGAG 
GAGTTCCAGTGCTGCCTCAAGGTCCCAGTGGCATTGGCAGGCCACACTGGGGAGTTCA 
TGTGCCTTGTGGTTGTGTCTGACCGCAGGCTGTACCTGTTGAAGGTGACTGGGGAGAT 



TTCCTGTTAGATGAGGATGCTGCAGGGTCCCCGGCAGAGCCCTCTCCTCCAGCAGCAT 
CTGGCGAAGCCTCTGAGAAGGTGCCTCCCTCGGGGCCGGGCCCTGCTGTGCGTGTCAG 
GGAGCAGCAGCCACTCAGCAGCCTGAGCTCCGTGCTGCTCTACCGCTCAGCCCCTGAG 
GACTTGCGGCTGCTCTTCTACGATGAGGTGTCCCGGCTGGAGAGCTTTTGGGCACTCC 
GTGTGGTGTGTCAGGAGCAGCTGACAGCCCTGCTTGCCTGGATCCGGGAACCATGGGA 
GGAGCTGTTTTCCATCGGACTCCGGACAGTGATCCAAGAGGCGCTGGCCCTTGACCGA 
T< A_,_ r - CCACGCTGACCTTGGCCCTGACCTCAGGAGCCACGCT 



ORF S tart: ATG at [ORF Stop: TGA at 3307 
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SEQ [D NO: 354 1098 aa MW at 121004. lkD 


NOV 127a, 
CG59971-01 Protein 
Sequence 


MTTAQRDSLLWKLAGLLRESGDWLSGCSTLSLLTPTLQQLNHVFELHLGPWGPGQTG 
FVALPSHPADSPVILQLQFLFDVLQKTLSLKLVHVAGPGPTGP I KI FPFKSLRHLELR 
GVPLHCLHGLRGIYSQLETLICSRSLQALEELLSACGGDFCSALPWIALLSANFSYNA 
LTALDSSLRLLSALRFLNLSHNQVQDCQGFLMDLCELHHLDISYNRLHLVPRMGPSGA 

LWFHPEHRAATAQYLSPRARDAATGFLLDGKVLSLTDFQQTHTSLGLSPMGPPLPWPV 
GSTPETSGGPDLSDSLSSGGWTQPLLHKVKSRVRVRRASISEPSDTDPEPRTLNPSP 
AGWFVQQHPELELMSSFRERFGRNWLQYRSHLEPSGNPLPATPTTSAPSAPPASSQGP 
DTAPRPSPPQEEARGPQESPQKMSEEVRAEPQEEEEEKEGKEEKEEGEMVEQGEEEAG 
EEEEEEQDQKEVEAELCRPLLVCPLEGPEGVRGRECFLRVTSAHLFEVELQAARTLER 
LELQSLEAAEIEPEAQAQGPPLAAQGSDLLPGAPILSLRFSYICPDRQLRRYLVLEPD 
AHAAVQELLAVLTPVTNVAREQLGEARDLLLGRFQCLRCGHEFKPEEPRMGLDSEEGW 
RPLFQKTESPAVCPNCGSDHWLLAVSRGTPNRERKQGEQSLAPSPSASPVCHPPGHG 
DHLDRAKNSPPQAPSTRDHGSWSLSPAPERCGLRSVDHRLRLFLDVEVFSDAQEEFQC 
CLKVPVALAGHTGEFMCLVWSDRRLYLLKVTGEMSEPPASWLQLTLAVPLQDLSGIE 
LGLAGQSLRLEWAAGAGRCVLLPRDARHCRAFLEELLGVLQSLPPAWRNCVSATEEEV 
TPQHRLWPLLEKDSSLEARQFFYLRAFLVEGEASVQLMLPSTCLVSLLLTPSTLFLLD 
EDAAGSPAEPSPPAASGEASEKVPPSGPGPAVRVREQQPLSSLSSVLLYRSAPEDLRL 
u i - L ir EMIT -ir^tvLF-L I QEALAI 




SEQ ID NO: 355 ,3348 bp 


NOV127b, 

CG59971-02 DNA Sequence 


TGCTGCGGGAGTCCGGTGATGTGGTCCTGTCTGGCTGTAGCACCCTGAGCCTGCTGAC 
TCCCACACTGCAACAGCTGAACCACGTATTTGAGCTGCACCTGGGGCCATGGGGCCCT 
GGCCAGACAGGCTTTGTGGCTCTGCCCTCCCATCCTGCCGACTCCCCTGTTATTCTTC 
AGCTTCAGTTTCTCTTCGATGTGCTGCAGAAAACACTTTCACTCAAGCTGGTCCATGT 
TGCTGGTCCTGGCCCCACAGGGCCCATCAAGATTTTCCCCTTCAAATCCCTTCGGCAC 
CTGGAGCTCCGAGGTGTTCCCCTCCACTGTCTGCATGGCCTCCGAGGCATCTACTCCC 
AGCTGGAGACCCTGATTTGCAGCAGGAGCCTCCAGGCATTAGAGGAGCTCCTCTCAGC 
CTGCGGCGGCGACTTCTGCTCTGCCCTCCCTTGGCTGGCTCTGCTTTCTGCCAACTTC 
AGCTACAATGCACTGACCGCCTTAGACAGCTCCCTGCGCCTCTTGTCAGCTCTGCGTT 
TCTTGAACCTAAGCCACAATCAAGTCCAGGACTGTCAGGGATTCCTGATGGATTTGTG 
TGAGCTCCACCATCTGGACATCTCCTATAATCGCCTGCATTTGGTGCCAAGAATGGGA 
CCCTCAGGGGCTGCTCTGGGGGTCCTGATACTGCGAGGCAATGAGCTTCGGAGCCTGC 
CAGGCCTAGAGCAGCTGAGGAATCTGCGGCACCTGGATTTGGCATACAACCTGCTGGA 
AGGACACCGGGAGCTGTCACCACTGTGGCTGCTGGCTGAGCTCCGCAAGCTCTACCTG 
GAGGGGAACCCTCTTTGGTTCCACCCTGAGCACCGAGCAGCCACTGCCCAGTACTTGT 
CACCCCGGGCCAGGGATGCTGCTACTGGCTTCCTTCTCGATGGCAAGGTCTTGTCACT 
GACAGATTTTCAGCAGACTCACACATCCTTGGGGCTCAGCCCCATGGGCCCACCTTTG 
CCCTGGCCAGTGGGGAGTACTCCTGAAACCTCAGGTGGCCCTGACCTGAGTGACAGCC 
TCTCCTCAGGGGGTGTTGTGACCCAGCCCCTGCTTCATAAGGTTAAGAGCCGAGTCCG 
TGTGAGGCGGGCAAGCATCTCTGAACCCAGTGATACGGACCCGGAGCCCCGAACTCTG 
AACCCCTCTCCGGCTGGTTGGTTCGTGCAGCAGCACCCGGAGCTGGAGCTCATGAGCA 
GCTTCCGGGAACGGTTCGGCCGCAACTGGCTGCAGTACAGGAGTCACCTGGAGCCCTC 
CGGAAACCCTCTGCCGGCCACCCCCACTACTTCTGCACCCAGTGCACCTCCAGCCAGC 
TCCCAGGGCCCCGACACTGCACCCAGACCTTCACCCCCGCAGGAGGAAGCCAGAGGCC 
CCCAGGAGTCACCACAGAAAATGTCAGAGGAGGTCAGGGCGGAGCCACAGGAGGAGGA 
AGAGGAGAAGGAGGGGAAGGAGGAGAAGGAGGAGGGGGAGATGGTGGAACAGGGAGAA 
GAGGAGGCAGGAGAGGAGGAAGAAGAGGAGCAGGACCAGAAGGAAGTGGAAGCGGAAC 
TCTGTCGCCCCTTGTTGGTGTGTCCCCTGGAGGGGCCTGAGGGCGTACGGGGCAGGGA 
ATGCTTTCTCAGGGTCACTTCTGCCCACCTGTTTGAGGTGGAACTCCAAGCAGCTCGC 
ACCTTGGAGCGACTGGAGCTCCAGAGTCTGGAGGCAGCTGAGATAGAGCCGGAGGCCC 
AGGCCCAGAGGTCGCCCAGGCCCACGGGCTCAGATCTGCTCCCTGGAGCCCCCATCCT 
CAGTCTGCGCTTCTCCTACATCTGCCCTGACCGGCAGTTGCGTCGCTATTTGGTGCTG 
GAGCCTGATGCCCACGCAGCTGTCCAGGAGCTGCTTGCCGTGTTGACCCCAGTCACCA 
ATGTGGCTCGGGAACAGCTTGGGGAGGCCAGGGACCTCCTGCTGGGTAGATTCCAGTG 
TCTACGCTGTGGCCATGAGTTCAAGCCAGAGGAGCCCAGGATGGGATTAGACAGTGAG 
GAAGGCTGGAGGCCTCTGTTCCAAAAGACAGAATCTCCTGCTGTGTGTCCTAACTGTG 
GTAGTGACCACGTGGTTCTCCTCGCTGTGTCTCGGGGAACCCCCAACAGGGAGCGGAA 
ACAGGGAGAGCAGTCTCTGGCTCCTTCTCCGTCTGCCAGCCCTGTCTGCCACCCTCCT 
GGCCATGGTGACCACCTTGACAGGGCCAAGAACAGCCCACCTCAGGCACCGAGCACCC 
GTGACCATGGTAGTTGGAGCCTCAGTCCCGCCCCTGAGCGCTGTGGCCTCCGCTCTGT 
GGACCACCGACTCCGGCTCTTCCTGGATGTTGAGGTGTTCAGCGATGCCCAGGAGGAG 
TTCCAGTGCTGCCTCAAGGTCCCAGTGGCATTGGCAGGCCACACTGGGGAGTTCATGT 




TGAGCCTCCAGCTAGCTGGCTGCAGCTGACCCTGGCTGTTCCCCTGCAGGATCTGAGT 
GGCATAGAGCTGGGCCTGGCAGGCCAGAGCCTGCGGCTAGAGTGGGCAGCTGGGGCGG 
GCCGCTGTGTGCTGCTGCCCCGAGATGCCAGGCATTGCCGGGCCTTCCTAGAGGAGCT 
CCTTGGTGTCTTGCAGTCTCTGCCCCCTGCCTGGAGGAACTGTGTCAGTGCCACAGAG 
GAGGAGGTCACCCCCCAGCACCGGCTCTGGCCATTGCTGGAAAAAGACTCATCCTTGG 
AGGCTCGCCAGTTCTTCTACCTTCGGGCGTTCCTGGTTGAAGGTGAAGCCTCTGTGCA 
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CTGTTAGATGAGGATGCTSCAGGGTCCCCGGCAGAGCCCTCTCCTCCAGCAGCATCTG 
GCGAAGCCTCTGAGAAGGTGCCTCCCTCGGGGCCGGGCCCTGCTGTGCGTGTCAGGGA 
GCAGCAGCCACTCAGCAGCCTGAGCTCCGTGCTGCTCTACCGCTCAGCCCCTGAGGAC 
TTGCGGCTGCTCTTCTACGATGAGGTGTCCCGGCTGGAGAGCTTTTGGGCACTCCGTG 
TGGTGTGTCAGGAGCAGCTGACAGCCCTGCTTGCCTGGATCCGGGAACCATGGGAGGA 




GGGTCCCACGCTGACCTTGGCCCTGACCTCAGGAGCCACGCT 




ORF Start: ATG at 
13 


ORF Stop: TGA at 3304 




SEQ ID NO: 356 


1097 aa |MW at 121064.1kD 


NOV 127b, 
CG59971-02 Protein 
Sequence 


MTTAQRDSLLWKLAGLLRESGDWLSGCSTLSLLTPTLQQLNHVFELHLGPWGPGQTG 
FVALPSHPADSPVILQLQFLFDVLQKTLSLKLVHVAGPGPTGPIKIFPFKSLRHLELR 
GVPLHCLHGLRGIYSQLETLICSRSLQALEELLSACGGDFCSALPWLALLSANFSYNA 
LTALDSSLRLLSALRFLNLSHNQVQDCQGFLMDLCELHHLDISYNRLHLVPRMGPSGA 
ALGVLILRGNELRSLPGLEQLRNLRHLDLAYNLLEGHRELSPLWLLAELRKLYLEGNP 
LWFHPEHRAATAQYLSPRARDAATGFLLDGKVLSLTDFQQTHTSLGLSPMGPPLPWPV 
GSTPETSGGPDLSDSLSSGGWTQPLLHKVKSRVRVRRASISEPSDTDPEPRTLNPSP 
AGWFVQQHPELELMSSFRERFGRNWLQYRSHLEPSGNPLPATPTTSAPSAPPASSQGP 
DTAPRPSPPQEEARGPQESPQKMSEEVRAEPQEEEEEKEGKEEKEEGEMVEQGEEEAG 
EEEEEEQDQKEVEAELCRPLLVCPLEGPEGVRGRECFLRVTSAHLFEVELQAARTLER 
LELQSLEAAEIEPEAQAQRSPRPTGSDLLPGAPILSLRFSYICPDRQLRRYLVLEPDA 
HAAVQELLAVLTPVTNVAREQLGEARDLLLGRFQCLRCGHEFKPEEPRMGLDSEEGWR 
PLFQKTESPAVCPNCGSDHWLLAVSRGTPNRERKQGEQSLAPSPSASPVCHPPGHGD 
HLDRAKKSPPQAPSTRDHGSWSLSPAPERCGLRSVDHRLRLFLDVEVFSDAQEEFQCC 
LKVPVALAGHTGEFMCLVWSDRRLYLLKVTGEMSEPPASWLQLTLAVPLQDLSGIEL 
GLAGQSLRLEWAAGAGRCVLLPRDARHCRAFLEELLGVLQSLPPAWRNCVSATEEEVT 
PQHRLWPLLEKDSSLEARQFFYLRAFLVEGEASVQLMLPSTCLVSLLLTPSTLFLLDE 
DAAGSPAEPSPPAASGEASEKVPPSGPGPAVRVREQQPLSSLSSVLLYRSAPEDLRLL 

- . J <TAL n L T ^ ' i ^ r ' V I F 



Sequence comparison of the above protein sequences yields the following sequence 
relationships shown in Table 127B. 



Table 127B. Comparison of NOV127a against NOV127b. 


Protein Sequence 


NOV127a Residues/ 
Match Residues 


Identities/ 
Similarities for the Matched Region 


NOV 127b 


1 1098 
1..1097 


891/1098 (81%) 
891/1098 (81%) 



Further analysis of the NOV 127a protein yielded the following properties shown in 
Table 127C. 



Table 127C. Protein Sequence Properties \(>\ I2^;i 


PSort 
analysis: 


0.5163 probability located in mitochondrial matrix space; 0.3000 probability 
located in microbody (peroxisome); 0.2442 probability located in 
mitochondrial inner membrane; 0.2442 probability located in mitochondrial 
intermembrane space 


SignalP 
analysis: 


No Known Signal Sequence Predicted 
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A search of the NOV 127a protein against the Geneseq database, a proprietary 
database that contains sequences published in patents and patent publication, yielded 
several homologous proteins shown in Table 127D. 



Table 127D. Geneseq Results for NOV127a 



Geneseq 
Identifier 


Protein/Organism/Length 
[Patent #, Date] 


NOV127a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Region 


Expect 
Value 


AAM39827 


Human polypeptide SEQ ID NO 
2972 - Homo sapiens, 169 aa. 
[WO2001533I2-A1, 26-JUL-2001] 


375..528 
14..167 


140/154 (90%) 
145/154 (93%) 


3e-78 


AAM41613 


Human polypeptide SEQ ID NO 
6544 - Homo sapiens, 1 84 aa. 
[WO200153312-A1, 26-JUL-2001] 


375..528 
29.. 182 


140/154 (90%) 
145/154 (93%) 


4e-78 


AAU 19764 


Human novel extracellular matrix 
protein, Seq ID No 414 - Homo 
sapiens, 211 aa. [WO200155368- 
A1,02-AUG-2001] 


444..647 
13..209 


157/207 (75%) 
160/207 (76%) 


2e-75 


ABB19833 


Protein #1832 encoded by probe 
for measuring heart cell gene 
expression - Homo sapiens, 127 aa. 
[WO200157274-A2, 09-AUG- 
2001] 


409..535 
1..127 


127/127 (100%) 
127/127 (100%) 


2e-70 


AAM67606 


Human bone marrow expressed 
probe encoded protein SEQ ID 
NO: 27912 - Homo sapiens, 127 
aa. [WO200157276-A2, 09-AUG- 
2001] 


409..535 
1..127 


127/127 (100%) 
127/127 (100%) 


2e-70 



In a BLAST search of public sequence databases, the NOV 127a protein was found 
to have homology to the proteins shown in the BLASTP data in Table 127E. 



Table 127E. Public BLASTP Results for NOV 127;, 


Protein 
Accession 
Number 


Protein/Organism/Length 


NOV127a 
Residues/ 

Match 
Residues 


Identities/ 
Similarities for 
the Matched 
Portion 


Expect 
Value 


AAL49726 


LKB 1 -INTERACTING 
PROTEIN 1 - Homo sapiens 
(Human), 1099 aa. 


1..1098 
12..1099 


1077/1098 (98%) 
1078/1098 (98%) 


0.0 



481 



Q96PY9 


KIAA1898 PROTEIN - Homo 
sapiens (Human), 1013 aa 
(fragment). 


76..1098 
1-1013 


1003/1023 (98%) 
1003/1023 (98%) 


0.0 


Q96CN3 


SIMILAR TO RIKEN CDNA 
1200014D22 GENE - Homo 
sapiens (Human), 804 aa 
(fragment). 


288..1098 
4..804 


793/811 (97%) 
793/811 (97%) 


0.0 


Q9DBT7 


1200014D22RIK PROTEIN - 
Mus musculus (Mouse), 1072 aa. 


1..1098 
1..1072 


816/1098 (74%) 
895/1098 (81%) 


0.0 


Q9VMK9 


CG9044 PROTEIN - Drosophila 
melanogaster (Fruit fly), 1289 aa. 


12..433 
8..463 


139/459 (30%) 
220/459 (47%) 


6e-38 



PFam analysis predicts that the NOV 127a protein contains the domains shown in 
the Table 127F. 



Table 127F. Domain Analysis of NOV127a 



Pfam Domain 


" ""' " """ 

NOV127a Match 
Region 


Identities/ 
Similarities 
for the Matched 
Region 


Expect 
Value 


LRR: domain 1 of 5 


164.. 186 


7/25 (28%) 
15/25 (60%) 


2.5e+02 


LRR: domain 2 of 5 


187..209 


6/25 (24%) 
16/25 (64%) 


2.5e+02 


LRR: domain 3 of 5 


210..231 


8/25 (32%) 
13/25 (52%) 


83 


LRR: domain 4 of 5 


233..254 


9/25 (36%) 
17/25 (68%) 


16 


LRR: domain 5 of 5 


255..279 


10/27 (37%) 
19/27 (70%) 


22 


Pkinase C: domain 1 of 
1 


620..629 


5/1 1 (45%) 
9/1 1 (82%) 


8.9 


rubredoxin: domain 1 of 
2 


669..686 


5/18(28%) 
13/18(72%) 


4.6 


rubredoxin: domain 2 of 
2 


708..713 


5/6 (83%) 
6/6(100%) 


1.2e+03 
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Example B: Sequencing Methodology and Identification of NOVX Clones 

1. GeneCalling™ Technology: This is a proprietary method of performing 
differential gene expression profiling between two or more samples developed at CuraGen 
5 and described by Shimkets, et al., "Gene expression analysis by transcript profiling coupled 
to a gene database query" Nature Biotechnology 17:198-803 (1999). cDNA was derived 
from various human samples representing multiple tissue types, normal and diseased states, 
physiological states, and developmental states from different donors. Samples were 
obtained as whole tissue, primary cells or tissue cultured primary cells or cell lines. Cells 

1 0 and cell lines may have been treated with biological or chemical agents that regulate gene 
expression, for example, growth factors, chemokines or steroids. The cDNA thus derived 
was then digested with up to as many as 120 pairs of restriction enzymes and pairs of 
linker-adaptors specific for each pair of restriction enzymes were ligated to the appropriate 
end. The restriction digestion generates a mixture of unique cDNA gene fragments. 

15 Limited PCR amplification is performed with primers homologous to the linker adapter 
sequence where one primer is biotinylated and the other is fluorescently labeled. The 
doubly labeled material is isolated and the fluorescently labeled single strand is resolved by 
capillary gel electrophoresis. A computer algorithm compares the electropherograms from 
an experimental and control group for each of the restriction digestions. This and additional 

20 sequence-derived information is used to predict the identity of each differentially expressed 
gene fragment using a variety of genetic databases. The identity of the gene fragment is 
confirmed by additional, gene-specific competitive PCR or by isolation and sequencing of 
the gene fragment. 

25 2. SeqCalling™ Technology: cDNA was derived from various human samples 
representing multiple tissue types, normal and diseased states, physiological states, and 
developmental states from different donors. Samples were obtained as whole tissue, 
primary cells or tissue cultured primary cells or cell lines. Cells and cell lines may have 
been treated with biological or chemical agents that regulate gene expression, for example, 

30 growth factors, chemokines or steroids. The cDNA thus derived was then sequenced using 
CuraGen's proprietary SeqCalling technology. Sequence traces were evaluated manually 
and edited for corrections if appropriate. cDNA sequences from all samples were 
assembled together, sometimes including public human sequences, using bioinformatic 
programs to produce a consensus sequence for each assembly. Each assembly is included 
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in CuraGen Corporation's database. Sequences were included as components for assembly 
when the extent of identity with another component was at least 95% over 50 bp. Each 
assembly represents a gene or portion thereof and includes information on variants, such as 
splice forms single nucleotide polymorphisms (SNPs), insertions, deletions and other 
5 sequence variations. 

3. PathCalling™ Technology: 

The NOVX nucleic acid sequences are derived by laboratory screening of cDNA 
library by the two-hybrid approach. cDNA fragments covering either the full length of the 
10 DNA sequence, or part of the sequence, or both, are sequenced. In silico prediction was 
based on sequences available in CuraGen Corporation's proprietary sequence databases or 
in the public human sequence databases, and provided either the full length DNA sequence, 
or some portion thereof. 

The laboratory screening was performed using the methods summarized below: 

1 5 cDNA libraries were derived from various human samples representing multiple 

tissue types, normal and diseased states, physiological states, and developmental states 
from different donors. Samples were obtained as whole tissue, primary cells or tissue 
cultured primary cells or cell lines. Cells and cell lines may have been treated with 
biological or chemical agents that regulate gene expression, for example, growth factors, 

20 chemokines or steroids. The cDNA thus derived was then directionally cloned into the 
appropriate two-hybrid vector (Gal4-activation domain (Gal4-AD) fusion). Such cDNA 
libraries as well as commercially available cDNA libraries from Clontech (Palo Alto, CA) 
were then transferred from E.coli into a CuraGen Corporation proprietary yeast strain 
(disclosed in U. S. Patents 6,057,101 and 6,083,693, incorporated herein by reference in 

25 their entireties). 

Gal4-binding domain (Gal4-BD) fusions of a CuraGen Corportion proprietary 
library of human sequences was used to screen multiple Gal4-AD fusion cDNA libraries 
resulting in the selection of yeast hybrid diploids in each of which the Gal4-AD fusion 
contains an individual cDNA. Each sample was amplified using the polymerase chain 
30 reaction (PCR) using non-specific primers at the cDNA insert boundaries. Such PCR 
product was sequenced; sequence traces were evaluated manually and edited for 
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corrections if appropriate. cDNA sequences from all samples were assembled together, 
sometimes including public human sequences, using bioinformatic programs to produce a 
consensus sequence for each assembly. Each assembly is included in CuraGen 
Corporation's database. Sequences were included as components for assembly when the 
5 extent of identity with another component was at least 95% over 50 bp. Each assembly 
represents a gene or portion thereof and includes information on variants, such as splice 
forms single nucleotide polymorphisms (SNPs), insertions, deletions and other sequence 
variations. 

Physical clone: the cDNA fragment derived by the screening procedure, covering 
the entire open reading frame is, as a recombinant DNA, cloned into pACT2 plasmid 
(Clontech) used to make the cDNA library. The recombinant plasmid is inserted into the 
host and selected by the yeast hybrid diploid generated during the screening procedure by 
the mating of both CuraGen Corporation proprietary yeast strains N106' and YULH (U. S. 
Patents 6,057,101 and 6,083,693). 

4. RACE: Techniques based on the polymerase chain reaction such as rapid 
amplification of cDNA ends (RACE), were used to isolate or complete the predicted 
sequence of the cDNA of the invention. Usually multiple clones were sequenced from one 
or more human samples to derive the sequences for fragments. Various human tissue 
samples from different donors were used for the RACE reaction. The sequences derived 
from these procedures were included in the SeqCalling Assembly process described in 
preceding paragraphs. 

5. Exon Linking: The NOVX target sequences identified in the present invention 
25 were subjected to the exon linking process to confirm the sequence. PCR primers were 

designed by starting at the most upstream sequence available, for the forward primer, and 
at the most downstream sequence available for the reverse primer. Table Bl shows the 
sequences of the PCR primers used for obtaining different clones. In each case, the 
sequence was examined, walking inward from the respective termini toward the coding 
30 sequence, until a suitable sequence that is either unique or highly selective was 

encountered, or, in the case of the reverse primer, until the stop codon was reached. Such 
primers were designed based on in silico predictions for the full length cDNA, part (one or 
more exons) of the DNA or protein sequence of the target sequence, or by translated 
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homology of the predicted exons to closely related human sequences from other species. 
These primers were then employed in PCR amplification based on the following pool of 
human cDNAs: adrenal gland, bone marrow, brain - amygdala, brain - cerebellum, brain - 
hippocampus, brain - substantia nigra, brain - thalamus, brain -whole, fetal brain, fetal 
5 kidney, fetal liver, fetal lung, heart, kidney, lymphoma - Raji, mammary gland, pancreas, 
pituitary gland, placenta, prostate, salivary gland, skeletal muscle, small intestine, spinal 
cord, spleen, stomach, testis, thyroid, trachea, uterus. Usually the resulting amplicons were 
gel purified, cloned and sequenced to high redundancy. The PCR product derived from 
exon linking was cloned into the pCR2.1 vector from Invitrogen. The resulting bacterial 

10 clone has an insert covering the entire open reading frame cloned into the pCR2.1 vector. 
The resulting sequences from all clones were assembled with themselves, with other 
fragments in CuraGen Corporation's database and with public ESTs. Fragments and ESTs 
were included as components for an assembly when the extent of their identity with another 
component of the assembly was at least 95% over 50 bp. In addition, sequence traces were 

15 evaluated manually and edited for corrections if appropriate. These procedures provide the 
sequence reported herein. 

6. Physical Clone: 

Exons were predicted by homology and the intron/exon boundaries were 
20 determined using standard genetic rules. Exons were further selected and refined by means 
of similarity determination using multiple BLAST (for example, tBlastN, BlastX, and 
BlastN) searches, and, in some instances, GeneScan and Grail. Expressed sequences from 
both public and proprietary databases were also added when available to further define and 
complete the gene sequence. The DNA sequence was then manually corrected for apparent 
25 inconsistencies thereby obtaining the sequences encoding the full-length protein. 

The PCR product derived by exon linking, covering the entire open reading frame, 
was cloned into the pCR2.1 vector from Invitrogen to provide clones used for expression 
and screening purposes. 

30 

Example C: Quantitative expression analysis of clones in various cells and tissues 

The quantitative expression of various clones was assessed using microtiter plates 
containing RNA samples from a variety of normal and pathology-derived cells, cell lines 
35 and tissues using real time quantitative PCR (RTQ PCR). RTQ PCR was performed on an 
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Applied Biosystems ABI PRISM® 7700 or an ABI PRISM® 7900 HT Sequence Detection 
System. Various collections of samples are assembled on the plates, and referred to as 
Panel 1 (containing normal tissues and cancer cell lines), Panel 2 (containing samples 
derived from tissues from normal and cancer sources), Panel 3 (containing cancer cell 
5 lines), Panel 4 (containing cells and cell lines from normal tissues and cells related to 
inflammatory conditions), Panel 5D/5I (containing human tissues and cell lines with an 
emphasis on metabolic diseases), AI_comprehensive_panel (containing normal tissue and 
samples from autoimmune diseases), Panel CNSD.01 (containing central nervous system 
samples from normal and diseased brains) and CNS_neurodegeneration_panel (containing 
10 samples from normal and Alzheimer's diseased brains). 

RNA integrity from all samples is controlled for quality by visual assessment of 
agarose gel electropherograms using 28S and 18S ribosomal RNA staining intensity ratio 
as a guide (2:1 to 2.5:1 28s: 18s) and the absence of low molecular weight RNAs that would 
be indicative of degradation products. Samples are controlled against genomic DNA 
1 5 contamination by RTQ PCR reactions run in the absence of reverse transcriptase using 
probe and primer sets designed to amplify across the span of a single exon. 

First, the RNA samples were normalized to reference nucleic acids such as 
constitutively expressed genes (for example, (3-actin and GAPDH). Normalized RNA (5 ul) 
was converted to cDNA and analyzed by RTQ-PCR using One Step RT-PCR Master Mix 
20 Reagents (Applied Biosystems; Catalog No. 4309169) and gene-specific primers according 
to the manufacturer's instructions. 

In other cases, non-normalized RNA samples were converted to single strand 
cDNA (sscDNA) using Superscript II (Invitrogen Corporation; Catalog No. 18064-147) 
and random hexamers according to the manufacturer's instructions. Reactions containing 
25 up to 1 0 ug of total RNA were performed in a volume of 20 ul and incubated for 60 

minutes at 42°C. This reaction can be scaled up to 50 ug of total RNA in a final volume of 
100 ul. sscDNA samples are then normalized to reference nucleic acids as described 
previously, using IX TaqMan® Universal Master mix (Applied Biosystems; catalog No. 
4324020), following the manufacturer's instructions. 

30 Probes and primers were designed for each assay according to Applied Biosystems 

Primer Express Software package (version I for Apple Computer's Macintosh Power PC) 
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or a similar algorithm using the target sequence as input. Default settings were used for 
reaction conditions and the following parameters were set before selecting primers: primer 
concentration = 250 nM, primer melting temperature (Tm) range = 58°-60°C, primer 
optimal Tm = 59°C, maximum primer difference = 2°C, probe does not have 5'G, probe Tm 
5 must be 10°C greater than primer Tm, amplicon size 75bp to lOObp. The probes and 

primers selected (see below) were synthesized by Synthegen (Houston, TX, USA). Probes 
were double purified by HPLC to remove uncoupled dye and evaluated by mass 
spectroscopy to verify coupling of reporter and quencher dyes to the 5' and 3' ends of the 
probe, respectively. Their final concentrations were: forward and reverse primers, 900nM 
10 each, and probe, 200nM. 

PCR conditions: When working with RNA samples, normalized RNA from each 
tissue and each cell line was spotted in each well of either a 96 well or a 384-well PCR 
plate (Applied Biosystems). PCR cocktails included either a single gene specific probe and 
primers set, or two multiplexed probe and primers sets (a set specific for the target clone 

15 and another gene-specific set multiplexed with the target probe). PCR reactions were set up 
using TaqMan® One-Step RT-PCR Master Mix (Applied Biosystems, Catalog No. 
4313803) following manufacturer's instructions. Reverse transcription was performed at 
48°C for 30 minutes followed by amplification/PCR cycles as follows: 95°C 10 min, then 
40 cycles of 95°C for 15 seconds, 60°C for 1 minute. Results were recorded as CT values 

20 (cycle at which a given sample crosses a threshold level of fluorescence) using a log scale, 
with the difference in RNA concentration between a given sample and the sample with the 
lowest CT value being represented as 2 to the power of delta CT. The percent relative 
expression is then obtained by taking the reciprocal of this RNA difference and multiplying 
by 100. 

25 When working with sscDNA samples, normalized sscDNA was used as described 

previously for RNA samples. PCR reactions containing one or two sets of probe and 
primers were set up as described previously, using IX TaqMan® Universal Master mix 
(Applied Biosystems; catalog No. 4324020), following the manufacturer's instructions. 
PCR amplification was performed as follows: 95°C 10 min, then 40 cycles of 95°C for 15 

30 seconds, 60°C for 1 minute. Results were analyzed and processed as described previously. 
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Panels 1, 1.1, 1.2, and 1.3D 

The plates for Panels 1, 1.1, 1.2 and 1. 3D include 2 control wells (genomic DNA 
control and chemistry control) and 94 wells containing cDNA from various samples. The 
samples; in these panels are broken into 2 classes: samples derived from cultured cell lines 
5 and samples derived from primary normal tissues. The cell lines are derived from cancers 
of the following types: lung cancer, breast cancer, melanoma, colon cancer, prostate 
cancer, CNS cancer, squamous cell carcinoma, ovarian cancer, liver cancer, renal cancer, 
gastric cancer and pancreatic cancer. Cell lines used in these panels are widely available 
through the American Type Culture Collection (ATCC), a repository for cultured cell lines, 

10 and were cultured using the conditions recommended by the ATCC. The normal tissues 
found on these panels are comprised of samples derived from all major organ systems from 
single adult individuals or fetuses. These samples are derived from the following organs: 
adult skeletal muscle, fetal skeletal muscle, adult heart, fetal heart, adult kidney, fetal 
kidney, adult liver, fetal liver, adult lung, fetal lung, various regions of the brain, the 

15 spleen, bone marrow, lymph node, pancreas, salivary gland, pituitary gland, adrenal gland, 
spinal cord, thymus, stomach, small intestine, colon, bladder, trachea, breast, ovary, uterus, 
placenta, prostate, testis and adipose. 

In the results for Panels 1, 1.1, 1.2 and 1.3D, the following abbreviations are used: 

ca. = carcinoma, 
20 * = established from metastasis, 

met = metastasis, 

s cell var = small cell variant, 

non-s - non-sm = non-small, 

squam = squamous, 
25 pi. eff = pi effusion = pleural effusion, 

glio = glioma, 

astro = astrocytoma, and 

neuro = neuroblastoma. 

General_screening_panel_vl .4 

30 The plates for Panel 1 .4 include 2 control wells (genomic DNA control and 

chemistry control) and 94 wells containing cDNA from various samples. The samples in 
Panel 1 .4 are broken into 2 classes: samples derived from cultured cell lines and samples 
derived from primary normal tissues. The cell lines are derived from cancers of the 
following types: lung cancer, breast cancer, melanoma, colon cancer, prostate cancer, CNS 
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cancer, squamous cell carcinoma, ovarian cancer, liver cancer, renal cancer, gastric cancer 
and pancreatic cancer. Cell lines used in Panel 1 .4 are widely available through the 
American Type Culture Collection (ATCC), a repository for cultured cell lines, and were 
cultured using the conditions recommended by the ATCC. The normal tissues found on 
5 Panel 1 . 4 are comprised of pools of samples derived from all major organ systems from 2 
to 5 different adult individuals or fetuses. These samples are derived from the following 
organs: adult skeletal muscle, fetal skeletal muscle, adult heart, fetal heart, adult kidney, 
fetal kidney, adult liver, fetal liver, adult lung, fetal lung, various regions of the brain, the 
spleen, bone marrow, lymph node, pancreas, salivary gland, pituitary gland, adrenal gland, 
10 spinal cord, thymus, stomach, small intestine, colon, bladder, trachea, breast, ovary, uterus, 
placenta, prostate, testis and adipose. Abbreviations are as described for Panels 1, 1.1, 1.2, 
and 1.3D. 

Panels 2D and 2.2 

The plates for Panels 2D and 2.2 generally include 2 control wells and 94 test 

1 5 samples composed of RNA or cDNA isolated from human tissue procured by surgeons 
working in close cooperation with the National Cancer Institute's Cooperative Human 
Tissue Network (CHTN) or the National Disease Research Initiative (NDRI). The tissues 
are derived from human malignancies and in cases where indicated many malignant tissues 
have "matched margins" obtained from noncancerous tissue just adjacent to the tumor. 

20 These are termed normal adjacent tissues and are denoted "NAT" in the results below. The 
tumor tissue and the "matched margins" are evaluated by two independent pathologists (the 
surgical pathologists and again by a pathologist at NDRI or CHTN). This analysis provides 
a gross histopathological assessment of tumor differentiation grade. Moreover, most 
samples include the original surgical pathology report that provides information regarding 

25 the clinical stage of the patient. These matched margins are taken from the tissue 

surrounding (i.e. immediately proximal) to the zone of surgery (designated "NAT", for 
normal adjacent tissue, in Table RR). In addition, RNA and cDNA samples were obtained 
from various human tissues derived from autopsies performed on elderly people or sudden 
death victims (accidents, etc.). These tissues were ascertained to be free of disease and 

30 were purchased from various commercial sources such as Clontech (Palo Alto, CA), 
Research Genetics, and Invitrogen. 
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Panel 3D 



The plates of Panel 3D are comprised of 94 cDNA samples and two control 
samples. Specifically, 92 of these samples are derived from cultured human cancer cell 
lines, 2 samples of human primary cerebellar tissue and 2 controls. The human cell lines 
5 are generally obtained from ATCC (American Type Culture Collection), NCI or the 
German tumor cell bank and fall into the following tissue groups: Squamous cell 
carcinoma of the tongue, breast cancer, prostate cancer, melanoma, epidermoid carcinoma, 
sarcomas, bladder carcinomas, pancreatic cancers, kidney cancers, leukemias/lymphomas, 
ovarian/uterine/cervical, gastric, colon, lung and CNS cancer cell lines. In addition, there 
10 are two independent samples of cerebellum. These cells are all cultured under standard 

recommended conditions and RNA extracted using the standard procedures. The cell lines 
in panel 3D and 1.3D are of the most common cell lines used in the scientific literature. 

Panels 4D, 4R, and 4.1D 

Panel 4 includes samples on a 96 well plate (2 control wells, 94 test samples) 
15 composed of RNA (Panel 4R) or cDNA (Panels 4D/4.1D) isolated from various human cell 
lines or tissues related to inflammatory conditions. Total RNA from control normal tissues 
such as colon and lung (Stratagene, La Jolla, CA) and thymus and kidney (Clontech) was 
employed. Total RNA from liver tissue from cirrhosis patients and kidney from lupus 
patients was obtained from BioChain (Biochain Institute, Inc., Hayward, CA). Intestinal 
20 tissue for RNA preparation from patients diagnosed as having Crohn's disease and 

ulcerative colitis was obtained from the National Disease Research Interchange (NDRI) 
(Philadelphia, PA). 

Astrocytes, lung fibroblasts, dermal fibroblasts, coronary artery smooth muscle 
cells, small airway epithelium, bronchial epithelium, microvascular dermal endothelial 

25 cells, microvascular lung endothelial cells, human pulmonary aortic endothelial cells, 

human umbilical vein endothelial cells were all purchased from Clonetics (Walkersville, 
MD) and grown in the media supplied for these cell types by Clonetics. These primary cell 
types were activated with various cytokines or combinations of cytokines for 6 and/or 12- 
14 hours, as indicated. The following cytokines were used; IL-1 beta at approximately 1- 

30 5ng/ml, TNF alpha at approximately 5-1 Ong/ml, IFN gamma at approximately 20-50ng/ml, 
IL-4 at approximately 5-1 Ong/ml, IL-9 at approximately 5-10ng/ml, IL-13 at approximately 
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5-10ng/ml. Endothelial cells were sometimes starved for various times by culture in the 
basal media from Clonetics with 0.1% serum. 

Mononuclear cells were prepared from blood of employees at CuraGen 
Corporation, using Ficoll. LAK cells were prepared from these cells by culture in DMEM 
5 5% FCS (Hyclone), lOOuM non essential amino acids (Gibco/Life Technologies, 

Rockville, MD), ImM sodium pyruvate (Gibco), mercaptoethanol 5.5x1 0~ 5 M (Gibco), and 
lOmM Hepes (Gibco) and Interleukin 2 for 4-6 days. Cells were then either activated with 
10-20ng/ml PMA and l-2u.g/ml ionomycin, IL-12 at 5-10ng/ml, IFN gamma at 20-50ng/ml 
and IL-18 at 5-10ng/ml for 6 hours. In some cases, mononuclear cells were cultured for 4-5 

10 days in DMEM 5% FCS (Hyclone), lOOuM non essential amino acids (Gibco), ImM 

sodium pyruvate (Gibco), mercaptoethanol 5.5xlO~ 5 M (Gibco), and lOmM Hepes (Gibco) 
with PHA (phytohemaggfutinin) or PWM (pokeweed mitogen) at approximately 5ug/ml. 
Samples were taken at 24, 48 and 72 hours for RNA preparation. MLR (mixed lymphocyte 
reaction) samples were obtained by taking blood from two donors, isolating the 

1 5 mononuclear cells using Ficoll and mixing the isolated mononuclear cells 1:1 at a final 
concentration of approximately 2xl0 6 cells/ml in DMEM 5% FCS (Hyclone), lOOjiM non 
essential amino acids (Gibco), ImM sodium pyruvate (Gibco), mercaptoethanol (5.5x10" 
5 M) (Gibco), and lOmM Hepes (Gibco). The MLR was cultured and samples taken at 
various time points ranging from 1- 7 days for RNA preparation. 

20 Monocytes were isolated from mononuclear cells using CD 14 Miltenyi Beads, +ve 

VS selection columns and a Vario Magnet according to the manufacturer's instructions. 
Monocj/tes were differentiated into dendritic cells by culture in DMEM 5% fetal calf serum 
(FCS) (Hyclone, Logan, UT), lOOuM non essential amino acids (Gibco), ImM sodium 
pyruvate (Gibco), mercaptoethanol 5.5xlO~ 5 M (Gibco), and lOmM Hepes (Gibco), 50ng/ml 

25 GMCSF and 5ng/ml IL-4 for 5-7 days. Macrophages were prepared by culture of 

monocytes for 5-7 days in DMEM 5% FCS (Hyclone), lOOuM non essential amino acids 
(Gibco), ImM sodium pyruvate (Gibco), mercaptoethanol 5.5x1 0'^M (Gibco), lOmM 
Hepes (Gibco) and 10% AB Human Serum or MCSF at approximately 50ng/ml. 
Monocytes, macrophages and dendritic cells were stimulated for 6 and 12-14 hours with 

30 lipopolysaccharide (LPS) at 1 OOng/ml. Dendritic cells were also stimulated with anti-CD40 
monoclonal antibody (Pharmingen) at 10ng/ml for 6 and 12-14 hours. 
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CD4 lymphocytes, CD8 lymphocytes and NK cells were also isolated from 
mononuclear cells using CD4, CD8 and CD56 Miltenyi beads, positive VS selection 
columns and a Vario Magnet according to the manufacturer's instructions. CD45RA and 
CD45RO CD4 lymphocytes were isolated by depleting mononuclear cells of CDS, CD56, 
CD14 and CD19 cells using CD8, CD56, CD14 and CD19 Miltenyi beads and positive 
selection. CD45RO beads were then used to isolate the CD45RO CD4 lymphocytes with 
the remaining cells being CD45RA CD4 lymphocytes. CD45RA CD4, CD45RO CD4 and 
CD8 lymphocytes were placed in DMEM 5% FCS (Hyclone), lOOuM non essential amino 
acids (Gibco), ImM sodium pyruvate (Gibco), mercaptoethanol 5.5xlO~ 5 M (Gibco), and 
lOmM Hepes (Gibco) and plated at 10 6 cells/ml onto Falcon 6 well tissue culture plates that 
had been coated overnight with 0.5|j.g/ml anti-CD28 (Pharmingen) and 3ug/ml anti-CD3 
(OKT3, ATCC) in PBS. After 6 and 24 hours, the cells were harvested for RNA 
preparation. To prepare chronically activated CD8 lymphocytes, we activated the isolated 
CD8 lymphocytes for 4 days on anti-CD28 and anti-CD3 coated plates and then harvested 
the cells and expanded them in DMEM 5% FCS (Hyclone), lOOuM non essential amino 
acids (Gibco), ImM sodium pyruvate (Gibco), mercaptoethanol 5.5x1 0" 5 M (Gibco), and 
lOmM Hepes (Gibco) and IL-2. The expanded CDS cells were then activated again with 
plate bound anti-CD3 and anti-CD28 for 4 days and expanded as before. RNA was isolated 
6 and 24 hours after the second activation and after 4 days of the second expansion culture. 
The isolated NK cells were cultured in DMEM 5% FCS (Hyclone), lOOuM non essential 
amino acids (Gibco), ImM sodium pyruvate (Gibco), mercaptoethanol 5.5xlO~ 5 M (Gibco), 
and lOmM Hepes (Gibco) and IL-2 for 4-6 days before RNA was prepared. 

To obtain B cells, tonsils were procured from NDRI. The tonsil was cut up with 
sterile dissecting scissors and then passed through a sieve. Tonsil cells were then spun 
down and resupended at 10 6 cells/ml in DMEM 5% FCS (Hyclone), lOOuM non essential 
amino acids (Gibco), ImM sodium pyruvate (Gibco), mercaptoethanol 5.5x1 0" 5 M (Gibco), 
and lOmM Hepes (Gibco). To activate the cells, we used PWM at 5ug/ml or anti-CD40 
(Pharmingen) at approximately lOug/ml and IL-4 at 5-10ng/ml. Cells were harvested for 
RNA preparation at 24,48 and 72 hours. 

To prepare the primary and secondary Thl/Th2 and Trl cells, six-well Falcon 
plates were coated overnight with 10u.g/ml anti-CD28 (Pharmingen) and 2ug/ml OKT3 
(ATCC), and then washed twice with PBS. Umbilical cord blood CD4 lymphocytes 
(Poietic Systems, German Town, MD) were cultured at 10 5 -10 6 cells/ml in DMEM 5% FCS 
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(Hyclone), IOOjxM non essential amino acids (Gibco), ImM sodium pyruvate (Gibco), 
mercaptoethanol 5.5xl0~ 5 M (Gibco), lOmM Hepes (Gibco) and IL-2 (4ng/ml). IL-12 
(5ng/ml) and anti-IL4 (1 ug/ml) were used to direct to Thl, while IL-4 (5ng/ml) and anti- 
IFN gamma (1 |ig/ml) were used to direct to Th2 and IL-1 0 at 5ng/ml was used to direct to 
5 Trl . After 4-5 days, the activated Thl, Th2 and Trl lymphocytes were washed once in 
DMEM and expanded for 4-7 days in DMEM 5% FCS (Hyclone), 100|jM non essential 
amino acids (Gibco), ImM sodium pyruvate (Gibco), mercaptoethanol 5.5xl0~ 5 M (Gibco), 
lOmM Hepes (Gibco) and IL-2 (lng/ml). Following this, the activated Thl, Th2 and Trl 
lymphocytes were re-stimulated for 5 days with anti-CD28/OKT3 and cytokines as 

1 0 described above, but with the addition of anti-CD95L (1 ug/ml) to prevent apoptosis. After 
4-5 days, the Thl, Th2 and Trl lymphocytes were washed and then expanded again with 
IL-2 for 4-7 days. Activated Thl and Th2 lymphocytes were maintained in this way for a 
maximum of three cycles. RNA was prepared from primary and secondary Thl, Th2 and 
Trl after 6 and 24 hours following the second and third activations with plate bound anti- 

15 CD3 and anti-CD28 mAbs and 4 days into the second and third expansion cultures in 
Inter leukin 2. 

The following leukocyte cells lines were obtained from the ATCC: Ramos, EOL-1, 
KU-812. EOL cells were further differentiated by culture in O.lmM dbcAMP at 
5xl0 5 cells/ml for 8 days, changing the media every 3 days and adjusting the cell 

20 concent ration to 5xl0 5 cells/ml. For the culture of these cells, we used DMEM or RPMI (as 
recommended by the ATCC), with the addition of 5% FCS (Hyclone), lOO^iM non 
essential amino acids (Gibco), ImM sodium pyruvate (Gibco), mercaptoethanol 5.5xlO" 5 M 
(Gibco), lOmM Hepes (Gibco). RNA was either prepared from resting cells or cells 
activated with PMA at lOng/ml and ionomycin at lug/ml for 6 and 14 hours. Keratinocyte 

25 line CCD 106 and an airway epithelial tumor line NCI-H292 were also obtained from the 
ATCC. Both were cultured in DMEM 5% FCS (Hyclone), lOOuM non essential amino 
acids (Gibco), ImM sodium pyruvate (Gibco), mercaptoethanol 5.5xlO~ 5 M (Gibco), and 
lOmM Hepes (Gibco). CCD1 106 cells were activated for 6 and 14 hours with 
approximately 5 ng/ml TNF alpha and lng/ml IL-1 beta, while NCI-H292 cells were 

30 activated for 6 and 14 hours with the following cytokines: 5ng/ml IL-4, 5ng/ml IL-9, 
5ng/ml IL-1 3 and 25ng/ml IFN gamma. 

For these cell lines and blood cells, RNA was prepared by lysing approximately 
10 7 cells/ml using Trizol (Gibco BRL). Briefly, 1/10 volume of bromochloropropane 
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(Molecular Research Corporation) was added to the RNA sample, vortexed and after 10 
minutes at room temperature, the tubes were spun at 14,000 rpm in a Sorvall SS34 rotor. 
The aqueous phase was removed and placed in a 15ml Falcon Tube. An equal volume of 
isopropanol was added and left at -20°C overnight. The precipitated RNA was spun down 
5 at 9,000 rpm for 15 min in a Sorvall SS34 rotor and washed in 70% ethanol. The pellet was 
redissolved in 300ul of RNAse-free water and 35u.l buffer (Promega) 5ul DTT, 7ul 
RNAsin and 8|il DNAse were added. The tube was incubated at 37°C for 30 minutes to 
remove contaminating genomic DNA, extracted once with phenol chloroform and re- 
precipitated with 1/10 volume of 3M sodium acetate and 2 volumes of 100% ethanol. The 
10 RNA was spun down and placed in RNAse free water. RNA was stored at -80°C. 

AI_comiprehensive panel vl.O 

The plates for Al comprehensive panel_vl.O include two control wells and 89 test 
samples comprised of cDNA isolated from surgical and postmortem human tissues 
obtained from the Backus Hospital and Clinomics (Frederick, MD). Total RNA was 
1 5 extracted from tissue samples from the Backus Hospital in the Facility at CuraGen. Total 
RNA from other tissues was obtained from Clinomics. 

Joint tissues including synovial fluid, synovium, bone and cartilage were obtained 
from patients undergoing total knee or hip replacement surgery at the Backus Hospital. 
Tissue samples were immediately snap frozen in liquid nitrogen to ensure that isolated 
20 RNA was of optimal quality and not degraded. Additional samples of osteoarthritis and 
rheumatoid arthritis joint tissues were obtained from Clinomics. Normal control tissues 
were supplied by Clinomics and were obtained during autopsy of trauma victims. 

Surgical specimens of psoriatic tissues and adjacent matched tissues were provided 
as total RNA by Clinomics. Two male and two female patients were selected between the 
25 ages of 25 and 47. None of the patients were taking prescription drugs at the time samples 
were isolated. 

Surgical specimens of diseased colon from patients with ulcerative colitis and 
Crohns disease and adjacent matched tissues were obtained from Clinomics. Bowel tissue 
from three female and three male Crohn's patients between the ages of 41-69 were used. 
30 Two patients were not on prescription medication while the others were taking 

dexamethasone, phenobarbital, or tylenol. Ulcerative colitis tissue was from three male and 
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four female patients. Four of the patients were taking lebvid and two were on 
phenobarbital. 

Total RNA from post mortem lung tissue from trauma victims with no disease or 
with emphysema, asthma or COPD was purchased from Clinomics. Emphysema patients 
5 ranged in age from 40-70 and all were smokers, this age range was chosen to focus on 
patients with cigarette-linked emphysema and to avoid those patients with alpha- 1 anti- 
trypsin deficiencies. Asthma patients ranged in age from 36-75, and excluded smokers to 
prevent those patients that could also have COPD. COPD patients ranged in age from 35- 
80 and included both smokers and non-smokers. Most patients were taking corticosteroids, 
10 and bronchodilators. 

In the labels employed to identify tissues in the AI_comprehensive panelvl .0 
panel, the following abbreviations are used: 

AI = Autoimmunity 

Syn = Synovial 
1 5 Normal = No apparent disease 

Rep22 /Rep20 = individual patients 

RA = Rheumatoid arthritis 

Backus = From Backus Hospital 

OA = Osteoarthritis 
20 (SS) (BA) (MF) = Individual patients 

Adj = Adjacent tissue 

Match control = adjacent tissues 

-M = Male 

-F = Female 

25 (COPD = Chronic obstructive pulmonary disease 

Panels :5D and 51 

The plates for Panel 5D and 51 include two control wells and a variety of cDNAs 
isolated from human tissues and cell lines with an emphasis on metabolic diseases. 
Metabolic tissues were obtained from patients enrolled in the Gestational Diabetes study. 
30 Cells were obtained during different stages in the differentiation of adipocytes from human 
mesenchymal stem cells. Human pancreatic islets were also obtained. 

In the Gestational Diabetes study subjects are young (18-40 years), otherwise 
healthy women with and without gestational diabetes undergoing routine (elective) 
Caesarean section. After delivery of the infant, when the surgical incisions were being 
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repaired/closed, the obstetrician removed a small sample (<1 cc) of the exposed metabolic 
tissues during the closure of each surgical level. The biopsy material was rinsed in sterile 
saline, blotted and fast frozen within 5 minutes from the time of removal. The tissue was 
then flash frozen in liquid nitrogen and stored, individually, in sterile screw-top tubes and 
kept on dry ice for shipment to or to be picked up by CuraGen. The metabolic tissues of 
interest include uterine wall (smooth muscle), visceral adipose, skeletal muscle (rectus) and 
subcutaneous adipose. Patient descriptions are as follows: 

Patient 2 Diabetic Hispanic, overweight, not on insulin 

Patient 7-9 Nondiabetic Caucasian and obese (BMI>30) 

Patient 10 Diabetic Hispanic, overweight, on insulin 

Patient 1 1 Nondiabetic African American and overweight 

Patient 12 Diabetic Hispanic on insulin 

Adipocyte differentiation was induced in donor progenitor cells obtained from 
Osirus (a division of Clonetics/BioWhittaker) in triplicate, except for Donor 3U which had 
only two replicates. Scientists at Clonetics isolated, grew and differentiated human 
mesenchymal stem cells (HuMSCs) for CuraGen based on the published protocol found in 
Mark F. Pittenger, et al., Multi lineage Potential of Adult Human Mesenchymal Stem Cells 
Science Apr 2 1999: 143-147. Clonetics provided Trizol lysates or frozen pellets suitable 
for mRNA isolation and ds cDNA production. A general description of each donor is as 
follows:: 

Donor 2 and 3 U: Mesenchymal Stem cells, Undifferentiated Adipose 
Donor 2 and 3 AM: Adipose, AdiposeMidway Differentiated 
Donor 2 and 3 AD: Adipose, Adipose Differentiated 

Human cell lines were generally obtained from ATCC (American Type Culture 
Collection), NCI or the German tumor cell bank and fall into the following tissue groups: 
kidney proximal convoluted tubule, uterine smooth muscle cells, small intestine, liver 
HepG2 cancer cells, heart primary stromal cells, and adrenal cortical adenoma cells. These 
cells are all cultured under standard recommended conditions and RNA extracted using the 
standard procedures. All samples were processed at CuraGen to produce single stranded 
cDNA. 

Panel 51 contains all samples previously described with the addition of pancreatic 
islets from a 58 year old female patient obtained from the Diabetes Research Institute at the 
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University of Miami School of Medicine. Islet tissue was processed to total RNA at an 
outside source and delivered to CuraGen for addition to panel 51. 

In the labels employed to identify tissues in the 5D and 51 panels, the following 
abbreviations are used: 

5 GO Adipose = Greater Omentum Adipose 

SK = Skeletal Muscle 
UT = Uterus 
PL = Placenta 

AD = Adipose Differentiated 
1 0 AM = Adipose Midway Differentiated 

U = Undifferentiated Stem Cells 

Panel CNSD.01 

The plates for Panel CNSD.01 include two control wells and 94 test samples 
comprised of cDNA isolated from postmortem human brain tissue obtained from the 
1 5 Harvard Brain Tissue Resource Center. Brains are removed from calvaria of donors 

between 4 and 24 hours after death, sectioned by neuroanatomists, and frozen at -80°C in 
liquid nitrogen vapor. All brains are sectioned and examined by neuropathologists to 
confirm diagnoses with clear associated neuropathology. 

Disease diagnoses are taken from patient records. The panel contains two brains 
20 from each of the following diagnoses: Alzheimer's disease, Parkinson's disease, 

Huntington's disease, Progressive Supernuclear Palsy, Depression, and "Normal controls". 
Within each of these brains, the following regions are represented: cingulate gyrus, 
temporal pole, globus palladus, substantia nigra, Brodman Area 4 (primary motor strip), 
Brodman Area 7 (parietal cortex), Brodman Area 9 (prefrontal cortex), and Brodman area 
25 17 (occipital cortex). Not all brain regions are represented in all cases; e.g., Huntington's 
disease is characterized in part by neurodegeneration in the globus palladus, thus this 
region is impossible to obtain from confirmed Huntington's cases. Likewise Parkinson's 
disease is characterized by degeneration of the substantia nigra making this region more 
difficult to obtain. Normal control brains were examined for neuropathology and found to 
30 be free of any pathology consistent with neurodegeneration. 

In the labels employed to identify tissues in the CNS panel, the following 
abbreviations are used: 
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PSP = Progressive supranuclear palsy 
Sub Nigra = Substantia nigra 
Glob Palladus= Globus palladus 
Temp Pole = Temporal pole 
5 Cing Gyr = Cingulate gyrus 

BA 4 = Brodman Area 4 

Panel CNS Neurodegeneration Vl .0 

The plates for Panel CNS Neurodegeneration Vl.O include two control wells and 
47 test samples comprised of cDNA isolated from postmortem human brain tissue obtained 
10 from the Harvard Brain Tissue Resource Center (McLean Hospital) and the Human Brain 
and Spinal Fluid Resource Center (VA Greater Los Angeles Healthcare System). Brains 
are removed from calvaria of donors between 4 and 24 hours after death, sectioned by 
neuroanatomists, and frozen at -80°C in liquid nitrogen vapor. All brains are sectioned and 
examined by neuropathologists to confirm diagnoses with clear associated neuropathology. 

15 Disease diagnoses are taken from patient records. The panel contains six brains 

from Alzheimer's disease (AD) patients, and eight brains from "Normal controls" who 
showed no evidence of dementia prior to death. The eight normal control brains are divided 
into two categories: Controls with no dementia and no Alzheimer's like pathology 
(Controls) and controls with no dementia but evidence of severe Alzheimer's like 

20 pathology, (specifically senile plaque load rated as level 3 on a scale of 0-3; 0 = no 

evidence of plaques, 3 = severe AD senile plaque load). Within each of these brains, the 
following regions are represented: hippocampus, temporal cortex (Brodman Area 21), 
parietal cortex (Brodman area 7), and occipital cortex (Brodman area 17). These regions 
were chosen to encompass all levels of neurodegeneration in AD. The hippocampus is a 

25 region of early and severe neuronal loss in AD; the temporal cortex is known to show 
neurodegeneration in AD after the hippocampus; the parietal cortex shows moderate 
neuronal death in the late stages of the disease; the occipital cortex is spared in AD and 
therefore acts as a "control" region within AD patients. Not all brain regions are 
represented in all cases. 

30 In the labels employed to identify tissues in the CNS Neurodegeneration V 1 .0 

panel, the following abbreviations are used: 

AD = Alzheimer's disease brain; patient was demented and showed AD-like 
pathology upon autopsy 
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Control = Control brains; patient not demented, showing no neuropathology 
Control (Path) = Control brains; pateint not demented but showing sever AD-like 
pathology 

SupTemporal Ctx = Superior Temporal Cortex 
5 Inf Temporal Ctx = Inferior Temporal Cortex 

A. CG58522-01: HUMAN PLATELET- ACTIVATING FACTOR 
ACETYLHYDROLASE IB BETA 

Expression of gene CG58522-01 was assessed using the primer-probe set Ag3365, 
described in Table AA. Results of the RTQ-PCR runs are shown in Table AB. 

1 0 Table AA . Probe Name Ag3 365 



f 1 

Primers Sequences 


Length 


Start 
Position 


SEQID 
NO: 


(Forward ?5 ■ -cagaatgaaccaaggagactca-3 ' 




3 


357 


|p , TET-5 ' -ctactccgcatgcggcagaagacatt-3 ' - 
1 rODe rAMRA 


.2* 


35 


358 


jReverse |5 ' -cacatccatctgtcatctcctt-3 ' 






359 | 



Table AB . General_screening_panel_v 1 .4 



1 

Tissue Name 


Rel. Exp.(%) 
Ag3365, Run 
216709759 


Tissue Name 


Rel. Exp.(%) 
Ag3365, Run 
216709759 


Adipose 


0.0 


Renal ca. TK-10 


0.0 


Melanoma* 
Hs688(A).T 


0.0 


Bladder 


0.0 


Melanoma* 
lls>G88(:J).T 


0.0 


Gastric ca. (liver met.) 
NCI-N87 


0.0 


Melanoma' 1 M 1 4 


0.0 


( iastric ca. KATO III 


0.0 


Melanoma* 
LOXIMVI 


0.0 


Colon ca. SW-948 


0.0 


Melanoma* SK- 
MEL-5 


0.0 


Colon ca. SW480 


0.0 


Squamous cell 
carcinoma SCC-4 


0.0 


Colon ca.* (SW480 
met) SW620 


0.0 


Testis Pool 


10.7 


Colon ca. HT29 


0.0 


Prostate ca.* (bone 
met) PC-3 


0.0 


Colon ca. HCT-116 


0.0 


Prostate Pool 


0.0 


Colon ca. CaCo-2 


0.0 


Placenta 


0.0 


Colon cancer tissue 


0.0 


Uterus Pool 


0.0 


Colon ca. SW1116 


0.0 


Ovarian ca. 


0.0 


Colon ca. Colo-205 


0.0 



500 





OVCAR-3 










Ovarian ca. SK- 
OV-3 


4.9 


Colon ca. SW-48 


00 




Ovarian ca. 
OVCAR-4 


0.0 

: 


Colon Pool 


0.0 




Ovarian ca. 
OVCAR-5 


0.0 



" - 


— — — - 
Small Intestine Pool 


0.0 




Ovarian ca. 
IGROV-1 


7.9 


— — - 

Stomach Pool 


0.0 




Ovarian ca. 
OVCAR-8 


26.8 


Bone Marrow Pool 


" : — 

0.0 




Ovary 


o.o " 


Fetal Heart 


o.o 


a 


Breast ca. MC F-7 


"6.6 


Heart Pool 


0.0 




Breast ca. MDA- 
MB-231 


1.7 


Lymph Node Pool 


16.5 




Breast ui BI 5-W 


0.0 


Fetal Skeletal Muscle 




0.1) 


o 


Breastca. T47D 


0.0 


Sk. ki sl Mil ,i Jo Pool 


0.0 




JjiCabL ta. lVli_//\-l\ 




Spleen Pool 


0.0 




Breast Pool 


0.0 

— 


Thymus Pool 


0.0 


w 
o 




0.0 


CNS cancer 
(glio/astro) U87-MG 


0.0 


n 


Lung 


0.0 


CNS cancer 
(glio/astro) 1 -1 18-MG 


0.0 




Fetal Lung 


0.0 


CNS cancer 
neui met) SK-N-AS 


0.0 




Lung ca. NCI-N417 


0.0 


CNS cancer (astro) Si - 
539 


0.0 




Lung ca. LX-1 

J 


3.3 

- — — ■ — : 


CNS cancer (astro) 
SNB-75 


0.0 




Lung ca. NCI-H146 


4.5 


CNS cancer (glio) 
SNB-19 


6.2 




Lung ca. SHP-77 


0.0 

._ ^ 1 


CNS cancer (glio) SF- 
295 


25.7 




Lung ca. A549 


0.0 Brain (Amygdala) Pool 


0.0 


ILung ca. NCI-H526 


0.0 Brain (cerebellum) 


0.0 




Lung ca. NCI-H23 


100.0 


Brain (fetal) 


0.0 




Lung ca. NCI-H460 


0.0 


Brain (Hippocampus) 1 
Pool 


4.8 




Lung ca. HOP-62 


0.0 


Cerebral Cortex Pool 






Lung ca. NCI-H522 


q q (Brain (Substantia 
jnigra) Pool 


- — ^—H 

1.8 




Liver 


0.0 iBrain (Thalamus) Pool 


3.6 




Fetal Liver 


0.0 Brain (whole) 


6.9 




Liver ca. HepG2 . 


0.0 


Spinal Cord Pool 


0.0 
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Kidney Pool 


0.0 


\drenal Claud 


0.0 


Fetal Kidney 


0.0 


Pituitary gland Pool 


0.0 


Renal ca. 786-0 


0.0 


Salivary Gland 


0.0 


Renal ca. A498 


0.0 


I hyroid ( female) 


0.0 


Renal ca. ACHN 


0.0 


Pancreatic ca. 
CAPAN2 


0.0 


Renal ca. UO-31 


0.0 


Pancreas Pool 


0.0 



CNS_neurodegeneration_vl.O Summary: Ag3365 - Expression of this gene is 
low/undetectable (CTs > 35) across all of the samples on this panel (data not shown). 



General_screening_panel_vl.4 Summary: Ag3365 - Significant expression of this gene 
is seen only in the lung cancer cell line NCI-H23 (CT=33.1). Therefore, expression of this 
5 gene may be used to distinguish this sample from the other samples on this panel. 

Panel 4D Summary: Ag3365 - Expression of this gene is low/undetectable (CTs > 35) 
across all of the samples on this panel (data not shown). 

B. CG58520-01: GAMMA- AMINOBUTYRIC-ACID RECEPTOR GAMMA-1 

Expression of gene CG58520-01 was assessed using the primer-probe set Ag3364, 
10 described in Table BA. 



Table B A . Probe Name Ag3364 



Primers Sequences 


(Length 


Start 
Position 


SEQID J 

NO: [ 


[Forward ]5 ' -ttcttctgcggagtcaaagtag-3 ' 


1 22 


43 


360 J 


p^L. ET-5'-ttg 
l rrOPe iTAMRA 


: 26 


75 


361 1 


Reverse -• • ». « t •.• t y cc 1 1 a t caacgt t tc - 3 ' 


| 22 


106 


362 1 



CNS_neurodegeneration_vl.O Summary: Ag3364 - Expression of this gene is 
low/undetectable (CTs > 35) across all of the samples on this panel (data not shown). 



General_screening_panel_vl.4 Summary: Ag3364 - Expression of this gene is 
15 low/undetectable (CTs > 35) across all of the samples on this panel (data not shown). 

Panel 4D Summary: Ag3364 - Expression of this gene is low/undetectable (CTs > 35) 
across all of the samples on this panel (data not shown). 
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Panel CNS_1 Summary: Ag3364 - Expression of this gene is low/undetectable (CTs > 
35) across all of the samples on this panel (data not shown). 

C. CGS8520-03: GAMMA-AMINOBUTYRIC-ACID RECEPTOR GAMMA- 1 
SUBUNIT PRECURSOR (GABA(A) RECEPTOR) 

Expression of gene CG58520-03 was assessed using the primer-probe set Ag5092, 
described in Table CA. 



Table C A. Probe Name Ag5092 



Primers 


Sequences 


Length 


Start 
Position 


SEQID 
M>: 


Forward 


5 1 -gaacattcctgtccactgga-3 ' 


20 


625 


363 


Prnhp TET-5' -attttcaagcgatggataccctaaaa- 3 • - 

rroDe {tamra 


26 


645 


364 


Reverse js ■ -cacttctacggagggctttt-3 ' 


:o 


692 


365 



CNS_neurodegeneration_vl.O Summary: Ag5092 - Expression of this gene is 
low/undetectable (CTs > 35) across all of the samples on this panel (data not shown). 



General screening panel vl.5 Summary: Ag5092 - Expression of this gene is 
low/undetectable (CTs > 35) across all of the samples on this panel (data not shown). 

Panel 4.1D Summary: Ag5092 - Expression of this gene is low/undetectable (CTs > 35) 
across all of the samples on this panel (data not shown). 

D. CG585 18-0 1 : GAMMA-AMINOBUTYRIC-ACID RECEPTOR RHO-3 - 

Expression of gene CG585 18-01 was assessed using the primer-probe sets Ag3363, 
Agl 130, Agl 198, Agl253 and Agl603, described in Tables DA, DB, DC, DD and DE. 
Results of the RTQ-PCR runs are shown in Tables DF, DG and DH. 



Table DA . Probe Name Ag3363 



Primers 


Sequences 


Length) 


Start 
Position 


SEQID 

NO: 


Forward 


c ' tgj -1 - r^ngt -t X'tc 3 ' 


----- 

22 j 


14 


366 


Probe 


TET-5 ' -cacctacatctggatcatattgaaacca-3 1 - 
TAMRA 


_>s 


36 


367 


Reverse 


5 1 -ttgatgttagaagcagcacaaa-3 1 


22 | 


68 


368 
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Table E >B. Probe Name Agl 130 



PrimersJ Sequences 


LenethJ Start 
eDg j Position 


SEQID 
NO: 


Forward: 5 • -gtcctggctttccagttagtct-3 ' 


22 | 


10 


369 


Vrrsh* TET-5'-tc t atctggatcatattgaaacca-3 • - 
rrooe Jtamra 


29 | 


35 


370 


Reverse Is ' -ttgatgttagaagcagcacaaa-3 ' 


22 I 


68 


371 


Table DC. Probe Name As?1 1 98 


Primers] Sequences 


Lengthl 


Start 
Position 


SEQID 
NO: 


Forward r ' 'it tggcttt ag gtc - ' 

PiyxKp. jTET-5' -tcacctacatctggatcatattgaaacca-3 » - | 

_____ Jtamra 


22 j 
29 | 


10 

35 


_372 
373 


Reverse js ' - ttgatgttagaagcagcacaaa - 3 ' 


22 


68 


374 


Table DD. Probe Name Asrl253 


Primers 

Forward 
Probe 


Sequences 


Length 


| Start 
Position 


NO: 1 


5 1 -atctgggtgcctgatatctttt-3 ' 


22 


[66 


375 


TET-5 ' -tgtccactctaaaagatccttcatccatga-3 ' - 

TAMRA 


30 


489 


376 


Reverse 


5 ' -cgcagcatgatattctccatag-3 1 


22 


524 


377 



Table DE . Probe Name Agl 603 



j j — 

Primers Sequences 


Length 


Start 
Position 


SEQID 
NO: 


Forwan ' jt -t gcttt agttag t- ' 


22 


10 


378 


Pmhe- jTET-5 ' -tcacctacatctggatcatattgaaacca-3 1 - 
rrODe j TAMRA 


29 


35 


379 


Reverse 5 • -ttgacgtta?aagcag-a :-aa a- 3 1 


22 


68 


380 



Table D F. General_screening_panel vl.4 



Tissue Name 


Rel. Exp.(%) 
Ag3363, Run 
216709559 


Tissue Name 


Rel. Exp.(%) 
Ag3363, Run 
216709559 


Adipose 


0.0 


Renal ca. TK-10 


0.0 


Melanoma* 
Hs688(A).T 


0.0 


Bladder 


6.6 


Melanoma* 


0.0 


Gastric ca. (liver met.) 


0.0 
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Hs688(B).T 




NCI-N87 




[Melanoma* Ml 4 


0.0 


Gastric ca. k ATO III 


0.0 


IMelanoma* 
LOXIMV1 


0.0 


Colon ca. SW-948 


0.0 


Melanoma* SK- 
MEL-5 


| 0.0 


Colon ca. SW480 


0.0 


j Squamous cell 
carcinoma SCC-4 


0.0 


Colon ca.* (SW480 
met) SW620 


0.0 


Testis fool 


16.7 


Colon ca. HT29 


0.0 


Prostate ca.* (bone 
met)PC-3 


0.0 


Colon ca.HCT-116 


0.0 


Prostate Pool 


0.0 


Colon ca. CaCo-2 


0.0 


Placenta 


0.0 


Colon cancer tissue 


0 0 


Uterus Pool 


0.0 


Colon ca. SW1116 


0.0 


Ovarian ca. 
( )\ CAR-3 


0.0 


Colon ca. Colo-205 


0.0 


lOvariari ca. SK- 
OY-3 


0.0 


Colon ca. SW-48 


0.0 


|Ov3xi_in C3.. 
]OVCAR-4 


0.0 


Colon Pool 


0.0 


(Ovarian ca. 
bvCAR-5 


0.0 


j Small Intestine Pool 


______ 

0.0 


(Ovarian ca. 
llGROV-1 


0.0 




Stomach Pool 


0.0 


Ovarian ca. 
OVCAR-8 


0.0 




Bone Marrow Pool 


' 

0.0 


Ovary 


0.0 




Fetal Heart 


0.0 


Breast ca. MCF-7 


0.0 




Heart Pool 


0.0 


(Breast ca. MDA- 
iMB-231 


0.0 


^ Lymph Node Pool 


6.8 


Breast ca Bl ^4 t! 


0.0 




Fetal Skeletal Muscle 


0.0 


Breast m 1471) 




Mveieta! Muscle Foci 


0.0 


Breast ca. MDA-N 


' "b.o 


— 


Spleen Pool 


8.5 


Breast Pool 


0 0 




Thymus Pool 


0.0 


Trachea 


0.0 


jCNS cancer 
|(glio/astro) U87-MG 


Lung 


0.0 


ICNS cancer 
(gIio/astro)L-118-MG ; 


10.9 


Fetal Lung 


0.0 


I 


CNS cancer 
(neuro;met) SK-N-AS 


0.0 


Lung ca. NCI-N417 


0.0 


jCNS cancer (astro) SF-1 
[539 


0.0 


Lung ca. LX-1 


0.0 


CNS cancer (astro) j 
SNB-75 | 


0.0 
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Lung ca. NCI-H146 


! ' 

77.9 


CNS cancer (glio) 
SNB-19 


0.0 


i^ung ca. ahir- / / 


100.0 


CNS cancer (glio) SF- 
295 


11.4 


Lung ca. A549 


10.1 


Brain (Amygdala) Pool 


0.0 


Lunge-. \CI-1 152ft 


0.0 


Brain (cerebellum) 


_ o.d 


Lung ca. NCI-H23 


34.4 


Brain (fetal ) 


0.0 


Lung ca. JNUl-rl4ftU 


30.6 


Brain (Hippocampus) 
Pool 


0.0 


Lung ca. HOP-62 


0.0 [Cerebral Cortex Pool 


0.0 


Lung ca. NCI-H522 


q q JBrain (Substantia 
pigra) Pool 


0 0 


Liver 


0.0 Brain (Thalamus) Pool 


5.1 


Fetal Liver 

1. iverca. HepG2 


|'() ^Brain (whole) 

6.0 "Spinal Cord Pool ~" 


50.0 
0.0 


Kidney Pool 


3.0 


Adrenal Gland 


0 0 


\ 2lal Kidney 


8.4 


Pituitary gland Pool 


0.0 


Renal c !. 786-0 


0.0 


Salivary Gland 


0.0 


Renal c.i A 4 1 >S 


0.0 ' 


Thyroid (female) 


0.0 


Renal ca. ACHN 


0.0 


Pancreatic ca. 
CAPAN2 


0.0 


Renal ca. LO-3J 


0.0 


Pancreas Pool 


0.0 



Table D G. Panel 1.2 



I Tissue 
Name 


Rel. 
Exp.(%) 
Agll30, 
Run 
125117140 


Rel. 
Exp.(%) 
Agll30, 

Run 
126566764 


Rel. 
Exp.(%) 
Agll98, 
Run 
129140506 


Tissue 
Name 


Rel. 
Exp.(%) 
Agll30, 
Run 
125117140 


Rel. 
Exp.(%) 
Agll30, 

Run 
126566764 


Rel. 
Exp.(%) 
Agll98, 
Run 
129140506 


Endothelial 
.ells 


0.0 


0.0 


0.0 


Renal ca. 
786-0 


0.0 


0.0 


0.0 


[Heart 
(Fetal) 


0.0 


0.0 


0.0 


Renal ca. 
A498 


7.3 


4.7 


0.0 


Pancreas 


0.0 


0.0 


0.0 


Renal ca. 
RXF 393 


0.0 


0.0 


0.0 


Pancreatic 
ca. CAFAN 

2 


9.0 


0.0 


0.0 


Renal ca. 
ACHN 


0.0 


0.0 


0.0 I 


Adrenal 
Gland 

Thyroid 


0.0 


2.6 


0.0 


Renal ca. 
UO-31 


3.9 


0.0 


0.0 ! 


0.0 


0.0 


0.0 


Renal ca. 
TK-10 


0.0 


0.0 


0.0 


Salivary 
gland 


0.0 


0.0 


0.0 


Liver 

j 


26.6 


0.0 


0.0 
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j Pituitary 


0.0 


0.0 


o.o 


25.3 

„ . _ 


0.0 


0.0 


Brain 
(fetal) 

1 


0.0 


0.0 


«Liver ca. 
0.0 j(hepatobla 
jSt) riepuz 


0.0 

. 


0.0 


0.0 


|Brain 
j(whole) 


2.6 


20.0 


0.0 ILung 


0.0 


0.0 


0.0 


jBrain 
j(amygdala) 


1.3 




32.1 


0.0 fc un 8 
j(fetal) 


0.0 


0.0 


0.0 


iBrain 

j(cerebellum 
) 


1, 


3.8 


jLung ca. 
0.0 ((small 

(cell) LX-1 


3.4 


0.0 0.0 


Brain 

(hippocamp 
us) 


0.0 


27.0 


jLung ca. 

oo l (sma11 

cell) NCI- 
|H69 


28.5 


74.2 J 0.0 

I 


Brain 
(thalamus) 


9.9 


22.5 


ILung ca. 
9 8 (s.cell 
fvar.) 
jSHP-77 


3.8 




0.0 


Cerebral 
Cortex 


0.0 


0.0 


jLung ca. 

o o l (,arge 

lcell)NCI- 
IH460 


8.8 


4.1 


5.3 


1 Spinal cord 


4.4 


0.0 


l.ung ca. 
0.0 (non-sm. 

lcIU A 5 49 


51.4 


~ """ 
9.5 


7.2 


| 

jglio/astro 
|U87-MG 


0.0 




0.0 


jLung ca. 

o.o f n ™- 

is.cell) 
[NCI-H23 


0.0 


0.0 


0.0 


glio/astro 
U-118-MG 

' 


0.0 


0.0 


ILung ca. 

o.o N: 

is.cell) 
IIOP-62 


8.4 


2.7 


9.6 


astrocytom 
1aSW1783 

u 


2.9 


0.0 

_ 


(Lung ca. 
a a (non-s.cl) 
INCI- 


0.0 

- 


0.0 


0.0 


neuro*; met 
SK-N-AS 


0.0 


0.0 


jLung ca. 
0.0 j(squam.) 
SW 900 


3, 


8.7 


1 

0.0 


astrocytom 1 
a SF-539 


5.1 


0.0 


iLung ca. 

n a I( s q uam -) 

NCI- 

• H5% : ^ 


2.3 


15.9 


0.0 


astrocytom 


2.3 


0.0 


0.0 jMammary 


0.0 


0.0 


0.0 
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i-3 



a SNB-75 

___ 




; 


Igland 








glioma 
SNB-19 


6.3 


20.7 


9.0 


Breast 
ca.* 
(pl.ef) 
(MCF-7 


0.0 


0.0 


0.0 

! 


glioma 
U251 


1.4 


0.0 


j Breast 
ca.* 
1.8 (pl.ef) 
[MDA- 
IMB-231 


0.0 


0.0 


0.0 


glioma SF- 
295 

L— — 


0.0 


0.0 


0.0 


(Breast 
lea.* (pi. 
[ef) T47D 


14.1 


37.4 


0.0 


j Heart 


0.0 


0.0 


0.0 


iBreast ca. 
JbT-549 


12.5 


21.0 


12.3 


j Skeletal 
(Muscle 


2.3 

- 


0.0 


0.0 


Breast ca. 
jMDA-N 


0.0 


0.0 


0.0 


Bone 
[marrow 


0.0 


0.0 


0.0 


Ovary 


0.0 


0.0 


0.0 


Thymus 


0.0 


0.0 


0.0 


lea. 

JOVCAR- 
3 


0.0 


0.0 


0.0 


(Spleen 


2.2 


0.0 


0.0 


lOvarian 
ca. 

jOVCAR- 

{4 


0.0 


0.0 


0.0 


Lymph 
jnode 

1 


0.0 


0.0 


0.0 


Ovarian 
jca. 

IOVCAR- 
5 


66.9 


35.4 


'-' — H 
4.4 


[Colorectal 
[Tissue 


11.3 


27.7 


21.8 


(Ovarian 

jca. 

lOVCAR- 1 

,8 


2.7 


0.0 


0.0 

_J 


f— -- 
Stomach 

— - - 


0.0 


0.0 


0.0 


lOvarian 
jca. 

JIGROV-1 


6.0 


0.0 


0.0 


Small 
intestine 


5.4 


0.0 


0.0 


Ovarian 
Ica. 

.(ascites) 
SK-OV-3 


30.8 


0.0 




0.0 


Colon ca. 
SW480 


3.2 


0.0 


0.0 


jUterus 


0.0 


0.0 


0.0 


Colon ca.* 

SW620 

(SW480 


0.0 


0.0 j 


0.0 


; Placenta | 

JL ,„„, I 


0.0 




0.0 

i 


0.0 j 
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Colon ca. 
HT29 


1.9 

™ — 


14.4 


0.0 jProstate 


6.9 


0.0 


0.0 


Colon ca. 
HCT-116 


0.0 

_ _ 


0.0 


(Prostate 
0.0 lea.* (bone 
met) PC-3 


100.0 


0.0 


0.0 


Colon ca. 
CaCo-2 


0.0 


0.0 


0.0 (Testis 


54.7 
4.2 


100.0 


36.9 


Colon ca. 

Tissue 

(OD03866) 


72.2 


75.8 


(Melanom 

1000 1hs688(A). 

JT 


0.0 


0.0 


Colon ca. 
HCC-2998 


5.3 


4.8 


i Melanom 

o o r (met) 

Hs688(B). 

|T 




2.7 


34.2 


13.3 


Gastric ca.* 
(liver met) 
NCI-N87 


50.3 


0.0 


jMelanom 
0.0 a UACC- 

J62 


0.0 


0.0 


0.0 


Bladder 


6.0 


22.1 


n n iMelanom 
UU |aM14 


31 .4 


36 3 




20 2 


Trachea 


0.0 


0.0 


Melanom 
0.0 a LOX 
I VI VI 


0.0 


0.0 


0.0 


Kidney 


2.0 


0.0 


(Melanom 
a a a* (met) 
SK-MEL- 

|5 




0.0 

._ ' 


0.0 


Kidney 
(fetal) 


1.1 


2.5 


0.0 [ 




1 

- 1 



Table P H. Panel 4R 



Tissue Name 

1— 


Rel. Exp.(%) 
Agll98,Run 
142014937 


Tissue Name 


Rel. Exp.(%) 
Agll98, Run 
142014937 


Sci otKici} I hi act 


0.0 


HUVEC IL-lbeta 


0.0 


Secondary Th2 act 


0.0 


HUVEC I FN gamma 


0.0 


Secondary Trl act 


2.5 


HUVE( IM 'alpha + 
IFN gamma 


0.0 


Secondary Thl rest 


0.0 


llli\LCTNF alpha + 
IL4 


0.0 


Secondary Th2 rest 


0.0 


HUVEC IL-11 


0.0 1 


Secondary Trl rest 


0.0 


Lung Microvascular EC 
none 


0.0 


Primary Thl act 


2;° , 


Lung Microvascular EC 


0.0 | 
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1 


TNFalpha + IL-1 beta 


'- 1 """"""" 


Primary Th2 act 


0.0 


Microvascular Dermal 
EC none 


0.0 


Primary Trl act 


0.0 


Microsvasular Dermal 
EC TNFalpha + IL-1 beta 


0.0 


Primary Thl rest 


0.0 


Bronchial epithelium 
TNFalpha + IL1 beta 


0.0 


Primary Th2 rest 


0.0 


Small airway epithelium 
none 


0.0 


Primary Trl rest 


0.0 


Small airway epithelium 
rNFalpha ML- 1 beta 


0.0 


CD45RA CD 1 
lymphocyte act 


0.0 




Coronery artery SMC rest 


0.0 


CD45RO CD4 
lymphocyte act 


0.0 


JCoronery artery SMC 
(TNFalpha + IL-1 beta 


0.0 


CD8 lymphocyte act 


0.0 




Astrocytes rest 


0.0 


Secondary CD 8 
lymphocyte rest 


0.0 

_____ 




Astrocytes TNFalpha + 
IL-1 beta 


0.0 


Secondary CD8 
hmpho:\te act 


0.0 




KU-8 12 (Basophil) rest 


0.0 


CD4 lymphocyte none 


0.0 




KU-8 12 (Basophil) 
PMA/ionomycin 


0.0 


2ryThl/Th2/Trl_anti- 
CD95 CHI 1 


0.0 




CCD1106 

(1< itii cytes) none 


0.0 


LAK cells rest 


0.0 




CCD 1106 

(Keratinocytes) 

1 NFalpha + IL-lbeta 


0.0 


LAK. cells 11,-2 


0.0 




Liver cirrhosis 


16.4 


LAK cells IL-2+IL-12 


0.0 




Lupus kidney 


0.0 


LAK cells IL-2+IFN 
gamma 


0.0 




NCI-H292 none 


0.0 


LAK cells IL-2+ IL-1 8 


0.0 




NCI-H292 IL-4 


0.0 


LAK cells 
PMA/ionomycin 


0.0 


i 

INCI-H292 IL-9 


0.0 


NK Cells IL-2 rest 


0.0 




NCI-H292 IL-13 


0.0 


Two Way MLR 3 day 


0.0 


INCI-H292 IFN gamma 


0.0 


Two W iv MLR s day 


0.0 


IIPALC none 


0.0 


Two Way MLR 7 day 


0.0 




HPAEC TNF alpha + IL- ! 
1 beta 


0.0 


PBMC rest 


0.0 




Lung fibroblast none 


0.0 


PBMC PWM 


0.0 


Lung fibroblast TNF 
(alpha + IL-1 beta 


0.0 


PBMC PHA-L 


0.0 




Lung fibroblast IL-4 


o.o 


Ramos (B cell) none 


0.0 


Lunij iihr >bl .isi IL-9 


0.0 


Ramos (B cell) 


0.0 


1 Liny fibroblast IL-13 


0.0 
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ionomycin 


j 




B lymphocytes PWM 


Q Q |Lung fibroblast IFN 

jgamma 


0.0 


B lymphocytes CD40L 
and IL-4 


0.0 


Dermal fibroblast 
CCD 1070 rest 


0.0 


EOL-1 dbcAMP 


0.0 


Dermal fibroblast 
CCD1070 TNF alpha 


0.0 


L;< )L-1 dbcAMP 
PMA/ionomycin 


0.0 


Dermal fibroblast 
CCD 1070 IL-1 beta 


0.0 


Dendritic cells none 


Q Q Dermal fibroblast IFN 
gamma 


0.0 


Dendritic cells LPS 


0.0 


Dermal fibroblast IL-4 


0.0 


Dendritic cells anti- 
CD40 


0.0 


IBD Colitis 1 


100.0 


Monocytes rest 


0.0 


1BD Colitis 2 


0.0 


Monocytes LPS 


0.0 |IBD Crohn's 


0.0 


Macrophages rest 
Macrophages LPS 
MUVLC none 


- °-° ' " "]c°? on 

........ L U ng 

0.0 lii inn 


0.0 

0.0 " I 

'"6.6"" I 


HUVEC starved 


0.0 IKidney 


0.0 



CNS_neurodegeneration_vl.0 Summary: Ag3363 - Expression of this gene is 
low/undetectable (CTs > 35) across all of the samples on this panel (data not shown). 



General_screening_panel_vl.4 Summary: Ag3363 - Significant expression is seen in 
lung cancer cell line NCI-H146 (CT=34.5) and lung cancer cell line SHP-77 (CT=34.2). 
5 Therefore, expression of this can be used to distinguish these samples from the rest of the 
samples on this panel. 

Panel 1.2 Summary: Agl 130/Agl 198 - Three different runs using the same primer 
sequences yield similar results. Significant expression of this gene is seen in testis and a 
colon cancer sample. Therefore, expression of this gene can be used to differentiate these 
10 samples from other samples on these panels. Results from a third experiment using the 

probe and primer set Agl 253 show low/undetectable levels of expression in all the samples 
on this panel. 

Panel 1.3D Summary: Agl253 - Expression of this gene is low/undetectable (CTs > 35) 
across all of the samples on this panel (data not shown). 
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Panel 2D Summary: Agl603 - Expression of this gene is low/undetectable (CTs > 35) 
across all of the samples on this panel (data not shown). 

Panel 4D Summary: Agl 130/Agl 198/Agl253/Ag3363 - Two experiments showed 
possible experimental difficulties, while the other three runs showed expression of this 
gene as low/undetectable (CTs > 35) across all of the samples on the panel. 

Panel 4R Summary: Agl 198 - Significant expression of this gene is seen only in the IBD 
colitis 1 sample (CT=34.2). Therefore, expression of this gene can be used to differentiate 
this sample from others on the panel. 

Panel CNS_1 Summary: Agl253/Agl603 - Expression of this gene is low/undetectable 
(CTs > 35) across all of the samples on this panel (data not shown). 

E. CG58516-01: G-protein beta WD-40 repeats 

Expression of gene CG585 16-01 was assessed using the primer-probe set Ag3362, 
described in Table EA. Results of the RTQ-PCR runs are shown in Tables EB and EC. 



Table EA . Probe Name Ag3362 



Primers 




Sequences 


Length 


Start 
Position 


SEQID 
NO: 


Forward 


5 ■ -gtcgggcaggacctttact-3 ' 


l«> 


1474 


381 


Probe 


TET-5 1 -tcctacagctaattctgcagggcaca-3 ' - 
TAMRA 


26 


1498 


382 


Reverse 


5 ' -tacgctttactcccgtaagtca-3 ' 


22 


1543 


383 



Table E B. CNS_neurodegeneration_vl.O 



Tissue Name 


Rel.Ex P .(%)Ag3362T TissueName 
Run 210153738 H^uelName 


Rel. Exp.(%) Ag3362, 
Run 210153738 


AD 1 Hippo 


9 9 jControl (Path) 3 
: Temporal Ctx 


0.0 


AD 2 Hippo 


33.2 


Control (Path) 4 
Temporal Ctx 


24.3 


AD 3 Hippo 


4.3 


AD 1 Occipital 
Ctx 


2.0 


AD 4 Hippo 


16.5 


AD 2 Occipital 
Ctx (Missing) 


0.0 


AD 5 hippo 


96.6 


AD 3 Occipital 
Ctx 


5.4 
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AD 6 Hippo 


43.2 


AD 4 Occipital 
Ctx 


24.7 


| Control 2 Hippo 


29.1 


AD 5 Occipital 
Ctx 


24.5 


Control 4 Hippo 


16.6 


AD 6 Occipital 
Ctx 


31.9 


Pnntrnl /"Path"! "3 

Hippo 


3.8 


Control 1 Occipital 
Ctx 


0.9 


AD 1 Temporal Ctx 


7.1 


Control 2 Occipital 
Ctx 


89.5 


AD 2 Temporal Ctx 


23.2 


Control 3 Occipital 
Ctx 


12.6 


AD 3 Temporal Ctx 


5.6 


Control 4 Occipital 
Ctx 


6.3 


AD 4 Temporal Ctx 


20.0 


Control (Path) 1 
Occipital Ctx 


65.1 


AD 5 Inf Temporal 
Ctx 


1000 


Control (Path) 2 
Occipital Ctx 


— — — ■ 

15.8 


AD 5 SupTemporal 
Ci\ 


43.8 


Control (Path) 3 
Occipital Ctx 


2.0 


AD o int 1 emporal 

Ctx 


30.8 


Control (Path) 4 
Occipital Ctx 


11.6 


AD 6 Sup Temporal 

Ctx 


69.7 


Control 1 Parietal 
Ctx 


2.8 


Control 1 Temporal 
Ctx 


9.0 


Control 2 Parietal 
Ctx 


39.2 


Control 2 Temporal 
Ctx 


5 o 0 jControl 3 Parietal ! 

JCtx 


23.5 


Control 3 Temporal 

Ctx 


11.7 


Control (Path) 1 
Parietal Ctx 


69.7 


Control 4 Temporal 
Ctx 


— H 

8.2 

- _ i 


Control (Path) 2 
Parietal Ctx 


14.9 


Control (Path) 1 
Temporal Ctx 


56.3 


Control (Path) ? 
»arietal Ctx 


0.9 


Control (Path) 2 
Temporal Ctx 


34.2 


:ontrol (Path) 4 
Parietal Ctx 


38.7 



Table EC . General_screening_panel_vl.4 



r — — i 


Rel. Exp.(%) 




Rel. Exp.(%) 


I Tissue Name 


Ag3362, Run 


Tissue Name 


Ag3362, Run 




216523482 




216523482 


Adipose 


6.3 


Renal ca. TK-10 


44.1 


Melanoma : 


17.6 






|Hs688(A).T 


Bladder 


9.4 
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Melanoma* 
Hs688(B).T 


18.3 


Gastric ca. (liver met.) 
(NCI-N87 


21.6 


Melanoma* Ml 4 


17.1 


Oniric r;i K A TO Iff 

VJaoLI IL/ ta. iVrV JL KJ 111 


| "7 , 


j Melanoma* 
LOXIMVI 


13.6 


Colon ca. SW-948 


5.8 


Melanoma* SK- 
MEL-5 


19.6 


Colon ca. S W480 


34.6 


Squamous cell 
carcinoma SCC-4 


14.6 


C«.|<.ii ca.* (SW480 
met) SW620 


14.2 


Testis Pool 


4.0 


Colon ca. 1 IT29 


7.2 


Prostate ca.* (bone 
met) PC-3 


90.8 


Colon ca. HCT-116 


14.3 


Prostate Pool 


4.0 


Colon ca. CaCo-2 


19.8 




11 4 


Colon cancer tissue 


3.6 


Uterus Pool 


2.1 


Colon ca. SW1116 


9.4 


Ovarian ca. 
OVCAR-3 


17.4 


Colon ca. Colo-205 


8.8 


Ovarian ca. SK- 
OV-3 


47.0 


Colon ca. SW-48 


13.2 


Ovarian ca. 


14.7 


Colon Pool 


5.7 


Ovarian ca. 
OVCAR-5 


31.6 


Small Intestine Pool 


10.2 


IGROV-1 


12.9 


Stomach Pool 


6.2 


Ovarian ca. 
OVCAR-8 


6.7 


' -— — . 

Bone Marrow Pool 


1.3 

— — ~~ -. 4 


Ovary 


12.5 


; ct il ! fcart 


1.1 


Breast ca. MCF-7 


75.8 


Heart Pool 


3.4 


Breast ca. MDA- 
MB-231 


30.4 


Lymph Node Pool 


8.7 


Breast ca. BT 549 


65.5 


Tc 1 il K l t. lei tl ivlif-dc 


2. ? 


Breast ca. T47D 


100.0 


Skeletal Muscle Pool 


9.4 


Breast ca. MDA-N 


33 4 


opiccn rooi 


4.6 


Breast Pool 


4.6 


1 hymus Pool 


7.3 


Trachea 


7.7 


CNS cancer 
(glio/astro) U87-MG 


33.9 


Lung 


4.9 


CNS cancer 
(glio/astro) U-118-MG 


27.2 


Fetal Lung 


7.1 


CNS cancer 
(neuro:met) SK-N-AS 


16.0 


Lung ca. NCI-N417 


9.3 


CNS t incci (astro) 

539 


14.3 


Lungca. LX-1 


15.8 


CNS cancer (astro) 
SNB-75 


60.7 
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Lung ca.NCI-H146 


4.9 


CNS cancer (glio) 
SNB-19 


13.8 


.Lung ca. bJtir- / / 


16.5 


CNS cancer (3H0) SF- 
295 


28.5 


Lung ca. A549 


27.2 


Brain (Amygdala) Pool 


5.3 


Lungca. NCI-H526 


4.1 


Brain (cerebellum) 


5.0 


Lung ca. NCI-H23 


15.0 


Brain (fetal) 


16.4 


JLung ca. JNL-l-ht4oO 


9.5 


Brain (Hippocampus) 
Pool 


5.5 


Lung ca. HOP-62 


7.6 


Cerebral ( 'ortex Pool 


8.7 


Lung ca. NCI-H522 


18.2 


Brain (Substantia 
nigra) Pool 


8 3 

- - - | 


Liver 


o-o 


Brain ( I halamus) Pool 


6 3 


Fetal I iver 
Liver ca. I [ep< .2 


7.3 
2^.5 


Brain (whole) 
Spinal Cord Pool 


7.0 
5.6 


kidnev Pool 


17.7 


Adicnal ( .1 nd 


6.3 


Fetal kidries 


4.6 


Pituitan »'and Pool 


0.8 


Renal ca. 786-0 


17.2 


Salivary Gland 


5.6 


Ren.il ca \498 




Truro id ( female) 


9.7 


Renal ca. ACHN 


17.3 


Pancreatic ca. 
CAPAN2 


11.7 


Renal ca. UO-3 1 


III 


Pancreas Pool 


9.2 



CNS_neurodegeneration_vl.O Summary: Ag3362 Highest expression of the CG58516- 
01 gene is seen in the occipital cortex of a control patient and the temporal cortex of an 
Alzheimer's patient. While the CG585 16-01 gene does not appear to be preferentially 
expressed in Alzheimer's disease, this panel confirms expression of the CG585 16-01 gene 
5 at moderate/high levels in the brain in an additional set of individuals. Please see Panel 1 .4 
for discussion of potential utility of this gene in the central nervous system. 

General_screening_panel_vl.4 Summary: Ag3362 The CG585 16-01 gene is widely 
expressed in this panel, with highest expression in the breast cancer cell line T47D 
(CT=29). Significant expression is also seen in cell lines derived from prostate, breast and 
10 ovarian cancers. In general, expression of the CG5 85 16-01 gene appears to be greater in 
the cancer cell lines than in normal tissue. Thus, the expression of this gene could be used 
to distinguish these cell line types from others in the panel. 

Among tissues involved in central nervous system function, this gene is expressed 
at low but significant levels in all brain regions examined. This gene encodes a protein with 
15 a putative zinc-finger motif. Since these proteins are known to interact with nucleic acids, 
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this suggests that this gene product may play a potential role in transcription. Thus, 
therapeutic modulation of the CG585 16-01 gene product may be used to regulate the 
transcription of disease-related proteins such as ataxin, huntingtin, or various apoptosis 
cascade proteins. 

5 Among tissues with metabolic function, this gene is expressed at low levels in 

pituitary, adipose, adrenal gland, pancreas, thyroid, skeletal muscle, heart, and fetal liver. 
This widespread expression among these tissues suggests that this gene product may play a 
role in normal neuroendocrine and metabolic and that disregulated expression of this gene 
may contribute to neuroendocrine disorders or metabolic diseases, such as obesity and 
1 0 diabetes. 

References: 

1. Zhu W, Chan EK, Li J, Hemmerich P, Tan EM. (2001) Transcription activating 
property of autoantigen SG2NA and modulating effect of WD-40 repeats. Exp Cell Res. 
269(2):3 12-21 

15 Panel 4D Summary: Ag3362 Results from one experiment with the CG58516-01 gene 
are not included because the amp plot corresponding to the run indicates that there were 
problems with the experiment. 

F. CG58473-01: PROTEIN KINASE 

Expression of gene CG58473-01 was assessed using the primer-probe set Ag3357, 
20 described in Table FA. Results of the RTQ-PCR runs are shown in Tables FB and FC. 



Table FA . Probe Name Ag3357 



Primers Sequences 

. ......J 


Length 


Start 
Position 


SEQID 
NO: 


Forward 5' gtca iggt ggc ;< taaa ttc-3 ' 


21 


853 


384 


p„.i jTET-5 ' -ccaggacctcatctccaagctgctta-3 ' - 
rrODe [TAMRA 


26 


897 


385 


Reverse |5 • -agccgttctgaggggttat-3 ' 


19 


926 


386 



Table FB . General_screening_panel_vl.4 



T iss ue Name , Rel E xp.Q/r) ± Tissue Name j Rel" Exp..(Vb) ; 
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Ag3357, Run 
216523477 




A »'3357„ Run 
216523477 


Adipose 


0.0 


Renal ca. TK-10 


13.2 


Melanoma* 
Hs688(A).T 


0.0 


Bladder 


7.2 


Melanoma* 
Hs688(B).T 


1.1 


Gastric ca. (liver met.) 
NCI-N87 


5.4 


Melanoma* Ml 4 


50.0 


i i i\ti 'o ca KA 1 0 111 




Melanoma* 
LOXIMVI 


33.0 


Colon ca. SW-948 


14.9 


Melanoma* SK- 
MEL-5 


24.7 


Colon ca. SW480 


95.9 


Squamous cell 
carcinoma SCC-4 


11.6 


Colon ca.* (SW480 
met) SW620 


53.6 


Testis Fool 


8.2 


Colon ca. HT29 


10.3 


Prostate ca.* (bone 
met) PC-3 


3.2 


Colon ca. HCT-116 


76.3 


Prostate Pool 


0.0 


Colon ca. CaCo-2 


14.1 


Placenta 


2.4 


Colon cancer tissue 


6 3 


Uterus Pool 


0.0 


Colon ca. SW1116 


18.6 


Ovarian ca. 
OVCAR-3 


51.1 


Colon ca. Colo-205 


"'243 


Ovarian ca. SK- 
OV-3 


53.2 


Colon ca. SW-48 


26.1 


Ovarian ca. 
OVCAR-4 


10.4 


Colon Pool 


4.6 


OVCAR-5 


12.3 

I — _ 


— *— — 1 

Small Intestine Pool 


1.7 


Ovsrisn C3.. 
IGROV-l 


10.1 


Stomach Pool 


1.2 


Ovarian ca. 
OVCAR-8 


13.4 


Bone Marrow Pool 


1.0 


Ovary 


0.0 


Fetal Heart 


0.0 


Breast ca. MCF-7 


20.3 


Heart Pool 


0.0 


Breast ca. MDA- 
MB-231 


65.1 


Lymph Node Pool 


1.4 


Breast ca. BT 549 


100.0 


1 etal Skeletal Muscle 


0.0 


Bre^t i inn 


34.2 


Skeletal Muscle Pool 


1.6 


Breast ca. MDA-N 


36.3 


Spleen Pool 




Breast Pool 


1.3 


Thymus Pool 




Trachea 


0.0 


CNS cancer 
(glio/astro) U87-MG 


7.8 


Lung 


0.0 


CNS cancer 
(glio/astro) U-118-MG 


54.0 
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Fetal Lung 


5.0 


CNS cancer 
(neuroimel) Sk-\-<\S 


7.9 


Lung ca. NCI-N417 


17.9 


CNS cancer (astro) SF- 


22.4 


Lung ca. LX-1 


28.5 


CNS cancer (astro) 
SNB-75 


19.2 


jLung ca.NCI-H146 


74.7 


CNS cancer (glio) 
SNB-19 


14.6 


Lung ca. SHP-77 


40.6 


CNS cancer (glio) SF- 
295 


3.0 


|Lung ca. A549 


64.6 fBrain (Amygdala) Pool 


0.0 


jLung ca. NCI-H526 


23.8 ! Brain (cerebellum) 


0.0 


(Lung ca. NCI-H23 


63.7 (Brain (fetal) 


0.0 


Lung ca. NCI-H460 


n Q IBrain (Hippocampus) 
°' 8 Pool 


0.0 


Lung ca. HOP-62 


2.0 JCerebral Cortex Pool 


0.0 


T linn r-n XT^T U^OO 

ijUng C3.. JNCl-riOzz 


^ ^ (Brain (Substantia 
(nigra) Poo! 


2.6 


Liver 

Fetal Liver 


0.0 Brain (Thalamus) Pool 
" " 0.6 Brain (whole) 


2.5 


Liver ca. HepG2 


1 1 .4 Spinal Cord Pool 


0~0 


Kidney Pool 


0.0 Adrenal Claud 


0.0 


I'elal Kidney 


3.1 Pituitan gland Pool 
20.0 Salivar) (Hand 


1.4 


Renal ca. 786-0 


0.0 


Renal ca. A498 


3.6 jlhyroid (female) 


0.0 


Renal ca. ACHN 


18.9 


Pancreatic ca. 
CAPAN2 


20.4 


Renal ca. UO-31 


10.4 I 


Pancreas Pool 


1.3 



Table FC . Panel 4D 



Tissue Name 


Rel. Exp.(%) 
Ag3357, Run 
165231196 


Tissue Name 


Rel. Exp.(%) 
Ag3357, Run 
165231196 


Secondary Thl act 


9.0 


HUVEC IL-lbeta 




Secondary Th2 act 


43.2 


HUVEC I FN gamma 


6.3 


Secondary Trl act 


46.0 


III V] C fNF alpha 
ll-N gamma 


7.3 


Secondary Thl rest 


6.7 


HUVEC TNF alpha + 
IL4 


25.3 


Second: ry Th2 rest 


12.2 


11UVLC IL-11 


13.1 


Secondary Trl rest 


1.9 


Lung Microvascular EC 
none 


3.3 | 


Prinur\ Thl act 


6.1 


Lung Microvascular F.( ' 


1 * 1 
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TNFalpha + IL-lbeta 




Primary Th2 act 


2| g ((Microvascular Dermal 
IiC none 


..... 

10.9 


Primary Trl act 


33.0 


Microsvasular Dermal 
EC TNFalpha + IL-lbeta 


7.3 


Primary Thl rest 


28.1 


Bronchial epithelium 
TNFalpha + 1L1 beta 


1.9 


Primary Th2 rest 


12.1 


Small airway epithelium 
none 


3.6 


Primary Trl rest 


29.7 




Small airway epithelium 
TNFalpha + 1L-1 beta 


36.3 


CD45RA CD4 
lymphocyte act 


28.5 


Coronery artery SMC rest 


0.0 


CD45RO CD4 
lymphocyte set 


39.8 


Coronery arterv SMC 
TNFalpha + IL-lbeta 


0.0 


CD8 lymphocyte act 


18.6 


Astrocytes rest 


1.4 


Secondary CD8 
1) riipliocyte rest 


26.8 


Astrocytes TNFalpha + 
IL-lbeta 


1.2 


Secondary CD8 
lymphocyte act 


19.2 


KU-812 (Basophil) rest 


18.2 


CD4 lymphocyte none 


10.6 


KU-812 (Basophil) 
PMA/ionomycin 


30.4 


2ry Thl/Th2/Trl_anti- 
CD95 CHI 1 


15.6 


CCD1106 

(Keratinocytes) none 


18.3 


LAK cells rest 


0.0 


CCD1106 
(Keratinocytes) 
TNFalpha + IL-lbeta 


7.7 


LAK cells IL-2 


42.6 iLiver cirrhosis 


25.7 


LAK cells 1L-2+IL-12 


24.0 Lupus kidney 


0.0 


1 AK cells 1L-2+IFN 
gamma 


24.8 


NCI-H292 none 


7.8 


LAK cells IL-2+IL-18 


40.3 


NCI-H292 IL-4 


26.4 


LAK cells 
PMA/ionomycin 


0.0 


NCI-H292 IL-9 


29.7 


NK Cells IL-2 rest 


23.5 


NCI-H292 IL-13 


20.7 


Two Way MLR 3 day 


13.7 |NCI-H292 IFN gamma 


27.9 


Two Way MLR 5 day 


13.2 JHPAEC none 


8.6 


Two Way MLR 7 day 


I IPAEC TNFalpha + IL- 
jl beta 


2.4 


PBMC rest 


0.0 (Lung fibroblast none 


5.5 


PBMC PWM 


52.1 


Lung fibroblast TNF 
alpha + IL-1 beta 


2.2 


PBMC PHA-L 


14.6 


Lung fibroblast II -4 


0.0 


Ramos (B cell) none 


16.5 


Lung fibroblast IL-9 


0.0 


Ramos (B cell) 


14.7 


1 ung fibroblast II -13 


0.0 



519 



ionomycin 






B lymphocytes PWM 


j 00 q |Lung fibroblast IFN 
igamma 


_ 

0.0 


B lymphocytes CD40L 
and IL-4 


10.4 

_ 


Dermal fibroblast 
CCD1 070 rest 


40.1 


EOL-1 dbcAMP 


9.9 


Dermal fibroblast 
CCD1070TNF alpha 


43.8 


EOL-1 dbcAMP 
PMA i< >nomycin 


13.2 


Dermal fibroblast 
CCD 1070 IL-1 beta 


23.5 


Dendritic cells none 


4 7 jDermal fibroblast IFN 

jgamma 


3.7 


Dendritic cellsl.PS 


1.1 


Dermal fibroblast IL-4 


4.6 


Dendritic cells anti- 
CD40 


0.0 


IBD Colitis 2 


0.0 


Monocytes rest 


0.0 IBD Crohn's 


0.0 


Monocytes LPS 


0.0 Colon 2S.1 


Macrophages rest 
Macrophages LPS 


4.3 Lung 59.0 

0.0 rhynnis 0.0 


HUVEC none 


28.3 Kidney 10.0 


III VIC starved 





CNS_neurodegeneration_vl.O Summary: Ag3357 - Expression of this gene is 
low/undetectable (CTs > 35) across all of the samples on this panel (data not shown). 



General_screening_panel_vl.4 Summary: Ag3357 This gene is primarily expressed in 
cancer cell lines, with highest expression in a breast cancer cell line BT 549(CT=32.8). 
5 This gene is expressed in the following cell lines but not the corresponding healthy tissue: 
gastric, brain, colon, lung, breast, ovarian cancer and melanomas. Thus, expression of this 
gene could be used as a diagnostic marker for the presence of these cancers. Furthermore, 
therapeutic inhibition using antibodies or small molecule drugs might be of use in the 
treatment of these cancers. 

10 Panel 4D Summary: Ag3357 Highest expression of the CG58473-01 gene is seen in 
pokeweed mitogen-activated purified peripheral blood B lymphocytes (CT=33.2). In 
addition, no expression of the transcript is seen in PBMC that contain normal B cells, but 
the transcript is induced when PBMC are treated with the B cell selective pokeweed 
mitogen. The transcript is not seen in the B cell lymphoma cell line Ramos regardless of 

1 5 stimulation. Thus, the putative protein encoded by this gene could potentially be used 

diagnostically to identify activated B cells. Therefore, therapeutics that antagonize the 

function of this gene product may be useful as therapeutic drugs to reduce or eliminate the 
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symptoms in patients with autoimmune and inflammatory diseases in which B cells play a 
part in the intiation or progression of the disease process, such as lupus erythematosus, 
Crohn's disease, ulcerative colitis, multiple sclerosis, chronic obstructive pulmonary 
disease, asthma, emphysema, rheumatoid arthritis, or psoriasis. 

G. CG58470-01: UDP-N-ACETYLHEXOSAMINE 
PYROPHOSPHORYLASE 

Expression of gene CG5 8470-01 was assessed using the primer-probe set Ag5940, 
described in Table GA. 



Table G A. Probe Name Ag5940 



Primers 


Sequences 


1 

Length 


Start 
Position 


SEQ | 
ID | 

NO: : 


r>-i\* irJ 


5 ' -atatcctgaagct icaa agttagct-3 1 


26 [ 


422 




Probe 


TET-5 ' -tggcaacaaatgcattattccataU acg- 3 1 - 
TAMRA 


29 J 


459 


388 


Reverse 


-c ftgaaci < uccatg-3 ' 


20 f 


489 


389 ( 



General_screening_panel_vl.5 Summary: Ag5940 - Expression of this gene is 
low/uncletectable (CTs > 35) across all of the samples on this panel (data not shown). 



Panel 5 Islet Summary: Ag5940 - Expression of this gene is low/undetectable (CTs > 35) 
across all of the samples on this panel (data not shown). 

EL CG58593-01: UBIQUITIN-52 



Expression of gene CG58593-01 was assessed using the primer-probe set Ag3421, 
described in Table HA. 

Table HA . Probe Name Ag3421 



Primers 


Sequences 


Length 


Start 
Position 


SEQ ID 
NO: 


Forward 


5 ' -atctgctgcaagtgctatgc-3 1 


"0 


291 




Probe 


TET-5 1 -cggtgctatcaactgccacaagaaga-3 1 - 
TAMRA 


26 


323 




Reverse 


5 ' -tgaccttcttcctggggtac-3 ' 


20 


371 
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CNS_neurodegeneration_vl.O Summary: Ag3421 - Expression of this gene is 
low/undetectable (CTs > 35) across all of the samples on this panel (data not shown). 



General_screening_panel_vl.4 Summary: Ag3421 - Expression of this gene is 
low/undetectable (CTs > 35) across all of the samples on this panel (data not shown). 

5 Panel 4D Summary: Ag3421 - Expression of this gene is low/undetectable (CTs > 35) 
across all of the samples on this panel (data not shown). 

I. CG5 7871-01: TOUSLED-LIKE KINASE 

Expression of gene CG57871-01 was assessed using the primer-probe set Ag3351, 
described in Table IA. Results of the RTQ-PCR runs are shown in Tables IB and IC. 



1 0 Table IA. Probe Name As335 1 


i 

jPrimers 


Sequences 


Length 


Start 
Position 


SEQ II) 

NO: 


[Forward 


5 1 -gatcctcactgcaacattcttt-3 ' 




346 


393 


|Probe 


TET-5 ' -aatcccttaccgcgacgagtagaaca-3 1 - 
TAMRA 


26 


372 


394 


Reverse 


5 1 -gcactgccatctaaaccataga-3 ' 


!: 


403 





Table IB . CNS_neurodegeneration_vl.O 



Tissue Name 


Rel. Exp.(%) Ag3351, j ^ 

Run 210141594 | 1,ssueJName 


Rel. Exp.(%) Ag3351, 
Run 210141594 


AD 1 Hippo 


1Q4 iControl (Path) 3 
Temporal Ctx 


3.0 


AD 2 Hippo 


,, 4 Control (Path) -1 
Temporal Ctx 


65 1 


AD 3 Hippo 


- _ jAD 1 Occipital 

JCtx 


20.2 


AD 4 H ippo 


8 4 AD 2 Occipital 
( i\ (Missing) 


0.0 


AD 5 hi ppo 


100.0 


AD 3 Occipital 
Ctx 


3.8 


AD 6 Hippo 


33.4 


AD 4 Occipital 

Ctx 


45.1 


Control 2 Hippo 


29.9 


AD 5 Occipital 
Ctx 


15.2 


Control 4 Hippo 


6.7 


AD 6 Occipital 
Ctx 


46.7 





v_-0nir01 D 

Hippo 


3.7 "~ 


Control 1 Occipital 
Ctx 


2.7 




AD 1 Temporal Ctx 


16.8 


Control 2 Occipital 
Ctx 


52.5 




AD 2 Temporal Ctx 


45.1 


Control 3 Occipital 
Ctx 


45.4 




AD 3 Temporal Ctx 


6.9 




Control 4 Occipital 
Ctx 


6.3 




AD 4 Temporal Ctx 


54.0 




Control (Path) 1 
Occipital Ctx 


79.0 


H 


AD 5 Inf Temporal 
Ctx 


92.0 


'Control (Path) 2 
Occipital Ctx 


34.4 


1 


AD 5 SupTemporal 
Ctx 






Control (Path) 3 
( >ccipital Ctx 


0.8 




AD 6 Inf Temporal 
Ctx 


48.6 


(Control (Path) 4 
Occipital Ctx 


40.6 


a 


AD 6 Sup Temporal 
Ctx 


56.6 




Control 1 Parietal 
Ctx 


6.9 


o 


Control 1 Temporal 
Ctx 


6.2 


1 


Control 2 Parietal 
Ctx 


48.0 




Control 2 Temporal 
Ctx 


29.3 




Control 3 Parietal 
Ctx 


26.1 


m 


Control 3 Temporal 
Ctx 


32.8 




Control (Path) 1 
Parietal Ctx 


73.7 




Control 4 Temporal 
Ctx 


13.9 




Control (Path) 2 
Parietal Ctx 


57.4 




Control (Path) 1 
Temporal Ctx 


79.6 


(Control (Path) 3 
(Parietal Ctx 


3.4 




Control (Path) 2 
Temporal Ctx 


97.3 


[Control (Path) 4 
Parietal Ctx 


78.5 



Table IC . Panel 4D 



Tissue Name 


Rel. Exp.(%) 
Ag3351, Run 
165222896 


j Tissue Name 


Rcl. Exp.(%) 
Ag3351, Run 
l(.5222S')(. 


Secondary Hi 1 act 


16.5 


jHUVEC IL-lbeta 


15.4 


Secondary Th2 act 


26.4 


fHUVEC irN gamma 


13.5 


Secondary Trl act 


23.3 


jHUVEC TNF alpha + 
iIFN gamma 


17.0 


Secondary Th 1 rest 


6.0 


IHUVEC TNF alpha + 
|lL4 


11.0 


Secondary Th2 rest 


10.7 


HUVEC IL-1 1 


5.4 


Secondary Trl rest 


2.1 


ILung Microvascular EC 
none 


12.4 
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Primary Thl act 


19.2 


Lung Microvascular EC 
1 \l ilpha i IL-lbeta 


9.6 


Primary Th2 act 


. _ , Microvascular Dermal 
17.6 .... 

EC none 


14.7 


Primary Trl act 


, , 1 iMicrosvasular Dermal 

EC TNFalpha HL-lbeta 


14.8 


Primary Thl rest 


,. . fBronchial epithelium 
jTNFalpha + ILlbeta 


14.1 


Primary Th2 rest 


43 8 Small airway epithelium 
|none 


7.7 


Primary Trl rest 


1 r g 1 Small airway epithelium 
|TNFalpha + IL-lbeta 


50.3 


CD45RA CD4 
lymphocyte cict 


13.0 


Coronery artery SMC rest 


15.6 


CD45RO CD4 
[lymphocyte ctct 


21.0 


Coronery artery SMC 
TNFalpha + IL-lbeta 


6.1 


CDS lymphocyte act 


12.9 Astrocvles rest 11.5 


| Secondary CD8 
[lymphocyte rest 


14.9 Astrocytes TNFalpha + : |]g 
flL-lfoetci | 


[Secondary CD8 
[lymphocyte act 


14.8 KU-812 (Basophil) rest 


19.2 


CD4 lymphocyte none 


1Q ? lKU-812 (Basophil) 
i PMA/ionomycin 


54.0 


2r> lhlTh2'l'rl anti- 

iCD95 CHI 1 


[27 CCD1106 

(Keratinocytes) none 


12.2 


LAK cells rest 


ICCD1106 
17.2 | (Keratinocytes) 

TNFalpha + IL-lbeta 


9.0 


LAK cells IL-2 


22.4 Liver cirrhosis 


7.4 


OLAK cells IL-2+IL-12 


20.4 1 Lupus kidney 


3.4 


LAK cells IL-2+IFN 
gamma 


37.9 JNCI-H292 none 


47.6 


LAK cells IL-2+IL-18 


18.6 3NCI-H292 IL-4 


42.3 


LAK cells 
PMA/ionomycin 


10.5 [NCI-H292 IL-9 


30.4 


NK Cells IL-2 rest 


17.8 NCI-H292 IL-13 J 15.7 


Two Way MLR 3 day 


33 .2 jNCI-H292 IFN gamma 


25.5 


Two Wax- MLR 5 day 


10.6 iHPAECnone 


13.5 


Two Way MLR 7 day 


HPAEC rNF alpha II - 
,1 beta 


17.7 


PBMC rest 


12.8 jLung fibroblast none 


11.5 


PBMC PWM 


63 1 ung fibroblast TNF 
alpha + IL-1 beta 


12.4 


PBMC PHA-L 


18.0 Lung fibroblast IL-4 


31.2 


R im< • (He none 


14.0 Lung fibroblast IL-9 


22.2 
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Ramos (B cell) 
ionomycin 


"""" j — — ~~ 

77.9 jLung fibroblast IL- 1 3 


27.4 


B lymphocytes PWM 


10Q I Aing fibroblast IFN 
(gamma 


44.8 


and IL-4 


^ „ Dermal fibroblast 
|CCD1070rest 


33.7 


EOL-1 dbcAMP 


, , , jDermal fibroblast 

CCD1070 TNF alpha 


50.0 


FOI.-l dbcAMP 
PMA/ionomycin 


13.7 


Dermal fibroblast 
CCD1070IL-1 beta 


13.4 


Dendritic cells none 


14.7 


Dermal fibroblast IFN 
gamma 


14.3 


Dendritic cells LPS 


19.8 Dermal fibroblast IL-4 


25.7 


Dendritic cells anti- 
CD40 


14.2 


IBD Colitis 2 


2.0 


Monocytes rest 


22.5 IB 1.) Crohn's 


3.2 


Monocytes LPS 


32.8 


Colon 


26.8 


Macrophages rest 


31.0 


Lung 


14.6 


Macrophages LPS 


30.8 


Thymus 


_ 28.7 


HUVEC none 


18.3 


Kidney 


45.4 


HI J V EC starved 


45.7 





CNS_neurodegeneration_vl.O Summary: Ag3351 - This panel confirms the expression 
of this gene at low levels in the brain in an independent group of individuals. While no 
differential expression of this gene is detected between Alzheimer's diseased postmortem 
brains and those of non-demented controls, the widespread expression of this gene in the 
5 brain suggests that therapeutic modulation of the expression or function of this gene may 
be effective in the treatment of neurologic disorders such as Parkinson's disease, epilepsy, 
stroke and multiple sclerosis. 

General_screening_panel_vl.4 Summary: Ag3351 - Results from one experiment are 
not included. The amp plot indicates that there were experimental difficulties with this run. 

10 Panel 4D Summary: Ag335 1 The CG57871-01 gene is expressed at high to moderate 
levels in a wide range of cell types of significance in the immune response in health and 
disease. These cells include members of the T-cell, B-cell, endothelial cell, 
macrophage/monocyte, and peripheral blood mononuclear cell family, as well as epithelial 
and fibroblast cell types from lung and skin, and normal tissues represented by colon, lung, 

1 5 thymus and kidney. This ubiquitous pattern of expression suggests that this gene product 
may be involved in homeostatic processes for these and other cell types and tissues. This 
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pattern also suggests a role for the gene product in cell survival and proliferation. 
Therefore, modulation of the gene product with a functional therapeutic may lead to the 
alteration of functions associated with these cell types and lead to improvement of the 
symptoms of patients suffering from autoimmune and inflammatory diseases such as 
5 asthma, allergies, inflammatory bowel disease, lupus erythematosus, psoriasis, rheumatoid 
arthritis, and osteoarthritis. 

J. CG58590-01 and CG58590-02: PALS Guanylate kinase 

Expression of gene CG58590-01 and CG58590-02 was assessed using the primer- 
probe set Ag3380, described in Table JA. Results of the RTQ-PCR runs are shown in 
10 Tables JB, JC and JD. Please note that CG58590-02 represents a full-length physical clone 
of the CG58590-01 gene, validating the prediction of the gene sequence. 

Table JA . Probe Name Ag3380 



jPrimers Sequences 


Length 


Start 
Position 


SEQID 
NO: 


1 orward *" r " t » l igc^attjt . t j 1 




1931 " 




p,.„u.., jTET-5 ' - ccgatcttgataaagcctatcaggaa-3 ' - 
P° Pe iTAMRA 


26 


1953 


397 


Reverse 5 ' -cccactgaggttcagtatcaag-3 ' 


22 


2000 


~398 



Table JB . CNS neurodegeneration vl.O 



Tissue Name 


Rcl. Exp.(%) Ag3380, 
Run 210153753 


Tissue Name 


ReL Exp.(%) Ag3380, 
Run 210153753 


AD 1 Hippo 


12.9 


Control (Path) 3 
Temporal Ctx 


4.7 


AD 2 Hippo 


27.7 


Control (Path) 4 
Temporal Ctx 


24.3 


AD 3 Hippo 


4.8 


AD 1 Occipital 
Ctx 


15.6 


AD 4 Hippo 


7.7 


AD 2 Occipital 
Ctx (Missing) 


0.0 


AD 5 hippo 


100.0 


AD 3 Occipital 
Ctx 


7.5 


AD 6 Hippo 


64.2 


AD 4 Occipital 
Ctx 


19.1 


Control 2 Hippo 


25.5 


AD 5 Occipital 
Ctx 


29.5 


Control 4 Hippo 


9.9 


AD 6 Occipital 


40.1 
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Ctx 




Control (Path) 3 
Hippo 


8.4 


Control 1 Occipital 
Ctx 


4.2 


AD 1 Temporal Ctx 


17.6 


Control 2 Occipital 
C tx 


65.5 


AD 2 Temporal Ctx 


25.3 


Control 3 Occipital 
Ctx 


13.4 


AD 3 Temporal Ctx 


4.9 


Control 4 Occipital 
Ctx 


6.4 


AD 4 Temporal Ctx 


17.4 


Control (Path) 1 
Occipital Ctx 


78.5 


AD 5 Inf Temporal 
Ctx 


81.8 


Control (Path) 2 
Occipital Ctx 


9.4 


AD 5 SupTemporal 
Ctx 


42.9 


Control (Path) 3 
Occipital Ctx 


3.2 


AD 6 Inf Temporal 
Ctx 


48.6 


Control (Path) 4 
( )ccipital Ctx 


9.9 


AD 6 Sup Temporal 
Ctx 


53.6 


Control 1 Parietal 
Ctx 


6.0 


Control 1 Temporal 
Ctx 


5.7 


Control 2 Parietal 
Ctx 


37.1 


Control 2 Temporal 
( is 


34.6 


Control 3 Parietal 
Ctx 


16.5 


v^onLroi j i ernporai 
Ctx 


10.2 


Control (Path) 1 
Parietal Ctx 


67.4 


Control 4 Temporal 
Ctx 


7 1 IControl (Path) 2 
iParictal Ctx 


18.7 


Control (Path) 1 
Temporal Ctx 


41.5 


Control (Path) 3 
Parietal Ctx 


3.3 


Control (Path) 2 
Temporal Ctx 


29.5 


Control (Path) 4 
Parietal Ct\ 


34.4 



Table JC . General_screening__panel_vl.4 



Tissue Name 

L. 


Rel. Exp.(%) 
Ag3380, Run 
217043276 


Tissue Name 


Rel. Exp.(%) 
Ag3380, Run 
217043276 


Adipose 


9.0 


Renal ca. TK- 10 


25.5 


Melanoma* 
Hs688(A).T 


18.9 


Bladder 


15.9 


Melanoma* 
Hs688(B).T 


16.8 


Gastric ca. (liver met.) 
NCI-N87 


52.5 


Melanoma* M14 


14.9 


( iastric ca. KATO III 


34.6 


Melanoma* 
LOXTMV] 


21.6 


Colon ca. SW-948 


4.9 
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Melanoma* SK- 
MEL-5 


27.0 


Colon ca. SW480 


82.4 


Squamous cell 
carcinoma SCC-4 


28.7 


< :olon ca.* (SW480 
met) SW620 


20.6 


Testis 1 'oo I 


5.1 


|Colon ca. HT29 


9.2 


Prostate ca.* (bone 
met) PC-3 


59.9 


Colon ca. HCT-116 


20.6 


Prostate Pool 


8.6 


Colon ca. CaCo-2 


22.8 


Placenta 


3.9 


Colon cancer tissue 




Uterus Pool 


1.9 


Colon ca. SW1116 


6.2 


Ovarian ca. 
OVCAR-3 


32.5 


Colon ca. Colo-205 


4.9 


Ovarian ca. SK- 
OV-3 


57.4 


Colon ca. SW-48 


4.2 


Ovarian ca. 
OVCAR-4 


14.7 


Colon Pool 


11.4 


Ovarian ca 
OVCAR-5 


59.5 


Small Intestine Pool 


9.8 


IlGROV-1 


13.1 


Stomach Pool 


7.4 


(Ovarian ca. 
jOVCAR-8 


19.2 


Bone Marrow Pool 


4.2 


Ovary 


5.9 


Fetal Heart 


6.3 


(Breast ca. MCF-7 


35.1 


Heart Pool 


4.9 


jBreast ca. MDA- 
MB-231 


58.2 


Lymph Node Pool 


11.4 


[Breast ca. BT 549 


26.8 


Fetal Skeletal Muscle 


3.3 


^e^ca.T47D 


100.0 


Mvti.'Sdi MUv,k' Pin 1 


8.1 


Breast ca. MDA-N 


8 7 


Spleen Pool 


5.6 


Breasl 1 i>, .| 


10.4 


Thymus Pool 


6.3 


Trachea 


5.5 


CNS cancer 
(glio/astro) U87-MG 


39.2 


Lung 


3.8 


CNS cancer 
(glio/astro) U-118-MG 


54.7 


Fetal Lung 


11.8 


CNS cancer 
(neuro;met) ^K-N-AS 


19.6 


Lung ca. NCI-N417 


3.2 


CNS cancer (i-.tr,.) SF- 
539 


12.2 


Lungca. LX-1 


20.7 


CNS cancer (astro) 
SNB-75 


29.7 


Lung ca.NCI-H146 


3.8 


CNS cancer (glio) 
SNB-IO 


13.4 


Lung ca. SHP-77 


17.9 


CNS cancer (glio) SF- , 
295 


28.9 


Lung ca. A549 


30.6 


Bum ( Kim gdala) Pool 


11.8 
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(Lung ca. NCI-H526 


3.6 Brain (cerebellum) 


6.0 


Lung ca. NCI-H23 


29.3 [Brain (fetal) 


8.4 


r 

jLung ca. NCI-H460 


j 4 g (Brain (Hippocampus) 
(Pool 


14.5 


(Lung ca. HOP-62 


19.5 Cerebral Cortex Pool 


16.2 


Lung ca. NCI-H522 


2g j jBrain (Substantia 
jnigra) Pool 


16.0 


iLiver 


0.4 jBrain (Thalamus) Pool 




22.7 


Fetal Liver 


11.9 


Brain (whole) 


5.9 


(Liver ca. HepG2 


12.9 


Spinal Cord Pool 


16.0 


pCidney Pool 
Feiai Kidne\ 
[Renal ca 786-0 
jRenal ca. A498 


18.4 
"~ "22.8 
28.5 

J:° 


Adrenal Gland 
Pituitai} gland Pool 
Salivary Gland 
Thyroid (female) 


{ 

3.8 

2 A 

8.2 


iRenal ca. ACHN 


?? A jPancreatic ca. 

JCAPAN2 


51.4 


JRenal ca. UO-31 


36.9 [Pancreas Pool 


12.3 



Table JD . Panel 4D 



Tissue Name 


Rel. Exp.(%) 
Ag3380, Run 
165296532 


Tissue Name 


Rel. Exp.(%) 
Ag3380, Run 
165296532 


Secondary Hi 1 act 


13.1 


HUVEC IL-lbeta 


15.0 


Secondary Th2 act 


14.6 


HUVEC 1FN gamma 


19.6 


Secondary Trl act 


15.2 


HUVEC TNF alpha + 
(FN gamma 


28.3 


Secondary Thl rest 


4.6 


HUVEC TNF alpha + 
1L4 


26.1 


Secondary Th2 rest 


4.7 


HUVEC II - 1 1 


7.8 


Secondary Trl rest 


8.0 


Lung Microvascular EC 
none 


25.5 


Primary Thl act 


14.9 


Lung Microvascular EC 
TNFalpha + IL-lbeta 


19.5 


Primary Th2 act 

™™~. 


13.2 


Microvascular Dermal 
EC none 


37.9 


Primary Trl act 


20.7 


Microsvasular Dermal 
l-:C TM alpha ! - IL-lbeta 


24.8 


Primary Thl rest 


35.6 


Bronchial epithelium 
TNI alpha f ILlbeta 


37.1 


Primary Th2 rest 


24.0 


Small airway epithelium 
none 


15.0 


Primary Trl rest 

, 


16.2 


Small airway epithelium 
TNFalpha + IL-lbeta 


100.0 
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CD45RA CD4 
lymphocyte act 


23.3 jcoronery artery SMC rest 


30.1 


CD45RO CD4 
lymphocyte act 


-. „ j Icoronery artery SMC 
jTNFalpha + IL-lbeta 




CDS lymphocyte act 


7.4 (Astrocytes rest 


22.5 


occuiiuary i^ivo 
lymphocyte rest 


1 3 A lAstrocytes TNFalpha + 
IIL-lbeta 


21.2 


Secondary CDS 
lymphocyte act 


4.4 KU-812 (Basophil) rest 


17.9 


CD4 lymphocyte none 


g q KU-812 (Basophil) 


68.3 


2.\ Ihl. Ih2 Tri anti- 
CD95 CH11 


10? |CCD1106 

: \ (Keratinocytes) none 


22.1 


LAK cells rest 


IPPDI 1 Cif\ 

13.5 j (Keratinocytes) 

TNFalpha + IL-lbcla 


9.2 


LAK cells IL-2 


12.9 JLiver cirrhosis 


3 1 


LAK cells IL-2+IL-12 


13.2 .Lupus kidney 




2.9 


LAK cells IL-2+IFN 
gamma 


15.6 JNCI-H292 none 


48.6 


LAK cells IL-2+IL-18 


17.0 NC1-H292 IL-4 

■— 


66.9 


LAK cells 
PMA/ionomycin 


9.5 NCI-H292 IL-9 


59.5 


NK Cells IL-2 rest 


7.0 NCI-H292 IL-13 


36.6 


Two Way MLR 3 day 


15.2 ;NCI-H292 I FN gamma 


42.6 


Two Wa) MLR 5 da) 


7.0 HPAEC none 


14.3 


Two Way MLR 7 day 


q HPAEC TNF alpha + IL- 
II beta 


25.9 


PBMC rest 


(>A I ung fibroblast none 


12.5 


PBMC PWM 


60.7 


Lung fibroblast TNF 
alpha + IL-1 beta 


11.0 


PBMC PHA-L 


18.8 


Lung fibroblast IL-4 


25.9 


Ramos B cells none 


31.9 


Lung fibroblast IL-9 


20.6 


xvamos \t> ceil ) 
ionomycin 


94.0 Lung fibroblast IL-1 3 


18.8 | 


B lymphocytes PWM 


^2 9 I . ung fibroblast IFN 
gamma 


23.3 


B lymphocytes CD40L 
j nd 11-4 


~. 7 Dermal fibroblast 
CCD 1 070 rest 


59.5 


EOL-1 dbcAMP 


. ? c , (Dermal fibroblast 

CCD 1070 FNF alpha 


64.2 


LOL-I dhc -WlP 
PMA/ionomycin 


, n . jDermal fibroblast 
( CD1 070 IL-1 beta 


32.8 


Dendritic cells none 


)0 g jDermal fibroblast IFN 
gamma 


10.7 
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Dendritic cells 1 PS 


!(» ~ 


iDemial fibroblast 1L-4 


21.6 


Dendritic cells anti- 
CD40 


18.8 jlBD Colitis 2 


2 0 


Monocytes rest 


15.0 


|IBD Crohn's 


— — 


Monocytes LPS 


13.8 


|Colon 


36.9 


Macrophages rest 


25.3 


(Lung 


1 9. $ 


Macrophages LPS 


8.1 Thymus 


72.2 


HUVEC none 


1 9.9 


kidney 


24.5 


IIUVIX' sta.xed 


X* .^.^..^^..^ _ „ 





CNS_neurodegeneration_vl.O Summary: Ag3380 This panel does not show differential 
expression of the CG58590-01 gene in Alzheimer's disease. However, this expression 
profile confirms the presence of this gene in the brain. Please see Panel 1 .3D for discussion 
of utility of this gene in the central nervous system. 



Genera l screening panel vl. 4 Summary: Ag3380 - This gene is expressed at low to 
moderate levels in all samples on this pattern. The highest level of expression is seen in 
breast cancer cell line T47D (CT=27.8). Based on expression in this panel, this gene may 
be involved in brain, colon, renal, lung, ovarian and prostate cancer as well as melanomas. 
Thus, expression of this gene could be used as a diagnostic marker for the presence of these 
cancers. Furthermore, therapeutic inhibition using antibodies or small molecule drugs 
might be of use in the treatment of these cancers. 

This gene product is also expressed in adipose, pancreas, adrenal, thyroid, pituitary, 
skeletal muscle, heart, and fetal liver. This widespread expression in tissues with metabolic 
function suggests that this gene product may be important for the pathogenesis, diagnosis, 
15 and/or treatment of metabolic and endocrine diseases, including obesity and Types 1 and 2 
diabetes. Furthermore, this gene is more highly expressed in fetal (CT=30.9) liver when 
compared to expression in the adult (CT>35) and may be useful for the differentiation of 
the fetal and adult sources of this tissue. 

In addition, this gene is expressed at moderate levels in the all regions of the CNS 
20 examined. Therefore, this gene may play a role in central nervous system disorders such as 
Alzheimer's disease, Parkinson's disease, epilepsy, multiple sclerosis, schizophrenia and 
depression. 
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Panel 4D Summary: Ag3380 - This gene is expressed from moderate to low levels across 
all of the samples on this panel. The highest expression is seen in small airway epithelium 
treated with TNFalpha and IL-lbeta (CT=28.7). Interestingly, expression is much lower in 
untreated small airway epithelium (CT=31.5). There is also a significant difference 
between mononuclear cells treated with PWM (CT=29.5) and untreated cells (CT=32.7). 
Therefore, expression of this gene can be used to differentiate treated and untreated 
samples. 

Expression of this gene is detected at a moderate level (CT=30.2) in normal colon 
(similar levels for colon are seen on panel 1.4 (CT=30.9), but is significantly lower in the 
IBD Colitis 2 (CT=34.4) and IBD Crohn's (CT=33.5) samples. Therefore, therapies 
designed with the protein encoded for by this gene may potentially modulate colon 
function and play a role in the identification and treatment of inflammatory or autoimmune 
diseases which effect the colon including Crohn's disease and ulcerative colitis. 

K. CG58572-01 and CG58572-02: GLUCOSAMINE-PHOSPHATE N- 
ACETYLTRANSFERASE 

Expression of gene CG58572-01 and full length clone CG58572-02 was assessed 
using the primer-probe set Ag3375, described in Table KA. Results of the RTQ-PCR runs 
are shown in Tables KB, KC and KD. 



Table K A. Probe Name Ag3375 



r~ 

jPrimers 


Sequences 


Length] 


Start 
Position 


SEQID 

NO: 


I'orw ;ird 


-aag agt ] .."'.jr: g a 


22 


58 




probe 


TFT < icattttctc ' ttcc 
TAMRA 


26 


86 


' 400 


[Reverse 


5 ' -agcagtacaaagaggcctcaa-3 ' 


21 ] 


135 


401 



Table KB . CNS neurodegeneration vl.O 



| Tissue Name 


Rel. Exp.(%) A«3375. 
Run 210154239 


Tissue Name 


RtL I \|>.(%) Ag3375, 
Run 21015423') 


AD 1 Hippo 


17.1 


Control (Path) 3 
Temporal Ctx 


4.8 


AD 2 Hippo 


19.3 


Control (Path) 4 
Temporal Ctx 


27.5 


AD 3 Hippo 


7.4 


AD 1 Occipital Ctx 


11.5 
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AD 4 Hippo 




4.5 


\l) 2 Occipital < I > 

(Missing) 


0.0 


|AD 5 Hippo 




72.2 


AD 3 Occipital Ctx 


5.9 


|AD 6 Hippo 


53.6 


AD 4 Occipital Ctx 


12.7 


jControl 2 Hippo 


20.3 


t\\J J ^CCipildi V-,IX 




[Control 4 Hippo 


6.8 


■\D6 ( >ccipital Ctx 


19.8 


Control (Path) 3 
Hippo 


5.5 


Control 1 Occipital 
Ctx 


3.2 


AD 1 Temporal 
Ctx 


11.6 


Control 2 Occipital 
Ctx 


36.1 


AD 2 Temporal 
Ctx 


23.8 


Control 3 Occipital 
Ctx 


7.4 


AD 3 Temporal 
Ctx 




Control 4 Occipital 
Ctx 


4.1 


AD 4 Temporal 
Ctx 


16 5 


( 'ontrol (Path) 1 
Occipital Ctx 


66.0 


AD 5 Inf Temporal 
Ctx 




Control (Path) 2 
Occipital Ctx 


8.2 


AD 5 Sup 
Temporal Ctx 


55 - 9 


Control (Path) 3 
Occipital Ctx 


1.9 


AD 6 Inf Temporal 
Ctx 


37.9 


Control (Path) 4 
Occipital Ctx 


12.2 


AD 6 Sup 
Temporal Ctx 


59.5 


Control 1 Parietal 
Ctx 


2.4 


Control 1 
Temporal Ctx 


3.5 


Control 2 Parietal 
Ctx 


31.6 


Control 2 
Temporal Ctx 


25.3 


Control 3 Parietal 
Ctx 


11.7 


Control 3 
Temporal Ctx 




8.2 


Control (Path) 1 
Parietal Ctx 


49.7 


Control 3 
Temporal Ctx 




4.0 


Control (Path) 2 
Parietal Ctx 


15.4 


Control (Path) 1 
Temporal Ctx 


52.9 


Control (Path) 3 
Parietal Ctx 


4.2 


Control (Path) 2 
Temporal Ctx 


26.6 


Control (Path) 4 
Parietal Ctx 


32.5 


Table KC. Panel 1.3D 


Tissue Name 


Rel. Exp.(%) 
Ag3375, Run 
165674233 


Tissue Name 


Rel. Exp.(%) 
Ag3375, Run 
165674233 


Ln ci adenocarcinoma 


51.8 


(Kidney (fetal) 


9.7 


Pancreas 


9.3 


Renal ca. 786-0 


19.6 


Pancreatic ca. C N.PAN 


52.1 


Uv i i! ..i A498 


26.2 
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Adrenal gland 


8.9 


Renal ca. RXF 393 


15.7 


Thyroid 


6.3 


Renal ca. ACHN 


8.2 


Salivary gland 


1 8.3 


Renal ca.UO-31 




Pituitary gland 


15.1 


Ren iil ca. TK-10 


9.8 


Brain (fetal) 


15.5 


Liver 


20.4 


Brain (whole) 


34.6 


r,i ve (fetal) 


16.5 


Brain (amygdala) 


16.0 


Liver ca. 

(hepatomas!) IIcpC".2 


49.0 


Bniin (cerebellum) 


; [ 


Lung 


4 5 


Brain (hippocampus) 


12.1 


1 uiil' (tetal) 


5.4 


Brain (substantia nigra) 


12.8 


Lung ca. (small cell) 
LX-1 


32.3 


Brain (thalamus) 


17.9 


Lung ca. (small cell) 
NCI-H69 


17.3 


Cerebral Cortex 


10.4 


Lung ca. (s.cell var.) 
SHP-77 


30.1 


Spinal cord 


13.3 


Lung ca. (large 
cell)NCI-H460 


66.4 


glio/astro U87-MG 


14.8 


Lung ca. (non-sm. 
cell) A549 


19.1 


glio/astroU-118-MG 


95.3 


Lung ca. (non-s.cell) 
NCI-H23 


13.8 


Astrocytoma SW1783 


4 ~ n jLung ca. (non-s.cell) 
lHOP-62 




18.7 


neuro*;metSK-N-AS 47.0 i^^ 0 "" 5 ' 00 


19.5 


Astrocytoma SF-539 


, , . jLungca. (squam.) 
SW 900 


9.9 


Astrocytoma SNB-75 


15.6 


Lung ca. (squam.) 
NCI-H596 


19.6 


glioma SNB-19 11.8 


Mammary gland 


14 6 


glioma U251 


40.9 


Breast ca.* (pl.ef) 
MCF-7 


81.2 


glioma SF-295 


10.1 


Breast ca.* (pl.ef) 
MDA-MB-231 


91.4 


Heart (fetal) 


1.3 


Breast ca.* (pl.ef) 
T47D 


35.4 


Heart 


4.7 


Breast ca. BT-549 


97.9 


Skeletal muscle (fetal) 


1.2 


Breast ca. MDA-N 


14.8 


Skeletal muscle 


38.7 


Ovary 


1.6 


Bone marrow 


4.6 


Ovarian ca. 
OVCAR-3 


39.2 


Thymus 


2.7 [Ovarian ca. 


23.0 
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|OVCAR-4 






Spleen 


7.9 


Ovarian ca. 
rw ir* ad ^ 


13.8 




Lymph node 


13.0 


Ovarian ca. 

C\\Tn APS 
\J V LAK-o 


8.5 




Colorectal 


3.3 


Ovarian ca. IGROV- 

L^^^.. m ^ — ,-. 


5.6 




Stomach 


27.7 


Ovarian ca.* 
(ascites) SK-OV-3 


44.8 




Small intestine 




Uterus 


19.5 




Colon ca. SW480 


16.5 


Placenta 






Colon ca.* 

o w OZU^o W toU mclj 


29.1 


Prostate 


15.6 


4j ■ 


Colon ca. HT29 


13.8 


Prostate ca.* (bone 
met)PC-3 


56.6 


ru 


Colon ca.HCT-1 16 


27.7 


Testis 


40.6 


o 


Colon ca. CaCo-2 


17.4 


Melanoma 
Hs688(A).T 


5.5 


Q 
% 


Colon ca. 
tissue(OD03866) 


26.4 


Melanoma* (met) 
Hs688(B).T 


8.9 


y 


Colon ca. HCC-2998 


32.1 


Melanoma UACC- 
62 


17.8 


o 


Gastric ca.* (liver met) 
NCI-N87 


100.0 


Melanoma Ml 4 


27.7 




Bladder- 


28.7 


Melanoma LOX 
IMVI 


6.6 




Trachea 


9.4 


Melanoma* (met) 
SK-MEL-5 


13.0 




Kidney 


9.0 


Adipose 


8.0 [ 



Table KD . Panel 4D 



I ' 

1 Tissue Name 


I Rel. Exp.(%) 
I Ag3375, Run 
165296547 


1 Tissue Name 


Rel. F.\p.(%> 
Ag3375, Run 
165296547 | 


Secondary Thl act 


14.6 


HUVECiL-lbeta" 


24.5 


Secondaiy Th2 act 


13 0 


'HUVEC IFN gamma 


24 5~ 


J Secondary Trl act 


17.3 

" — — 


HUVEC TNF alpha - 
Il-'N gamma 


24.0 


j Secondary Thl rest 


0.9 


~]HUVEC TNF alpha + 
JIL4 




23.2 


jSecondary Th2 rest 


1.5 


IHUVEC 11.-11 


12.1 


Secondary Trl rest 




jLung Microvascular EC 
iiione 


21.3 | 


(Primary Thl act 


16.U 


Lung Microvascular EC 


24.1 
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j ...... . 





TNFalpha i II -Ibeta 


Primary Th2 act 


12.1 


Microvascular Dermal 
EC none 


27.4 


Primary Trl act 


25.0 


Microsvasular Dermal 
EC TNFalpha + IL-lbeta 


24.0 


Primary Thl rest 


10.4 


Bronchial epithelium 
TNFalpha IL lbeta 


20.3 


Primary Th2 rest 


6.1 


Small airway epithelium 
none 


11.3 


[Primary Trl rest 


q n (Small airway epithelium 
TNFalpha + IL-lbeta 


54.0 


CD45RA CD4 


14.6 iCoronery artery SMC rest 23.5 


CD45RO CD4 


. , ICoronery artery SMC I ^ _ . 

[TNFalpha + IL-lbeta | 12,0 


CDS lymphocyte act 


14.2 Astrocvles rest 5.3 


Secondary CD 8 
lymphocyte rest 


1 d A (Astrocytes TNFalpha + 
)lL-lbeta 


5.4 


Secondary CD8 
lymphocyte act 


5.8 KU-812 (Basophil) rest 


19.5 


CD4 lymphocyte none 


04 KU-812 (Basophil) 
jPMA/ionomycin 


56.3 


2ry Thl/Th2/Trl_anti- 
rr>Q^ phi i 


26 ICCD1106 

( Kera l i noc> tes ) none 


26.6 


LAK cells rest 


5.1 


CCD 1106 
(Keratinocytes) 
TNFalpha + IL-lbeta 


7.8 


LAK cells IL-2 


10.7 


Liver cirrhosis 


2.6 


LAK cells IL-2+IL- 12 


12.5 


Lupus kidney 


0.8 


LAK cells IL-2+IFN 
gamma 


20.2 jNCI-H292 none 


28.7 


LAK cells IL-2+ IL-18 


1 6.6 


NCI-H292 IL-4 


54.7 


LAK cells 
PMA/ionomycin 


12.5 


NCI-H292 IL-9 


45.7 


NK Cells IL-2 rest 

,„^..„^,^, m . — y 


7.1 


NCI-H292IL-13 


24.3 


Two Way MLR 3 day 


6.8 


NCI-H292 I FN gamma 


33.2 


l\vo VVaj MLR 5 da\ 


8.9 


HPAEC none 


17.8 


two Way MjLK / day 


6.0 


HPAE< 1 NF alpha - II - 
1 beta 


30.1 


PBMC rest 


0.8 I ung fibroblast none 


10.2 


PBMC PWM 


4 2 3 iLung fibroblast TNF 
jalpha + IL-1 beta 


6.3 


PBMC PHA-L 


11.6 Lung fibroblast IL-4 


27.2 


Ramos (B cell) none 


30.6 Lung fibroblast IL-9 


26.8 


Ramos (B cell) 


100.0 Lung fibroblast 1L- 13 


21 8 



536 



ionomycin 






B lymphocytes PWM 


77 _j Lung fibroblast IFN 
gamma 


29.5 


B lymphocytes CD40L 
and IL-4 


j 2 j (Dermal fibroblast 
jCCDl 070 rest 


42.3 


EOL-1 dbcAMP 


13.0 


Dermal fibroblast 
CCD1070TNF alpha 


51.4 


PAT 1 JU^ATVjTD 

PMA/ionomycin 


6.9 


Dermal fibroblast 
C( DI070 [L-l beta 


22.5 


Dendritic cells none 


4.5 


Dennal fibroblast IFN 
gamma 


111 


Dendritic cells LPS 


3.8 Dermal fibroblast IL-4 


19.5 


Dendritic cells anti- 
CD40 


2.9 IIBD Colitis 2 


0.7 


Mom >cytes rest 


2.2 IBD Crohn's 


0.9 


Monocytes LPS 


1.3 |Colon 


7.6 


Macrophages rest 
Macrophages LI'S 


6.6 Lung 

2.7 (Thymus 


6.2 

9.4 


HUVECnone 17.4 Kidney 


4.2 1 


HUVEC starved 37.4 





CNSneurodegenerationvl.O Summary: Ag3375 This panel does not show differential 
expression of the CG58572-01 gene in Alzheimer's disease. However, this expression 
profile confirms the presence of this gene in the brain. Please see Panel 1 .3D for discussion 
of utility of this gene in the central nervous system. 



Panel 1.3D Summary: Ag3375 - This gene is expressed at moderate to low levels in all 
samples on this panel, with the highest expression in gastric cancer cell line NCI-N87 
(CT=28.8). Based on expression in this panel, this gene may be involved in gastric, 
pancreatic, brain, colon, renal, lung, breast, ovarian and prostate cancer as well as 
melanomas. Thus, expression of this gene could be used as a diagnostic marker for the 
presence of these cancers. Furthermore, therapeutic modulation of the expression or 
function of this gene might be of use in the treatment of these cancers. 

This gene product is also expressed in adipose, pancreas, adrenal, thyroid, pituitary, 
skeletal muscle, heart, and liver. This widespread expression in tissues with metabolic 
function suggests that this gene product may be important for the pathogenesis, diagnosis, 
and/or treatment of metabolic and endocrine diseases, including obesity and Types 1 and 2 
diabetes. 
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In addition, this gene is expressed at moderate levels in the CNS. Therefore, this 
gene may play a role in central nervous system disorders such as Alzheimer's disease, 
Parkinson's disease, epilepsy, multiple sclerosis, schizophrenia and depression. 

Panel 4D Summary: Ag3375 The CG58572-01 gene is ubiquitously expressed on this 
5 panel, with highest expression in the B cell line Ramos treated with ionomycin (CT=26.2). 
Significant levels of expression are also seen in pokeweed mitogen-activated B 
lymphocytes. Therefore, therapies that antagonize the function of this gene product may be 
useful as therapeutic drugs to reduce or eliminate the symptoms in patients with 
autoimmune and inflammatory diseases in which B cells play a part in the initiation or 
10 progression of the disease process, such as lupus erythematosus, Crohn's disease, ulcerative 
colitis, multiple sclerosis, chronic obstructive pulmonary disease, asthma, emphysema, 
rheumatoid arthritis, or psoriasis. 

Interestingly, there is a difference between the levels of expression in resting and 
activated secondary T cells. The level in activated secondary T cells (CT=28.7-29.2) 
1 5 appears to be higher than in resting T cells (CT=3 1 .3-33.1). Therefore, therapeutics 

designed with the protein encoded by this transcript could be important in the regulation of 
T cell function. 

L. CG58564-01 and CG58564-02: PROTEIN TYROSINE PHOSPHATASE - 

Expression of gene CG58564-01 and full length clone CG58564-02 was assessed 
20 using the primer-probe sets Ag3023 and Ag3373, described in Tables LA and LB. Results 
of the RTQ-PCR runs are shown in Tables LC, LD, LE and LF. 



Table LA . Probe Name Ag3023 



Primers 


Sequences 


Length 


Start 
Position 


seqid" 

NO: 


Forward 

|— 
Probe 


5 ' -ctaatgctggatttgtccatca-3 ' 


22 


492 


4(i2 

" 403 ~~ 


TET-5 ' -tcaggaatatgaagccatctacctagca- 
3 ' -TAMRA 


28 


517 


Reverse 


5 > - t jt tga tcatc jta-3 ' 


22 


555 


404 


Table LB. Probe Name Ag3373 


Primers 


Sequences 


Length 


Start 
Position 


si v n> 

NO: 
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(Forward 


5 1 -atttgtccatcaacttcaggaa-3 ' 


22 


"502" 


ins 


j Probe 


TET-5 1 -tgaagccatctacctagcaaaattaaca- 
3 1 -TAMRA 


28 


526 


"~ 406 


(Reverse 


5 ' -tggagtggtgacatcatctgta-3 ' 


22 


555 


407 



Table LC . CNS_neurodegeneration_vl.O 



Tissue Name 


Rel. Exp.(%) 
Ag3023, Run 

ZVyoZ III it 


Rel. Exp.(%) 
Ag3373, Run 

.1 ! ;l 1 r>4\) 1 1 


Tissue 
Name 


Rel. Exp.(%) 
Ag3023, Run 
209821074 


Rel. Exp.(%) 
Ag3373, Run 
210154071 


AD 1 Hippo 

- ■ 


10.9 


16.8 


Control 
(Path) 3 
Temporal 
Ctx 


9.1 


8.0 


AD 2 Hippo 


34.2 


37.6 


Control 
(Path) 4 
Temporal 
Ctx 


40.6 


65.5 


AD 3 Hippo 


12.0 


15.8 


AD 1 

Occipital 

Ctx 


24.7 


29.1 


AD 4 Hippo 


13.8 


10.3 


AD 2 

Occipital 

Ctx 

(Missing) 




0.0 


0.0 


AD 5 hi ppo 


60.7 


57.8 


AD 3 

Occipital 

Ctx 


.4.7 


15.0 


AD 6 Hippo 


80.7 


72.2 


AD 4 

Occipital 

Ctx 


35.4 


22.4 


Control 2 
Hippo 


35.8 


38.4 


AD 5 

Occipital 

Ctx 


3.9 


30.4 


Control 4 
Hippo 


16.5 


11.7 


AD 6 

Occipital 

Ctx 


46.0 


37.4 


Control (Path) ; 
3 Hippo 


13.1 


15.4 


Control 1 
Occipital 
Ctx 


9.9 


10.7 


AD 1 Temporal 
Ctx 


39.0 


31.4 


Control 2 
Occipital 
Ctx 


39.0 


38.4 


AD 2 Temporal 
Ctx 


38.7 


73.2 


Control 3 
Occipital 
Ctx 


i 

23.0 


20.6 


AD 3 Temporal 


9.5 


13.2 


Control 4 ! 


13.3 


13.3 
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Ctx 






Occipital 

Ctx 
















Control 








AD 4 Temporal 

Ctx 


27.9 


34.9 




(Path) 1 

Occipital 

Ctx 


80.1 


76.3 












Control 








AD 5 Inf 
Temporal Ctx 


59.0 


100.0 




(Path) 2 
Occipital 
Ctx 


17.3 


20.0 




AD 5 

Sup Temporal 
Ctx 


33.2 


44 1 


( lontrol 
(Path) 3 
(Occipital 

. l Ctx _ 


8 4 


8.7 


6 


AD 6 Inf 
Tempor al Ctx 


100.0 


73.2 


Control 
(Path) 4 
Occipital 

ICtx 


21.2 


20.6 


□ 


AD 6 Sup 
Temporal Ctx 


79.6 


80.1 


Control 1 
F.uictal tlx 


12.1 


16.3 


Q 


Control 1 
Temporal Ctx 


10.2 


13.7 


Control 2 
Parietal C i\ 


48.0 


40.9 


B 


Control 2 
Temporal Ctx 


41.2 




jControl 3 
Parietal 1 tx 


17.9 


16.3 


ru 


Control 3 
Tempor al Ctx 


20.3 


20.0 


Control 
(Path) 1 
(Parietal Ctx 


74.7 


64.2 

___J 




Control 4 
Temporal Ctx 


9.7 


9.9 


Control 
(Path) 2 
(Parietal Ctx 


28.9 


59.9 1 




Control (Path) 
1 Temporal Ctx 


59.9 


68.3 




Control 
(Path) 3 
Parietal Ctx 


10.2 


9.0 




Control (Path) 
2 Temporal Ctx 


40.3 


41.2 


Control 
(Path) 4 
(Parietal Ctx 


44.8 


43.8 



Table LP . General_screeningjpanel_vl.4 



Tissue Name 


Rel. Exp.(%) 
Ag3373, Run 
217043119 


Tissue Name 


Rel. Exp.(%) 
Ag3373, Run 
217043119 


Adipose 


12.0 


Renal ca.TK-10 


20.3 


Melanoma* 
Hs688(A).l 


r — 

30.8 


Bladder 


23.2 


Melanoma* 
Hs688(3).T 


69.3 


Gastric ca. (liver met.) 
NCI-N87 


25.3 
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IVlclanonia* MQ4 


15.0 Gastric ca. KATO III 


30 8 


Melanoma* 

T OYTMVT 


26.6 


Colon ca. SW-948 


9.7 


lVIClallLUIla OXV 

MEL-5 


21.5 


Colon ca. SW480 


35.1 


Squamous cell 
carcinoma SCC-4 


33.0 


Colon ca.* (SW480 
men SW620 


13.9 


Testis Pool 


19.8 Colon ca. H I 29 


8.5 


Prostate ca.* (bone 
met) PC-3 


100.0 Colon ca.HCT-1 16 


36.9 

- • 


Prostate Pool 


9.2 jColon ca. CaCo-2 


42.9 


Placenta 


3.8 jColon cancer tissue 


9.0 


Uterus Pool 


7.4 i • i s\\ J H6 


5.8 


Ovarian ca. 
OVCAR-3 


28.5 jColon ca. Colo-205 


4.3 


Ovarian ca. SK- 
OV-3 


40.3 


Colon ca. SW-48 


4.2 


Ovarian ca. 

\J V <^AK-4 


20.0 


Colon Pool 


20.7 


Ovarian ca. 
OVCAR-5 


35.1 


Small Intestine Pool 


_ „,,. ™ .... „ 

12.2 


Ovarian ca. 
IGROV-1 


10.9 


Stomach Pool 


9.9 


Ovarian ca. 
OVCAR-8 


9.2 


Bone Marrow Pool 


11.6 


Ovary 


9.7 


Fetal Heart 


20.7 


Breast ca. MCF-7 


37.6 


Heart Pool 


10.6 


Breast ca. MDA- 
MB-231 


37.1 


Lymph Node Pool 


17.9 


Breast ca. BT 549 


62 4 


Fetal Skeletal Muscle 


12.3 


Breast ca. T47D 


61.1 


Skeletal Muscle Pool 


16.0 


Breast ca. MDA-N 


10.0 


Spleen Pool 


1 1.6 


BiC t,t Pnoi 


17.3 


Thymus Pool 


12.2 


Trachea. 


12.0 


CNS cancer 

(glio astro) 1187-NK 


29.1 


Lung 


6.7 


CNS cancer 
(clio-astro) U-118-MG 


69.3 


Fetal Lung 


34.2 


CNS cancer 
(neuro;met) Sk-K-AS 


34.9 


Lung ca.NCI-N417 


5.4 


CNS cancer (astro) SF- 
539 


19.1 


Lung ca. LX-1 


17.2 


CNS cancer (astro) 
SNB-75 


35.8 


Lung ca. NCI-H146 


3.0 


CNS cancer (glio) 
SNB-19 


11.3 
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Lung ca. SHP-77 


„ , CNS cancer fglio) SF- 
295 


26.4 


Lung ca. A549 


29. 1 Brain (Amygdala) Pool 


4.5 


Lung ca. NCI-H526 


4.6 


Brain (cerebellum) 


8.1 


Lung ca. NCI-H23 


31.6 


Brain (fetal) 


13.2 


Lung ca. NCI-H460 


18.2 


Brain (Hippocampus) 
Pool 


5.3 


Lung ca. HOP-62 


14.1 


Cerebral Cortex Pool 


5.4 


.Lung ca. JNv^i-riDzz 


31.6 


Brain (Substantia 
nigra) Pool 


4.8 


Liver 


1.2 


Brain (Thalamus) Pool 


8.0 


Fetal Liver 


32.3 


Brain (whole) 


6.2 


Liver ca HepG2 


14.6 


Spina! t ord Pool 


6.6 


Kidney Pool 


22.1 Adrenal Gland 


8.1 


Fetal Kidnev 


26. 1 -Pituitary gland Pool 


i 0 


Renal ca. 786-0 


28.7 Mliwu ( dand 


4.7 


Renal ca. A498 


11.3 rhyroid (female) 


4.4 


Renal ca. ACHN 


, ~ 9 JPancreatic ca. 

[CAPAN2 


17.3 


Renal c,t. UO-31 


24. 1 Pancreas Pool 


17.1 



Table LE . Panel 1.3D 



Tissue Name 

L _ „ 


Rei.£xp.(%) T 
Ag3023, Run Tissue Name 
167966931 | 


Rel. Exp.(%) 
Ag3023, Run 
167966931 


Liv( adenocarcinoma 


^51.1 [Kidney (fetal) 


26.2 


Pancreas 


6.1 iRenal ca. 786-0 


34.2 


Pancreatic ca. CAPAN 
2 


17.7 


Renal ca. A498 


17.6 


Adrenal gland 


3.8 


Renal ca. RXF 393 


17.2 


Thyroid 


3.0 


Renal ca. ACHN 


13.5 


Salivary gland 


3.9 


Renal ca. UO-3 1 


0.0 


Pituitary gland 


3.6 jRenal ca. TK- 10 


23.0 


B.-Mii (letal) 


8.1 |Liver 


11.7 


Brain (whole) 


8.5 (Liver (fetal) 


8.0 


Brain (amygdala) 


6.7 


Liver ca. 

(hepatoblast) HepG2 


26.2 


Brain (cerebellum) 


15.2 


Lung 


3.1 


Brain (1 ipp< >_ t ,mpu- i 


5.4 


Lung (fetal) 


11.0 


Brain (substantia nigra) 


9.0 


Lung ca. (small cell) 
LX-1 


12.9 


Bram (thalamus) 


4.2 


Lung ca. (small cell) 


9.9 
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j - 


(NC1-H60 




Cerebral Cortex 


2 n 1 HUd ca < cell vjr.) 
JSHP-77 


67.8 


Spinal cord 


6 q iLung ca. (large 
|cell)NCI-H460 


3.4 


glio/astro U87-MG 


28.5 


Lung ca. (non-sm. 
cell) A549 


45.1 


glio/astroU-118-MG 

i 


46.7 


Lung ca. (non-s.cell) 
NCI-H23 


22.7 


astrocytoma SW1783 


40.6 


Lung ca. (non-s.cell) 
HOP-62 


25.7 


jneuro*; met SK-N-AS 


27.2 


Lung ca. (non-s.cl) 
NCI-H522 


38.2 


i astrocytoma SF-539 


29.7 


Lung ca. (squam.) 
SW 900 


27.4 


astrocytoma SNB-75 


~ 5 1 (Lung ca. (squam.) 
NCI-II596 


29.9 


(glioma SNB-19 


15.6 


Mammai} gland j 5.1 


glioma U251 


37.9 


Breast ca.* (pl.ef) 
MCF-7 


47.0 


glioma SF-295 


18.4 


Breast ca.* (pl.ef) 
MDA-MB-231 


22.7 


Heart (fetal) 


2.9 


Breast ca.* (pl.ef) 
T47D 


86.5 


(Heart 


12.9 


Bicasi ca. BT-5 W 


15.9 


(Skeletal muscle (fetal) 


3.4 


Breast ca. MDA-N 


10.4 


Skeletal muscle 


36.3 


Ovary 


_ 2.9 


Bone marrow 


4.5 


Ovarian ca. 

OyCAR-3 26 1 


Thymus 


14.3 


Ovarian ca. 

OVCAR-4 ib - 5 


Spleen 


o 7 (Ovarian ca. 

IOVCAR-5 


83.5 


Lymph node 


. * £ lOvarian ca. 

»OVCAR-8 


9.3 


Colorectal 


1Q 4 jOvarian ca. IGROV-J u Q 


Stomach 


7 g | Ovarian ca.* 

[(ascites) SK-OV-3 


100.0 


Small intestine 


5.1 Uterus 


4.9 


( olon ca. SW480 


19.3 


Placenta 


1.3 


Colon ca.* 
SW620(SW480 met) 


42.9 


Prostate 


3.9 


Colon ca. HT29 


9.9 


Prostate ca.* (bone 
met)PC-3 


78.5 
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Colon ca. HCT-116 




26.2 


Testis 






Colon ca. CaCo-2 


41.5 


jMelanoma 
Hs688(A).T 




5.9 


Colon ca. 
tissue(OD03866) 


6.3 


Melanoma* (met) 
Hs688(B).T 


14.2 


Colon ca. HCC-2998 


16.0 


Melanoma UACC- 
62 


14.0 


Gastric ca.* (liver met) 
NCI-N87 


18.8 


Melanoma Ml 4 


5.7 


Bladder 


30.6 


Melanoma LOX 
IMVI 


8.8 


Trachea 




3.2 


jMelanoma* (met) 
SK-MEL-5 


14.7 


Kidney 






9.6 


JAdipose 




18.9 


Table LF. Panel 4D 

— ■ - 
















Tissue Name 


Rel. 
Exp.(%) 
Ag3023, 

Run 
164516146 


Rel. 
Exp.(%) 
Ag3373, 
Run 
165296617 


Tissue Name 


Rel. 
Exp.(%) 
Ag3023, 
Run 
164516146 


Rel. 
Exp.(%) 
Ag3373, 
Run 
165296617 


Secondary Thl act 


IS (> 


17.9 ~ I 


[1UVTC IL-lbeta 


20.3 


18.6" J 


Secondary Th2 act 


24.3 


28.5 


HUVEC IFN 
gamma 


25.3 


22.7 i 

i 


Secondary Trl act 


22.8 


HUVECTlNl 
21.8 (alpha + IFN 
Igamma 


~"™~- — — — 
16.3 


18.0 


Secondary Thl rest 


7.5 


6.8 


HUVEC TNF 
alpha + IL4 


18.2 


13.4 


Secondary Th2 rest 


11.6 


9.5 


HUVEC IL-11 


13.7 


9.9 


Secondary Trl rest 


12.1 


10.7 


Lung 

Microvascular EC 
none 


25.7 


21.6 


Primary Thl act 


20.7 


■ 65 


Lung 

Microvascular EC; 
TNFalpha + IL- 
lbeta 


26.2 


18.3 I 


Primary Th2 act 


20.2 


Microvascular 
Dermal EC none 


27.5 


21.3 


Primary Trl act 


23.3 


| 

27.7 


Vlicrosvasular 
Dermal EC 
TNFalpha + IL- 
1 bei.i 


20.7 


19.9 


Primary Thl rest 


51.1 


51.4 


Bronchial 
epithelium 


13.0 


16.3 
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jTNFalpha + j 
II Ibeta 




Primary Th2 rest 


26.2 


29 5 | Small airway 

(epithelium none 




8.5 


Primary Trl rest 


23.7 


26.1 


Small airway 
epithelium 
TNFalpha + IL- 
lbeta 


50.3 


39.8 


CD45RA CD4 
j lymphocyte act 


14.6 




11.0 


Coronery artery 
SMC rest 


20.2 


18.9 


1CD45RO CD4 
1 lymphocyte act 


25.2 


22.4 


Coronery artery 
SMC TNFalpha + 
IL- lbeta 


12.0 


9.8 


CD8 lymphocyte 
act 


20.4 


15.8 


Astrocytes rest 


10.4 


ii-i i 

i 


[Secondary CD8 
[lymphocyte rest 


16.5 


Astrocytes 
19.9 TNFalpha + IL- 

Jlbeta 


11.7 


9.8 

- - 1 


[Secondary CD8 
[lymphocyte act 


13.2 


93 lKU-812 

j(Basophil) rest 


47.6 


38.2 


CD4 lymphocyte 


17.1 


JKU-812 
11.6 .(Basophil) 

PMA/ionomycin 


94.0 


92.0 j 


|2ry 

Thl/Th2/Trl_anti- 
C i >95 CHI 1 


18.3 


16.6 


CCD 11 06 

(Keratinocytes) 

none 


19.9 


13.2 | 

j 


LAK cells rest 


25.5 


16.0 


rem loo 

(Keratinocytes) 
TNFalpha + IL- 
lbeta 


6.0 


1 

4.8 | 


LAK cells IL-2 


27.2 


22.5 Liver cirrhosis 


3.1 


2.7 j 


|LAK cells IL- 
2+IL-12 


27.2 


19.3 Lupus kidney 

1 


2.1 


1.7 1 

i 


EAKceUs'lL- 


36.3 


34.4 


NC1-H292 none 1 


30.1 


18.9 | 


LAK cells IL-2+ 
IL-18 

±!r_iz— _ „ 


35.1 


29.7 


NCI-H292 11 -1 


33.9 


34.6 


LAK cells 

PM A/ i onomy c i n 




11.0 


NCI-H292 IL-9 


40.1 


29.1 | 


\KCd s !1 -2 rest 


20.0 j 15.0 INC1-H292 11 -13 


162 


14.2 


Two Way MLR 3 
day 


24.0 


,, n lNCl-11292 U N 
lo./ I 1 
gamma 


16.6 


18.4 


Two Way MLR 5 
day 


12.9 


10.1 IHPAEC none 

I....... ! 


13.6 

I 


13.5 


Two Way Ml. U 7 

day | 


11.4 


9 5 jHPAEC TNF j 
alpha + II - 1 beta I 


25.3 


25.3 



PBMC rest 


13.7 1 10.5 | Lun g fibrob,ast 
taone 


11.4 


14.2 


PBMC PWM 


69.3 


Lung fibroblast 
66.4 )TNF alpha + IL-1 
(beta 


6.1 


7.2 


PBMC PHA-L 


22.8 


. _ _ Lung fibroblast 
"■' 11-4 


28.5 


29.1 


Ramos (B cell) 
none 


24.1 


. Q _ -Lung fibroblast 
flL-9 


23.0 


23.3 


Ramos (B cell) 
ionomycin 


100.0 


1 nn n |Lun § fibroblast 
uw.u IL13 


20.6 


18.9 


B lymphocytes 
PWM 


71.7 


^ 2 Lung fibroblast 
ITN gamma 


39.0 


32.5 


B lymphocytes 
CD40L and IL-4 


29.1 


9R 7 'Dermal fibroblast 
JCCD1 070 rest 


33.9 


31.0 


EOL-1 dbcAMP 


12.1 


iDermal fibroblast 
10.5 JCCD1070 TNF 
(alpha 


76.8 


62.0 


EOL-1 dbcAMP 
PMA/ionomycin 

■■ — - - 


14.5 


iDermal fibroblast 
10.9 |CCD1070IL-1 

jbeta 


20.3 


13.9 

— — 


Dendritic cells 


13.2 


14.8 


Dermal fibroblast 
IFN gamma 


14.2 


9.5 


Dendritic cells LPS 


11.7 


8.3 


Dermal fibroblast 
IL-4 


26.4 


20.4 


Dendritic cells 
anti-CD40 


17.7 


12.7 


IBD Colitis 2 


2.6 


2.2 


Monocytes rest 


16.7 


17.6 


1BIX rohn's 


2.0 


1.0 


Monocytes LPS 


6.4 


5.0 


Colon 


11.9 


10.5 


Macrophages rest 


2v5 


22.8 Lung 


13.3 


11.2 


Macrophages LPS 


9.9 


7.1 


Thymus 


14.4 


12.9 


HUVEC none 


20.6 


17.9 


Kidney 


... 


19.6 


HUVEC starved 


43.5 


38 4 









CNS_neurodegeneration_vl.O Summary: Ag3023/Ag3373 This panel does not show 
differential expression of the CG5 8564-01 gene in Alzheimer's disease. However, this 
expression profile confirms the presence of this gene in the brain. Please see Panel 1 .3D for 
discussion of utility of this gene in the central nervous system. 



General_screening_panel_vl.4 Summary: Ag3373 Highest expression of the CG58564- 
01 gene is seen in a prostate cancer cell line (CT=27). Overall, this gene is expressed at 
moderate levels in the cancer cell lines in this panel. A higher level of expression is 
observed in clusters of cell lines derived from prostate, brain, melanoma, colon, lung, 
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breast and ovarian cancer when compared to expression in normal prostate, brain, colon, 
lung, breast and ovary. Thus, this gene could potentially be used as a diagnostic marker of 
cancer in these tissues. Furthermore, inhibition of the activity of this gene product using 
small molecule drugs may be effective in the treatment of cancer in these tissues. 

5 Among tissues with metabolic function, this gene product has moderate levels of 

expression in adipose, heart, skeletal muscle, adrenal, pituitary, thyroid and pancreas. Thus, 
this gene product may be a small molecule target for the treatment of endocrine and 
metabolic diseases, including obesity and Types 1 and 2 diabetes. 

In addition, this gene appears to be differentially expressed in fetal (CT value = 29) 
10 vs adult liver (CT value =33) and may be useful for differentiation between the two sources 
of this tissue. 

This gene is also expressed at moderate levels in all central nervous system samples 
present on this panel. Please see Panel 1 .3D for discussion of utility of this gene in the 
central nervous system. 

Panel 1.3D Summary: Ag3023 The CG58564-01 gene is ubiquitously expressed among 
the samples on this panel, with highest expression in an ovarian cancer cell line (CT=28.8). 
Overall, the expression of this gene shows good agreement with panel 1.4. A higher level 
of expression is observed in prostate, brain, melanoma, colon, lung, pancreatic, breast and 
ovarian cancer cell lines than the normal prostate, brain, colon, lung, pancreas, breast and 
ovary. Thus, expression of this gene could be used as a diagnostic marker of cancer in these 
tissues. Furthermore, inhibition of the activity of this gene product using small molecule 
drugs may be effective in the treatment of cancer in these tissues. 

Among tissues with metabolic function, expression of this gene is widespread, as in 
the previous panel. Please see Panel 1 .4 for discussion of utility of this gene in metabolic 
25 disease. 

This gene represents a phosphatase that is also expressed at low to moderate levels 
across the CNS. Some phosphatases comprise a family of MAP kinase regulating enzymes, 
members of which are upregulated in brains subjected to insults such as. ischemia and 
seizure activity. MAP kinases are kown to regulate neurotrophic and neurotoxic pathways. 
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Conseq uently, agents that modulate the activity of this gene may have utility in attenuating 
the apoptotic and neurodegenerative processes following brain insults. 



References: 

1. Wiessner C. The dual specificity phosphatase PAC-1 is transcriptionally induced 
in the rat brain following transient forebrain ischemia. Brain Res Mol Brain Res 1995 
Feb;28(2):353-6 

2. Boschert U, Muda M, Camps M, Dickinson R, Arkinstall S. Induction of the 
dual specificity phosphatase PAC1 in rat brain following seizure activity. Neuroreport 1997 
Sep29;8(14):3077-80 

Panel 4D Summary: Ag3023/Ag3373 The CG585864-01 gene is expressed at high to 
moderate levels in a wide range of cell types and tissues of significance in the immune 
response in health and disease. Highest expression of this gene is seen in ionomycin treated 
Ramos B cells (CT=26.83). Therefore, targeting of this gene product with a small molecule 
drug or antibody therapeutic may modulate the functions of cells of the immune system as 
well as resident tissue cells and lead to improvement of the symptoms of patients suffering 
from autoimmune and inflammatory diseases such as asthma, allergies, inflammatory 
bowel disease, lupus erythematosus, and arthritis, including osteoarthritis and rheumatoid 
arthritis. 

M. CG58564-03: Dual specificity phosphatase 

Expression of gene CG58564-03 was assessed using the primer-probe sets Ag3023, 
Ag3373 and Ag5847, described in Tables MA, MB and MC. Results of the RTQ-PCR runs 
are shown in Tables MD, ME, MF, MG and MH. 

Table M A. Probe Name Ag3023 



Primers 



Sequences 




22 I 261 



SEQID 
NO: 



Forward r > ' - ciaatgctggatttgtccatca-3 ' 



408 1 



iProbe TET - 5 ' -tcaggaatatgaagccatctacctagca- 
3 ' -TAMRA 



28 | 230 



409 



Re\ en e - ■ -cggagcggtgacatcatctgta-3 ' 



22 ! 198 



410 



Table MB . Probe Name Ag3373 
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Primers Sequences 


Length 


Start 
Position 


SEQID 
NO: 


Forward 5 ' - atttgtccatcaacttcaggaa-3 ' 


22 


251 


411 


p m ),p JTET-5 1 -tgaagccatctacctagcaaaattaaca- 
n ° ne 3--TAMRA 


28 


221 


412 


Reverse |5 • -tggagtggtgacatcatctgta-3 ' 


22 


198 


413 


Table MC. Probe Name Ae5847 


Primers Sequences 

Forward 5 ' -cattccaaatgtttctgtagt-3 ' 

PrnU TET-5 ' -ttcatagcagatgaatatgggcctaagaac- f 
rroDe j 3 . -TAMRA ] 


Length | 

^~21 [ 
30 ! 


Start j SEQ*Id1 
Position! NO: J 

335 j 414 j 
' 371 j "415 "1 

— } 


Reverse j5 • -ccacagtgcaaggaagac-3 1 


18 


457 j 416 j 



Table M P. CNS neurodegeneration vl.O 



Tissue Name 


Rel. Exp.(%) 
Ag3023, Run 


Rel. Exp.(%) 
Ag3373, Run 
2101 54071 


(Tissue 
Name 


I Rel. Exp.(%) 
Ag3023, Run 
209821074 


Rel. Exp.(%) 
Ag3373, Run 
210154071 








Control 






AD 1 Hippo 


10.9 


16.8 


(Path) 3 
Temporal 

[Ctx 


9.1 










Control 






AD 2 Hippo 


34.2 


37.6 


(Path) 4 

Temporal 

Ctx 


40.6 


65.5 








AD 1 






AD 3 Hippo 


12.0 


15.8 


Occipital 
Ctx 


24.7 


29.1 








AD 2 


— — — 




AD 4 Hippo 


13.8 


10.3 


Occipital 
Ctx 

(Missing) 


0.0 


0.0 


AD 5 hippo 


60.7 


57.8 


AD 3 

Occipital 

Ctx 


14.7 


15.0 






AD -1 






AD 6 Hippo 


80.7 


72.2 


Occipital 
Ctx 


35.4 


22.4 


Control 2 
Hippo 


35.8 


38.4 


AD 5 

Occipital 

Ctx 


3.9 

j 


30.4 


Control 4 


16.5 


11.7 


AD 6 


46.0 j 


37.4 
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Hippo 


[ 




Occipital 
Ctx 






Control. (Path) 
3 Hippo 


13.1 


15.4 


Control 1 
Occipital 
Ctx 


9.9 


10.7 


AD 1 Temporal 
Ctx 


39.0 


31.4 


Control 2 
Occipital 
Ctx 


39.0 


38.4 


AD 2 Temporal 
Ctx 


38.7 


73.2 


Control 3 
Occipital 
Ctx 


23.0 


20.6 


AD 3 Temporal 
Ctx 


9.5 


13.2 

.. _ 


Control 4 
Occipital 

Ctx 


r— — - 

13.3 


13.3 


AD 4 Temporal 
Ctx 


27.9 


[Control 

34 9 F ath > ! 

[[Occipital 

Ctx 




80.1 


76.3 


AD 5 Inf 
Tempor al Ctx 


59.0 


100.0 


Control 
(Path) 2 
Occipital 
Ctx 


17.3 


20.0 


AD 5 

SupTemporal 
Ctx 


33.2 


44.1 


Control 
(Path) 3 
Occipital 
Ctx 


8 4 


8.7 


AD 6 Inf 
(Temporal Ctx 


100.0 


jControl 
73 2 (Path) 4 
Occipital 

jCtx J 


21.2 


20.6 


IAD 6 Sup 
Temporal Ctx 


79.6 


m . (Control 1 

(Parietal Ctx 


12.1 


16.3 


[Control 1 
1 emporal Ctx 


10.2 


. , _ IControl 2 
(Parietal Ct\ 


48.0 


40.9 


Control 2 

Temporal Ctx 1 

— 


41.2 


^ 1 9 Control ; 

(Parietal Ctx 


17.9 


16.3 


Control 3 
Temporal Ctx 


20.3 


IControl 
20.0 (Path) 1 

|Parietal Ctx 


74.7 


64.2 


Control 4 
Temporal Ctx 


9.7 


IControl 
9.9 (Path) 2 
[Parietal Ctx 1 


28.9 ! 

| 


59.9 


Control (Path) 
1 Temporal Ctx 


59.9 j 


68.3 


Control 
(Path) 3 
Parietal Ctx 


10.2 j 


9.0 


Control (Path) 
2 femporal Ctx 


40.3 


41.2 ! 


Control 
(Path) 4 


44.8 j 


43.8 
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Table ME . General_screeningjpanel_vl.4 



Tissue Name 


Rel. Exp.(%) 
Ag3373, Run 
1 217043119 


Tissue Name 


Rel. Exp.(%) 
Ag3373, Run 
217043119 


Adipose 


120 


Renal ca. TK-10 


20.3 


Melanoma* 
Hs688(A).T 


30.8 


Bladder 


23.2 


Melanoma* 
Hs688(B).T 


69.3 


Gastric ca. (liver met.) 
\1 I-N87 


25.3 


Melanoma* Ml 1 


15.0 


Gastric ca. KATO 111 


30.8 


Melanoma* 
LOXIMVI 


26.6 


Colon ca. SW-948 


9.7 


'Melanoma* SK- 
lMEL-5 


21.5 




Colon ca. SW480 


35.1 


Squamous cell 
carcinoma SCC-4 


33.0 


Colon ca.* (SW4S0 
met) SW620 


13.9 


Testis Pool 


19.8 


Colon ca IIT29 


8.5 


Prostate; ca.* (bone 
met) PC-3 


100.0 


Colon ca. HCT-116 


36.9 


Prostate: Pool 


»• 


Colon ca. CaCo-2 


42.9 


! Placenta 


3.8 " 


Colon cancer tissue 


9.0 


: Uterus Pool 


7.4 


Colon ca. SW1116 


5.8 


O'.diian ca 
OVCAR-3 


28.5 


^oion ca. ^oio-zio 


4.3 


Ovarian ca. SK- 
OV-3 


40.3 


Colon ca. SW-48 




4.2 


Ovarian ca. 
OVCAR-4 


20.0 




Colon Pool 


.™_ 

20.7 


Ovarian ca. 
OVCAR-5 


35.1 


Small Intestine Pool 


12.2 


[Ovarian ca. 
[IGROV-1 


10.9 


Stomach Pool 


9.9 


Ovarian ca. 
OVC \R-N 


9.2 


Bone Marrow Pool 


11.6 


Ovary 


9 7 


Fetal Heart 


20.7 


Breast ca MCF-7 


37.6 


Heart Pool 


10.6 


Breast ca. MDA- 
MB-231 


37.1 Lymph Node Pool 


17.9 


Breast ca. B 1 549 


62.4 


Fetal Skeletal Muscle 


12.3 J 


Breast ca. T47D 


61.1 


Nk^ Ictal \ !Li .Cle POO! j 


1 6.0 


Breast ca. MDA-N 


10.0 ;SpleenPooI 


11.6 
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Breast Pool 17.3 


1 hymns P. >ol 


I 12.2 


Trachea 


12.0 


CNS cancer 
(glio/astro) U87-MG 


29.1 


Lung 


6.7 


CNS cancer 
(glio/astro) U-118-MG 


69.3 


Fetal Lung 


34.2 


CNS cancer 
(neuro;met) SK-N-AS 


34.9 


Lung ca. NCI-N417 


5.4 


CNS cancer (astro) SF- 
539 


19.1 


Lung ca. LX-1 


17.2 


CNS cancer (astro) 
SNB-75 


35.8 


Lung ca. NCI-H146 


3.0 


CNS cancer (glio) 
SNB-19 


11.3 


(Lung ca. SHP-77 


. e , CNS cancer (glio) SF- 
186 1295 


26.4 


Lung ca. A549 


. jBrain (Amygdala) Pool 


4.5 


iLung ca. NCI-H526 


4.6 'Brain (cerebellum) 


8.1 


]Lung ca. NCI-H23 


31.6 Brain (fetal) 


13.2 


Lung ca. NCI-H460 


18.2 


Brain (Hippocampus) 
Pool 


5.3 


Lungca HOP 62 


14.1 


Cerebral Cortex Pool 


5.4 


Lung ca. NCI-H522 


^ | g jBrain (Substantia 
jnigra) Pool 


4.5 


Liver 


1 .2 Brain (Thalamus) Pool 


8.0 


Fetal 1 iver 


32.3 


Brain (whole) 


6.2 


Liver ca. HepG2 


14.6 


Spinal Cord Pool 


6.6 


Kidney Pool 


22.1 Adrenal Gland 


8.1 


Fetal Kid ik) 


26.1 .Pituitary gland Pool 


3.0 


Renal ca. 786-0 


28.7 Salivaiy Gland 


4.7 


Renal c i \ 108 


1 1 .3 [Thyroid (female) | 


4.4 


Renal ca. ACHN 
Renal ca. 1 0-3 1 


. ~ ~ IPancreatic ca. 

CAPAN2 
24. 1 .Pancreas Pool 


17.3 

17 A "~ I 


Table MF. General screenins_panel vl.5 


Tissue Name 


Rel. Exp.(%) 
Ag5847, Run 
247590257 


Tissue Name 


Rel. Exp.(%) 1 
Ag5847, Run 
247590257 


Adipose 


0.1 


Renal ca. TK- 10 


0.2 


Melanoma* 
Hs6S8(A).l 


0.1 


Bladder 


0.1 


Melanoma* 
Hs688(B). f 


0.1 


Gastric ca. (liver met.) 
NCI-N87 i 


0.2 
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Melanoma* M14 


0.1 


( ia>!i il ca K\K)1!I 


0.1 


Melanoma* 
LOXIMVI 


J 


Colon ca. SW-948 


0.1 


Melanoma* SK- 
MEL-5 


! 


Colon ca. SW480 


— — — 

0.2 


Squamous cell 
carcinoma SCC-4 


0.2 


L o!< 'i! ca (SW480 
met) SW620 


1.8 


restis F'ool 


1 ST 


Colon ca. HT29 


0.0 


Prostate ca.* (bone 
met) PC-3 


0.6 Colon ca.HCT-1 16 


0.2 


Prostate Pool 


0.0 |Colon ca. CaCo-2 


0.0 


Placenta 


0.0 (Colon cancer tissue 


0.0 


Uterus Pool 


0.0 (Colon ca. SW1116 


0.0 


Ovarian ca. 
OVCAR-3 


0.2 jcolon ca. CoIo-205 


0.0 


Ovarian ca. SK- 
OV-3 


0.1 


Colon ca. SW-48 


0.0 


Ovarian ca. 
loVCAR-4 


0.1 


Colon Pool 


0.1 


Ovarian ca 
|OVCAR-5 


0.2 


Small Intestine Pool 


0.0 


|lGROV-l 


0.0 


Stomach Pool 


0.0 


:Ovarian ca. 
OVCAR-8 


0.1 


Bone Marrow Pool 


0.0 


Ovary 


0.1 


Fetal Heart 


0.1 


Breast ca. MCF-7 


0.3 


Heart Pool 




Breast ca. MDA- 
MB-231 


0.2 


Lymph Node Pool 


— °f — j 

0.1 

1 


Breast ca. BT 549 
Breast ca. T47D 


0 : 
<>j 


Fetal Skeletal Muscle 
Skeletal Muscle Pool 


0.1 I 

0.1 " " i 


Breast ca. MDA-N 


0.1 


Spleen Pool 




Breast Pool 


0.0 


Thymus Pool 


0.1 


Trachea 


0.1 


CNS cancer 
(glio/astro) U87-MG 


0.2 


Lung 


0.0 


CNS cancer 
(glio/astro) U-l 18-MG 


0.5 


Fetal Lung 


0.2 


CNS cancer 

Oui'R inietl Sk-N-AS 


0.2 


Lung ca.NCI-N417 


0.0 


CNS cancer (astro) SF- 
539 


0.1 


Lung ca. LX-1 


0.0 


CNS cancer (astro) 
SNB-75 


0.2 


Lung ca. NCI-H146 


0.0 


CNS cancer (glio) 
SNB-19 


0.1 
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Lung ca. SHP-77 


n . |CNS cancer (glio) SF- 

1295 


0.2 


Lung ca. A549 


0.2 


Brain i \ni gj th-u i )ol 


II u 


Lung ca. NCI-H526 


0.0 


Brain (cerebellum) 


0.0 


Lung ca. NCI-H23 


0.1 


Brain (fetal) 


0.1 


JLung ca. JNCi-ti4oU 


0.1 


Brain (Hippocampus) 
Pool 


0.0 


Lung ca. HOP-62 


0.0 |Cerebral Cortex Pool 


0.0 


It una ca TVTPT-T-TS?? 


q j (Brain (Substantia 
[nigra) Pool 


0.0 


jLiver 


0.0 jBrain (Thalamus) Pool 


0.0 


Fetal Liver 


0.1 jBrain (whole) 


0.0 


Livei ca. HepC.2 


0.1 ! Spinal Cord Pool 


0.0 


kidne\ Pool 


0.1 iAdrenal Gland 


0.0 


Fetal Kidney 


0.1 Pifmtan uland Red 


0.0 


Renal ca. 786-0 


0.2 Salivarv Gland 


100.0 


Renal ca. \4<>X 


0.1 jThyroid (female) 


0.0 


Renal ca. ACHN 


n , (Pancreatic ca. 

CAPAN2 


0.1 


Renal ca. UO-3 1 


0.1 Pancreas Pool 


0.0 


Table MG. Panel 1.3D 


Tissue Name 


Rel. Exp.(%) | 
Ag3023, Run 3 Tissue Name 
167966931 1 


Rel. Exp.(%) 
Ag3023, Run 
167966931 


Li\ er adenocarcint -ma 


51.1 jKidney (fetal) 


26.2 


[Pancreas j 6.1 jRenal ca. 786-0 


34.2 


Pancreatic ca. CAPAN . _ _ 0 . 

j ? 17.7 JRenal ca. A498 


17.6 


[Adrenal gland 


3.8 (Renal ca. RXF 393 


17.2 


[Thyroid 


3.0 Renal ca. ACHN 


13.5 


(Salivary gland 


3.9 .Renal ca. UO-3 1 


0.0 


Pituitary gland 


3.6 f Renal ca. TK- 10 J 23.0 


(Brain (fetal) 


8.1 [Liver 


11.7 


'Brain (whole) 


8.5 1 iver (fetal) 


8.0 


Brain (amygdala) 


6 7 Liver ca. 

(hepatoblast) HepG2 


26.2 


Bijiit ueir-hellum) 


15.2 I 1 1 1 » ■_■ 


3.1 


Brain (hippocampi^) 


5.4 Lung (fetal) 


11.0 


Brain (substantia nigra) 


Q n Lung ca. (small cell) 

;lx-i 


12.9 


Brain (thalamus) 


4.2 Lung ca. (small cell) 


9.9 
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r '-"™""""" 


JncI-H69 I 


Cerebral Cortex 


j Lung ca. (s.cell var.) 
SHP-77 


67.8 


Spinal cord 


6.9 


Lung ca. (large 
cell)NCI-H460 


3.4 


Glio/astro U87-MG 


28.5 


Lung ca. (non-sm. 
cell) A549 


45.1 


Glio/astroU-118-MG 


46.7 


Lung ca. (non-s.cell) 
NCI-H23 


22.7 


(astrocytoma SW1783 


40.6 


Lung ca. (non-s.cell) 
HOP-62 


25.7 


jneuro*; met SK-N-AS 


27.2 


Lung ca. (non-s.cl) 
NCI-H522 


38.2 


[astrocytoma SF-539 


29.7 


Lung ca. (squam.) 
SW 900 


27.4 


[astrocytoma SNB-75 


35.1 


Lung ca. (squam.) 
NCI-H596 


29.9 


jglioma SNB-19 


15.6 


Mammary gland 


5.1 


jglioma U251 


37.9 


Breast ca.* (pl.ef) 
MCF-7 


47.0 


jglioma SF-295 


18.4 


Breast ca.* (pl.ef) 
MDA-MB-231 


22.7 


[Heart (fetal) 


2.9 


Breast ca.* (pl.ef) 
T47D 


86.5 


jHeart 


12.9 


Breast ca. BT-549 


15.9 


jSkeletal muscle (fetal) 


3.4 


Breast ca. MDA-N 


10.4 


Skeletal muscle 


36.3 


Ovary 


2.9 


Bone marrow 


. r Ovarian ca. 

JOVCAR-3 


26.1 


Thymus 


, . ~ jOvarian ca. 

SOVCAR-4 


16.3 


Spleen 


8.7 


Ovarian ca. 
OVCAR-5 


83.5 


Lymph node 


11.8 


Ovarian ca. 
OVCAR-8 


9.3 


Colorectal 


j Q 4 jOvarian ca. IGROV- 


12.0 


Stomach 


- „ JOvarian ca.* 

(ascites) SK-OV-3 


100.0 


Small intestine 


5.1 (kins 


_ 4 - 9 


Colon ca. SW480 


1<>.3 


Placenta 


1.3 


Colon ca.* 
SW620(SW480 met) 


42.9 


Prostate 


3.9 


Colon ca. HT29 


9.9 


Prostate ca.* (bone 
met)PC-3 


78.5 



555 



Colon ca. HCT-116 


26 2 


Testis 


9.7 


Colon ca. CaCo-2 


41.5 


Melanoma 
Hs688(A).T 


~~ " sV " 1 


Colon ca. 
tissue(OD03866) 


6.3 


Melanoma* (met) 
Hs688(B).T 


14.2 


Colon ca. HCC-2998 


16.0 


Melanoma UACC- 
62 


14.0 


Gastric ca.* (liver met) 
NCI-N87 


18.8 


Melanoma Ml 4 


5.7 


Bladder 


30.6 


Melanoma LOX 
IMVI 


8.8 


Trachea 


3.2 


Melanoma* (met) 
SK-MEL-5 


14.7 


Kidney 


9.6 


Adipose 


18.9 



Table MH . Panel 4D 



Tissue Name 


Rel. 

Exp.(%) 
Ag3023, 

164516146 


Rel. 
Exp.(%) 
Ag3373, 
Run 
165296617 


Tissue Name 


Rel. 

Exp.(%) 
Ag3023, 

Run 
164516146 


Rel. 
Exp.(%) 
Ag3373, 
Run 
165296617 


Secondary Thl act 


18.6 


17.9 


HUVEC IL-lbeta 


20.3 


1X6 


Secondary Th2 act 


24.3 


28.5 


HUVEC 1FN 
gamma 


25.3 


22.7 


Secondary Trl act 


22.8 


HUVEC 1 Nh 
21.8 (alpha + IFN 
igamma 


_ 

16.3 


18.0 


Secondary Thl rest 


7.5 


6.8 


HUVEC TNF 
alpha I II 4 




18.2 


13.4 


Secondary Th2 rest 


11.6 


9.5 


HUVEC 11-11 


13.7 


9.9 


Secondary Trl rest 


12.1 


10.7 


Lung 

Microvascular EC 
none 


25.7 


21.6 


Primary Thl act 


20.7 


jLung 

. , - (Microvascular EC 
TNFalpha + IL- 
11 beta 


26.2 


18.3 


Primary Th2 act 


20.2 


19 3 (Microvascular 
J Dermal EC none 


27.5 


21.3 


Primary Trl act 


23.3 


27.7 


Microsvasular 
Dermal EC 
TNFalpha + IL- 
1 beta 


20.7 


19.9 


Primary Thl rest 


51.1 


51.4 


Bronchial 
epithelium 




13.0 


16.3 
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jTNFalpha + 
ILlbeta 






Primary Th2 rest 


26.2 


2^ ^ J Small airway 

(epithelium none 


8.1 


8.5 


Primary Trl rest 


23.7 


1 Small airway 
?fi , jepithelium 

jTNFalpha + IL- 
Jlbeta 


50.3 


39.8 


CD45RA CD4 
l\ mphocyte act 


14.6 


. 1 „ Coronery artery 
|SMC rest 


20.2 


18.9 


ICD45RO CD4 
lymphocyte act 


25.2 


22.4 


Coronery artery 
SMC TNFalpha + 
IL-lbeta 


12.0 


9.8 


CD8 lymphocyte 

:act 


20.4 


15.8 


Astrocytes rest 


10.4 


11.1 


Secondary CD8 
[lymphocyte rest 


16.5 


19.9 


Astrocytes 
TNFalpha + IL- 
1 beta 


11.7 


9.8 


Secondary CDS 
lymphocyte act 


13.2 


93 [KU-812 

< Basophil) rest 


47.6 


38.2 


CD4 lymphocyte 
none 


17.1 


iKU-812 
11.6 j(Basophil) 

iPMA/ionomycin 


94.0 


92.0 


2iy 

Thl/Th2/Trl_anti- 
CD95 C I 1 1 1 


18.3 


(CCD 11 06 
16.6 l(Keratinocytes) 
[none 


19.9 


13.2 


LAK cells rest 


25.5 


16.0 


CCDll06 
(Keratinocytes) 
TNFalpha + IL- U 
lbeta 


- 

4.8 


LAK cells IL-2 


27.2 


22 5 


Liver cirrhosis J 3.1 




2.7 


LAK cells IL- 
2+IL-12 


27.2 


19.3 ILupus kidney f 2.1 

1 1 


1.7 


LAK cells IL- 


36.3 


34.4 (NC1-H292 none 


30.1 


18.9 


1 Ak cells II -2 
IL-18 


35.1 


29.7 :NCl-H292 1L-4 

— 


33.9 


34.6 


LAK cells 
PMLA/i o nom y c in 


12.4 


1 1 .0 ;NCI-1 1292 IL-9 


40.1 


29.1 


NK Cells II -2 rest 


2(M) 


15.0 mCbW92R^3' 


16 : 


""TO 


Two Way MLR 3 
day 


24.0 


NCI-H292IFN 

16. / 1 

gamma 


16.6 


18.4 


Two Way MLR 5 
day 


12.9 


10.1 jHPAEC none 


13.6 


13.5 


Two Way MLR 7 
da> 


11.4 


9.5 


HPAEC TNF 
alpha + IL-1 beta 


25.3 


25.3 
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PBMC rest 


13.7 


10.5 


Lung fibroblast J ^ ^ 
none 


14.2 


rrJMC rWM 


69.3 


66.4 


Lung fibroblast 
TNF alpha + IL-1 
beta 


6.1 


7.2 


PBMC PHA-L 


22.8 


17.7 


Lung fibroblast 
IL-4 


28.5 


29.1 


Ramos (B cell) 
none 


24.1 


. q _ iLung fibroblast 
IL-9 


23.0 23.3 


Ramos (B cell) 
ionomycin 


100.0 


1 nn n i Lun § fibroblast 
jIL-13 


20 6 


18 ' 9 j 


B lymphocytes 
PWM 


71.7 


2 jLung fibroblast 
I FN gamma 


39.0 


32.5 


B lymphocytes 
CD40L and IL-4 


29.1 


28.7 


Dermal fibroblast 
CCD 1070 rest 


33.9 


31.0 


EOL-1 dbcAMP 


12.1 


10.5 


Dermal fibroblast 
CCD 1070 TNF 
alpha 


76.8 


62.0 


EOL-1 dbcAMP 
PMA/ionomycin 


14.5 


IDermal fibroblast 
10.9 [CCD1070IL-1 

jbeta 


20.3 


13.9 

— — j 


Dendritic cells 
none 


13.2 


14 8 .Dermal fibroblast 
1PN gamma 


14.2 


9.5 


Dendritic cells LPS 


11.7 


0 , Dermal fibroblast 
83 llL-4 


26.4 


20.4 


Dendriti c cells 
anti-CD40 


17.7 


12.7 IBD Colitis 2 


2.6 


2.2 


Monocytes rest 


16.7 


1 7.6 IBD Crohn's 


2.0 


1.9 


Mi ic; tcs LPS 


6.4 


5.0 Colon 11.9 


10.5 


Macrophages rest 


23.5 


22.8 Lung j 13.3 


11.2 


Macrophages LPS 




7. 1 Thymus 


14.4 


12.9 


HUVEC none 


20.6 


17.9 (Kidney 


27.5 


19.6 


HUVEC staged 


43.5 


4 





CNS_neurodegeneration_vl.0 Summary: Ag3023/Ag3373 This panel does not show 
differential expression of the CG56804-03 gene, a splice variant of CG56804-01, in 
Alzheimer's disease. However, this expression profile confirms the presence of this gene in 
the brain. Please see Panel 1.3D for discussion of utility of this gene in the central nervous 
system. Ag5847 - This primer pair recognizes only the splice variant CG58564-03. 
Expression of this variant is low/undetectable (CTs > 35) across all of the samples on this 
panel (data not shown). 
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Genera Iscreeningpanelv 1.4 Summary: Ag3373 Highest expression of the CG56804- 
03 gene is seen in a prostate cancer cell line (CT=27). Overall, this gene is expressed at 
moderate levels in the cancer cell lines in this panel. A higher level of expression is 
observed in clusters of cell lines derived from prostate, brain, melanoma, colon, lung, 
breast and ovarian cancer when compared to expression in normal prostate, brain, colon, 
lung, breast and ovary. Thus, this gene could potentially be used as a diagnostic marker of 
cancer in these tissues. Furthermore, inhibition of the activity of this gene product using 
small molecule drugs may be effective in the treatment of cancer in these tissues. 

Among tissues with metabolic function, this gene product has moderate levels of 
expressiion in adipose, heart, skeletal muscle, adrenal, pituitary, thyroid and pancreas. Thus, 
this gene product may be a small molecule target for the treatment of endocrine and 
metabolic diseases, including obesity and Types 1 and 2 diabetes. 

In addition, this gene appears to be differentially expressed in fetal (CT value = 29) 
vs adult liver (CT value =33) and may be useful for differentiation between the two sources 
of this t issue. 

This gene is also expressed at moderate levels in all central nervous system samples 
present on this panel. Please see Panel 1.3D for discussion of utility of this gene in the 
central nervous system. 

General_screening_panel_vl.5 Summary: Ag5847 - This primer pair, specific to this 
splice variant, CG58564-03. Expression of this variant is highest in salivary gland 
(CT=28.6). Therefore, expression of this gene can be used to differentiate this sample from 
others on the panel. 

Panel 1.3D Summary: Ag3023 The CG56804-03 gene is ubiquitously expressed among 
the samples on this panel, with highest expression in an ovarian cancer cell line (CT=28.8). 
Overall, the expression of this gene shows good agreement with panel 1 .4. A higher level 
of expression is observed in prostate, brain, melanoma, colon, lung, pancreatic, breast and 
ovarian cancer cell lines than the normal prostate, brain, colon, lung, pancreas, breast and 
ovary. Thus, expression of this gene could be used as a diagnostic marker of cancer in these 
tissues. Furthermore, inhibition of the activity of this gene product using small molecule 
drugs may be effective in the treatment of cancer in these tissues. 
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Among tissues with metabolic function, expression of this gene is widespread, as in 
the previous panel. Please see Panel 1 .4 for discussion of utility of this gene in metabolic 
disease. 

This gene represents a dual specificity phosphatase that is also expressed at low to 
5 moderate levels across the CNS. Dual-specificity phosphatases comprise a family of MAP 
kinase regulating enzymes, members of which are upregulated in brains subjected to insults 
such as ischemia and seizure activity. MAP kinases are kown to regulate neurotrophic and 
neurotoxic pathways. Consequently, agents that modulate the activity of this gene may 
have utility in attenuating the apoptotic and neurodegenerative processes following brain 
10 insults. 

Panel 4.1D Summary: Ag5847 - This primer pair recognizes a splice variant of 
CG58564-03. Expression of this variant is low/undetectable (CTs > 35) across all of the 
samples on this panel (data not shown). 

Panel 4D Summary: Ag3023/Ag3373 The CG56804-03 gene is expressed at high to 
1 5 moderate levels in a wide range of cell types and tissues of significance in the immune 

response in health and disease. Highest expression of this gene is seen in ionomycin treated 
Ramos 3 cells (CT=26.83). Therefore, targeting of this gene product with a small molecule 
drug or antibody therapeutic may modulate the functions of cells of the immune system as 
well as resident tissue cells and lead to improvement of the symptoms of patients suffering 
20 from autoimmune and inflammatory diseases such as asthma, allergies, inflammatory 

bowel disease, lupus erythematosus, and arthritis, including osteoarthritis and rheumatoid 
arthritis. 

N. CG58564-04: Dual specificity phosphatase 

Expression of gene CG58564-04, a splice variant of CG58564-01, was assessed 
25 using the primer-probe sets Ag3023, Ag3373 and Ag5844, described in Tables NA, NB 
and NC. Results of the RTQ-PCR runs are shown in Tables ND, NE, NF and NG. 

Table NA . Probe Name Ag3023 
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1 

(Primers 


Sequences 


.Length 


Start 
Position 


SEQ ID 

NO: 


[Forward 


5 ' -ctaatgctggatttgtccatca-3 ' 




190 


417 


Probe 


TET-5 ' -tcaggaatatgaagccatctacctagca- 
3 ■ -TAMRA 


I 28 


159 


418 


j Reverse: 


5 1 -tggagtggtgacatcatctgta-3 1 


: 


127 


419 


Table NB. Probe Name Ag3373 


Primers 


Sequences 




i Length 


Start 
Position 


SEQ ID 

NO: | 


jForward|5 ■ -atttgtccatcaacttcaggaa-3 ' 




~180 


420 


Probe 


TET-5 ' -tgaagccatctacctagcaaaattaaca- 
3 ' -TAMRA 


1 28 


150 


421 


Reverse \S 1 -tggagtggtgacatcatctgta-3 1 


I 22 


127 


422 



Table N C. Probe Name Ag5844 



JPrimers 


Sequences 


Length 


Start 
Position 


SEQ ID 
NO: 


Forward 


5 1 -ccttagtctaaataactgctg-3 ' 




377 


423 


jProbe 


- _ r _ r- agtt jct tattttg tg i< a 3 ' 
TAMRA 




415 


~~ 424 


^Reverse 


5 ' -aggagtggacctaccctat-3 ' 


1- 


552 


425 



Table N D. CNS_neurodegeneration_vl.O 



Tissue Name 


Rel. Exp.(%) 
Ag3023, Run 
209821074 


Rel. Exp.(%) 
Ag3373, Run 
210154071 


Tissue 
Name 


Rel. Exp.(%) 
Ag3023, Run 
209821074 


Rel. Exp.(%) 
Ag3373, Run 
210154071 


AD 1 Hippo 


10.9 


16.8 


Control 
(Path) 3 
Temporal 
Ctx 


9.1 


8.0 


AD 2 Hippo 


34.2 


37.6 


Control 
(Path) 4 
Temporal 
Ctx 


40.6 


65.5 


AD 3 Hippo 


12.0 




AD 1 

Occipital 

Ctx 


— — 

24.7 


29.1 


AD 4 Hippo 


13.8 


10.3 


AD 2 

Occipital 

Ctx 

(Missing) 




0.0 


0.0 


AD 5 hippo 


60.7 


57.8 


AD 3 


14 7 


15.0 
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Occipital 

Ctx 






AD 6 Hippo 


80.7 


72.2 


AD 4 
Occipital 

Ctx 


35.4 


22.4 


Control 2 
Hippo 


35.8 


|AD5 
38.4 Occipital 
f \ 


3.9 


30.4 


Control 4 
Hippo 


16.5 


1 1 .7 Occipital 

fCtx 


46.0 


37.4 


3 Hippo 


13.1 


Control 1 
15.4 (Occipital 

[Ctx 


9.9 


10.7 


Ctx 


39.0 


31.4 


Control 2 
Occipital 
Ctx 


39.0 


38.4 


AD 2 Temporal 
Ctx 


38.7 


73.2 


Control 3 
Occipital 
Ctx 


23.0 


20.6 


AD 3 Temporal 
Ctx 


9.5 


IControl 4 
13.2 (Occipital 
( \ 




13.3 


13.3 


AD 4 Temporal 
Ctx 


27.9 


jControl 
349 (Path) 1 
lOccipital 
Ctx 


80.1 


76.3 


AD 5 Inf 
Temporal Ctx 


59.0 


100.0 


Control 
(Path) 2 
Occipital 
Ctx 


17.3 


20.0 


AD 5 

S upTc mp ofci 1 
Ctx 


33 2 




iControl 

44 1 F ath > 3 
Occipital 

iCtx 


8.4 


8 7 


AD 6 Inf 
Temporal Ctx 


100.0 


Control 
?32 (Path) 4 
Occipital 

ICtx 


21.2 


20.6 


AD 6 Sup 
Temporal Ctx 


79.6 


80.1 


Control 1 
Parietal Ctx 


12.1 


16.3 


Control 1 
1 emporal Ctx 


10.2 


13.7 


Control 2 
Parietal Ctx 


48.0 


40.9 


Control 2 
Temporal Ctx 


41.2 


iControl 3 
Parietal Ctx 


17.9 


16.3 


Control 3 
Temporal Ctx 


20.3 


20.0 




Control 
(Path) 1 


74.7 

, 


64.2 







Parietal Ctx 




i 


Control 4 
Temporal Ctx 


9.7 


Control 
9.9 (Path) 2 

(Parietal Ctx 


28.9 




59.9 


Control (Path) 
1 Temporal Ctx 


59.9 


jControl 
68.3 (Path) 3 

(Parietal Ctx 


10.2 


9.0 | 


Control (Path) 
2 Temporal Ctx 


40.3 


jControl 
41.2 (Path) 4 

(Parietal Ctx 


44.8 


43.8 



Table NE . General_screeningjpanel_vl.4 



— ™™— - 

Tissue Name 


Rel. Exp.(%) 
Ag3373, Run 
217043119 


Tissue Name 


Rel. Exp.(%) 
Ag3373, Run 
217043119 


Adipose 


12.0 


Renal ca. rK-10 


20.3 


Melanoma* 
Hs688(A).T 


30.8 


Bladder 


23.2 


Melanoma* 
[Hs688(B).T 


69.3 


Gastric ca. (liver met.) 
NCI-N87 


______ 

25.3 


(Melanoma* Ml 4 


1 5.0 jGastric ca. KATO III 


30.8 


[Melanoma* 
LOX1MVI 


26.6 


Colon ca. SW-948 


9.7 


Melanoma* SK- 
MEL-5 


21.5 


Colon ca. SW480 


35.1 


Squamous cell 
carcinoma SCC-4 


33.0 


Colon ca.* (SW480 
met) SW620 


13.9 


Testis Pool 


19.8 


Colon ca. HT29 


8.5 


Prostate ca.* (bone 
met) PC-3 


100.0 


Colon ca. HCT-116 




36.9 


Prostate Pool 


9.2 


Colon ca. CaCo-2 


42.9 


Placenta 


3.8 


Colon cancer tissue 


9.0 


Uterus Pool 


7.4 


Colon ca. SWI 1 16 


5.8 " "H 


Ovarian ca. 
OVCAR-3 


28.5 


Colon ca. Colo-205 




Ovarian ca. SK- 
OV-3 


40.3 


Colon ca. S W-48 


4.2 


Ovarian ca. 
OVCAR-4 


20.0 


Colon Pool 


20.7 


Ovarian ca. 
OVCAR-5 


35.1 


Small Intestine Pool 


' 12.2 


Ovarian ca. 
IGROV-1 


10.9 


Stomach Pool 


9.9 


Ovarian ca. 


9.2 


Bone Marrow Pool 


11.6 i 
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OVCAR-8 


~ 


:.:::::z:::i 


Ovary 


9.7 


Fetal Heart 


20.7 


Breast ca. MCF-7 


37.6 


Heart Pool 


10.6 


Breast ca. MDA- 
MB-231 


37.1 


Lymph Node Pool 


17.9 


Breast ca. BT 549 


62.4 


Fetal Skeletal Muscle 


12.3 


Breast ca. T47D 




Skeletal Muscle Pool 


16.0 


Breast ca MDA-N 


10.0 


Spleen Pool 




Breast Pool 


17.3 


! hymus Pool 


12.2 


Trachea. 


12.0 


CNS cancer 
(glio/astro) U87-MG 


. 2 .": 1 _. 


Lung 


6.7 


CNS cancer 
(glio/astro) U-l 1 8-MG 


69.3 


Fetal Lung 


34.2 


CNS cancer 

(neuro met) SK-N-AS 


34.9 


Lungca.NCI-N417 


5.4 

- , _ 


CNS cancer (astro) SF- 
539 


19.1 


Lung ca. LX-1 


17.2 


CNS cancer (astro) 
SNB-75 


35.8 


Lung ca. NCI-H146 


3.0 


CNS cancer (glio) 
SNB- 1 9 


11.3 


Lung ca. SHP-77 


18.6 


CNS cancer (glio) SF- 
295 


26.4 


Lung ca. A549 


29.1 


Brain (Amygdala) Pool 


4.5 


Lungca. NCI-H526 




Brain (cerebellum) 


8.1 " ~ 


1 uri" :s NCI-IP^ 

— — 


31.6 


Hr mi f T>t i! i 




Lung ca. NCI-H460 


18.2 


Brain (Hippocampus) 
Pool 


5.3 


Iuiul IK )!'-<>' 


" " 14.1 


Cerebral C orUw Pool 


" 5.4 " 


LAMg Ca.. JNCl-rljZZ 


31.6 


Brain (Substantia 
nigra) Pool 


4.8 


Liver 


1.2 


Bt/ii) ('thalamus) Poo] 


8.0 


Petal I ivcr 


32.3 


Brain (whole) 


6 2 


Liver ca. HepG2 


14.6 


Spinal Cord Pool 


6.6 


Kidney Pool 


22.1 


Adrenal Gland 


8.1 """" 


Fetal Kidney 


26.1 


Pituitan gland P^-l 


~~ "3.0 


Renal ca. 786-0 


28.7 


Salivary Gland 




Renal ca. A498 


11.3 


Thyroid (female) 


4.4 


Renal ca. ACHN 


12.2 


Pancreatic ca. 
CAPAN2 


17.3 


Renal ca. UO-31 


24.1 


Pancreas Pool 


17.1 



Table NF . Panel 1.3D 
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Tissue Name 


Rel. Exp.(%) 
Ag3023, Run 
167966931 


Tissue Name 


rvei. H/Xp.^ /o ) 

Ag3023, Run 
L6796693] 


Liver adenocarcinoma 


51.1 




Kidney (fetal) 


26.2 


Pancreas 


6.1 


Renal ca. 786-0 


34.2 


Pancreatic ca. CAP AN 
2 


17.7 


jxcridl Ca. /\H-yo 


17 6 


Adrenal gland 


3.8 


Renal ca. RXF 393 


17.2 


Thyroid 


3.0 


Renal ca. ACHN 


13.5 


Salivary gland 


3.9 


Renal ca. UO-31 


i) i) 


Pituitary gland 


3.6 




Kcn.-il ca. IK- SO 


23.0 


Brain (fetal) 


8.1 




Liver 


11.7 


Brain (whole) 


8.5 




1 jver (fetal) 


8.0 


Brain (amygdala) 


6.7 




Liver ca. 

(hepatoblast) HepG2 


26.2 




15.2 








Bum (hippocampi! ;) 


5.4 




Lung (fetal) 


11.0 


Brain (substantia nigra) 


9.0 




Lung ca. (small cell) 
LX-1 


12.9 


Brain (thalamus) 


4.2 




Lung ca. (small cell) 
NCI-H69 


9.9 


Cerebral Cortex 


2.0 




Lung ca. (s.cell var.) 
SHP-77 


67.8 


Spinal cord 


6.9 




Lung ca. (large 
cell)NCI-H460 


3.4 


Glio/astro U87-MG 

____ 


28.5 




Lung ca. (non-sm. 
cell) A549 


45.1 


Glio/astro U-118-MG 


46.7 




Lung ca. (non-s.cell) 
NCI-H23 


22.7 


astrocytoma SW1783 


40.6 


1 img ca. (non-s.cell) 
HOP-62 


25.7 


neuro*; met SK-N-AS 


27.2 


|Lung ca. (non-s.cl) 
NCI-H522 


38.2 


astrocytoma SF-539 


29.7 




Lung ca. (squam.) 
SW 900 


27.4 


astrocytoma SNB-75 


35.1 


I. ling ca. (squam.) 
NCI-H596 


29.9 


glioma SNB-19 


15.6 




Mammary gland 


5.1 


glioma U25I 


37.9 


iBreast ca.* (pl.ef) 
jMCF-7 


47.0 


glioma SF-295 


18.4 




Breast ca.* (pl.ef) 
MDA-MB-231 


22.7 


Heart (fstal) 


2.9 


[Breast ca.* (pl.ef) 
JT47D 


86.5 


Heart 


12.9 


(Breast ca. BT-549 


15.9 
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Skeletal muscle (fetal) 


3.4 Breast ca. MDA-N 


10.4 


Skeletal muscle 


36.3 jOvary 


2.9 


Bone marrow 


. , JOvarian ca. 

|OVCAR-3 


26.1 


Thymus 


1 . - lOvarian ca. 

jOVCAR-4 


16.3 


Spleen 


8.7 


Ovarian ca. 
OVCAR-5 


83.5 


Lymph node 


11.8 


Ovarian ca. 
OVCAR-8 


9.3 


Colorectal 


10.4 


Ovarian ca. IGROV- 
1 


12.0 


Stomach 


7.8 


Ovarian ca.* 
(ascites) SK-OV-3 


— 

100.0 


Small intestine 


5.1 


Uterus 


4.9 


Colon ca. SW480 


19.3 


Placenta 


1 3 


Colon ca.* 
SW620(SW480 met) 


42.9 


Prostate 


3.9 


Colon ca. HT29 


9.9 


Prostate ca.* (bone 
met)PC-3 


78.5 


Colon ca. HCT-116 


26.2 


Testis 


9.7 


Colon ca. CaCo-2 


41.5 


Melanoma 

Hs688(A).T 


5.9 


Colon ca. 
tissue(( »D03866) 


6.3 


Melanoma* (met) 
Hs688(B).T 


» ~ ™ 

14.2 


Colon ca. HCC-2998 


16.0 


Melanoma UACC- 
62 


14.0 


Gastric ca.* (liver met) 
NCI-N87 


18.8 jMelanoma Ml 4 


5.7 


Bladder 


,„ , Melanoma LOX 

30 : 6 liMvi 


8.8 


Trachea 


- j Melanoma* (met) 
|SK-MEL-5 


14.7 


Kidney 


9.6 jAdipose 


18.9 | 



Table N G. Panel 4D 



Tissue Name 


Rel. 
Exp.(%) 
Ag3023, 
Run 
164516146 


Rel. 

Exp.(%) 
Ag3373, 
Run 
165296617 


Tissue Name 


Rel. 
Exp.(%) 
Ag3023, 
Run 
164516146 


Rel. 

Exp.(%) 
Ag3373, 
Run 
165296617 


Secondary Thl act 


18.6 


17.9 


HUVEC IL-lbeta 


20.3 


IS h 


Secondeiry Th2 act 


24.3 


28.5 


111 \T.C I FN 
gamma 


25.3 


22.7 
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Secondary Trl act 




21.8 


HUVEC TNF 
alpha + IFN 
gamma 


16 3 . 18.0 

j 


Secondary Thl rest 


7.5 


, _ Ihuvec tnf 

alpha -II 4 


18.2 1 13.4 


Secondary rh2 rest 


11.6 


9.5 Ihuvec il- li 


IT 0.9 


Secondary Trl rest 


12.1 


jLung 

10.7 (Microvascular EC 
|none 


25.7 


21.6 


iPrimaiy Thl act 

1 

1 »" 


20.7 


16.5 


Lung 

Microvascular EC 
TNFalpha + IL- 
1 beta 


26.2 


18.3 


(Primary Th2 act 


20.2 


19.3 


Microvascular 
Dermal EC none 


27.5 


21.3 


[Primary Trl act 


23.3 


j M i cro sv3.su 1 ox 
Dermal EC 
TNFalpha + 1 L- 
1 beta 


20.7 


19.9 


j Primary Thl rest 


51.1 


(Bronchial 
- 1 . jepithelium 
«TNFalpha + 
JILlbeta 


13.0 


16.3 


(Primary Th2 rest 


26.2 


2^ - i Small airway 

epithelium none 


8.1 


8.5 


Primary Trl rest 


23.7 


J Small airway 
lf , lepithelium 
ZO>1 jTNFalpha + IL- 

(lbeta 


50.3 


39.8 


CD45RA CD4 
Kmphocyte act 


14.6 


1 1 _ iCoronery artery 
SMC rest 


20.2 


18.9 


CD45RO CD4 
lymphocyte act 


25.2 




Coronery artery 
22.4 ISMC TNFalpha + 
(IL-Ibeta 


12.0 


9.8 




CD8 lymphocyte 
act 


20.4 


15.8 Astrocytes rest 


-10.4 11.1 


Secondary CD 8 
lymphocyte rest 


16.5 


Astrocytes 
19.9 JTNFalpha + IL- 
llbeta 


11.7 


9.8 


Secondary CD8 
lymphocyte act 


13.2 


9 3 I KU - 812 

(Basophil) rest 


47.6 


38.2 


CD4 lymphocyte 
none 


17.1 


11.6 


KU-812 

(Basophil) 

PMA/ionomycin 


94.0 


92.0 


2r> 

Thl/Th2/Trl anti- 
CD95CH11 


18.3 


16.6 

- , 


CCD1106 

(Keratinocytes) 

none 


19.9 


13.2 
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LAK cells rest 


25.5 


CCD1 106 

. , „ (Keratinocytes) 
TNFalpha + IL- 
jlbeta 


6.0 


4.8 


LAK cells IL-2 


27.2 


22.5 Liver cirrhosis 


3.1 


2.7 


LAK cells IL- 
2+IL-12 


27.2 


1 9.3 ; Lupus kidney 


2.1 


1.7 


LAK cells IL- 
2+IFN gamma 


36.3 


34.4 


NCI-H292 none 


30.1 


18.9 


LAK cells 11 -2+ 
IL-18 


35.1 


29.7 INCI-H292 IL-4 


33.9 


34.6 


LAK cells 
PMA/ionomycin 


12.4 


11.0 jNCI-H292 IL-9 


40.1 


29.1 


NK Cells IL-2 rest" 


20.0 


15.0 INCI-H292 IL-13 


162 




lwo way mlk 3 
day 


24.0 


,, n 1NCI-H292IFN 

Jgamma 


16.6 


18.4 


Two Way MLR 5 


12.9 


10.1 fHPAECnone 

■ 


13.6 


13.5 j 

1 


Two Way MLR 7 
day 


11.4 


9 . iHPAEC TNF 
0 ialpha + IL-1 beta 


25.3 


25.3 


PBMC rest 


13.7 


10.5 


Lung fibroblast 
none 


11.4 


14.2 


JrrJMC Jl WM 


69.3 


66.4 


Lung fibroblast 
TNF alpha + IL-1 
beta 


6.1 


7.2 


PBMC PHA-L 


22.8 


17.7 


Lung fibroblast 
IL-4 


28.5 


29.1 


Ramos (B cell) 
none 


24.1 


19.3 


Lung fibroblast 
IL-9 


23.0 


23.3 


Ramos (B cell) 
lonomycin 


100.0 


mnn (Lung fibroblast 
lUUW jlL-13 


20 6 


18.9 


B lymphocytes 
PWM 


71.7 
29.1 


74.2 
28.7 


Lung fibroblast 
IFN gamma 


39.0 
33.9 


32.5 j 


B lymphocytes 
|CD40L and IL-4 


Dermal fibroblast 
CCD 1070 rest 


31.0 1 


EOL-1 dbcAMP 


12.1 


IDermal fibroblast 
10.5 CCD 1070 TNF 
[alpha 


76.8 


62.0 


EOL-1 dbcAMP 
PMA/ionomycin 


14.5 


Dermal fibroblast 
10.9 ICCD1070IL-1 
Ibeta 


20.3 


13.9 1 


Dendritic cells 
none 


13.2 


14 g (Dermal fibroblast 
I FN gamma 


14.2 


- 

9.5 1 


Dendritic cells LPS 


11.7 


0 - (Dermal fibroblast \ 
8 -" IL-4 


26.4 


20.4 


Dendritic «. ell , 


17.7 


IBD Colitis 2 


2.6 


2.2 
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anti-CD40 f 










Monocytes rest 


16.7 


— — - 


IBD Crohn's 


2.0 


1.9 


plonocytesLPS 


6.4 


5.0 


Colon 


11.9 


10.5 


jMacrophages rest 


23.5 


22.8 


Lung 


13.3 


11.2 


[Macrophages LPS 


9.9 


7.1 


Thymus 


14.4 


12.9 


(HUVEC none 


20.6 


17.9 fKidney 


27.5 


19.6 


IHUVEC starved 


43.5 


38JT " f" 



CNS_neurodegeneration_vl.O Summary: Ag3023/Ag3373 This panel does not show 
differential expression of the CG56804-04 gene in Alzheimer's disease. However, this 
expression profile confirms the presence of this gene in the brain. Please see Panel 1 .3D for 
discussion of utility of this gene in the central nervous system. Ag5847 - This primer pair 
recognizes a splice variant of CG58564-01 designated CG58564-04. Expression of this 
variant is low/undetectable (CTs > 35) across all of the samples on this panel (data not 
shown). 

General_screening_panel_vl.4 Summary: Ag3373 Highest expression of the CG56804- 
04 gene is seen in a prostate cancer cell line (CT=27). Overall, this gene is expressed at 
moderate levels in the cancer cell lines in this panel. A higher level of expression is 
observed in clusters of cell lines derived from prostate, brain, melanoma, colon, lung, 
breast and ovarian cancer when compared to expression in normal prostate, brain, colon, 
lung, breast and ovary. Thus, this gene could potentially be used as a diagnostic marker of 
cancer in these tissues. Furthermore, inhibition of the activity of this gene product using 
small molecule drugs may be effective in the treatment of cancer in these tissues. 

Among tissues with metabolic function, this gene product has moderate levels of 
expression in adipose, heart, skeletal muscle, adrenal, pituitary, thyroid and pancreas. Thus, 
this gene product may be a small molecule target for the treatment of endocrine and 
metabolic diseases, including obesity and Types 1 and 2 diabetes. 

In addition, this gene appears to be differentially expressed in fetal (CT value = 29) 
vs adult liver (CT value =33) and may be useful for differentiation between the two sources 
of this tissue. 

This gene is also expressed at moderate levels in all central nervous system samples 
present on this panel. Please see Panel 1.3D for discussion of utility of this gene in the 
central nervous system. 
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General_screening_panel_vl.5 Summary: Ag5844 - This primer pair recognizes a 
splice variant of CG58564-01. Expression of this variant is low/undetectable (CTs > 35) 
across all of the samples on this panel (data not shown). 

Panel 1.3D Summary: Ag3023 The CG56804-04 gene is ubiquitously expressed among 
5 the samples on this panel, with highest expression in an ovarian cancer cell line (CT=28.8). 
Overall, the expression of this gene shows good agreement with panel 1 .4. A higher level 
of expression is observed in prostate, brain, melanoma, colon, lung, pancreatic, breast and 
ovarian cancer cell lines than the normal prostate, brain, colon, lung, pancreas, breast and 
ovary. Thus, expression of this gene could be used as a diagnostic marker of cancer in these 
10 tissues. Furthermore, inhibition of the activity of this gene product using small molecule 
drugs may be effective in the treatment of cancer in these tissues. 

Among tissues with metabolic function, expression of this gene is widespread, as in 
the previous panel. Please see Panel 1 .4 for discussion of utility of this gene in metabolic 
disease. 

1 5 This gene represents a dual specificity phosphatase that is also expressed at low to 

moderate levels across the CNS. Dual-specificity phosphatases comprise a family of MAP 
kinase regulating enzymes, members of which are upregulated in brains subjected to insults 
such as ischemia and seizure activity. MAP kinases are known to regulate neurotrophic and 
neurotoxic pathways. Consequently, agents that modulate the activity of this gene may 

20 have utility in attenuating the apoptotic and neurodegenerative processes following brain 
insults. 

Panel 4.1D Summary: Ag5844 - This primer pair recognizes a splice variant of 
CG58564-01. Expression of this variant is low/undetectable (CTs > 35) across all of the 
samples on this panel (data not shown). 

25 Panel 4D Summary: Ag3023/Ag3373 The CG56804-04 gene is expressed at high to 
moderate levels in a wide range of cell types and tissues of significance in the immune 
response in health and disease. Highest expression of this gene is seen in ionomycin treated 
Ramos B cells (CT=26.83). Therefore, targeting of ghis gene product with a small 
molecule drug or antibody therapeutic may modulate the functions of cells of the immune 

30 system as well as resident tissue cells and lead to improvement of the symptoms of patients 
suffering from autoimmune and inflammatory diseases such as asthma, allergies, 
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inflammatory bowel disease, lupus erythematosus, and arthritis, including osteoarthritis and 
rheumatoid arthritis. 

O. CGS7819-01: RPGR-INTERACTING PROTEIN-1 

Expression of gene CG578 19-01 was assessed using the primer-probe set Ag3338, 
described in Table OA. Results of the RTQ-PCR runs are shown in Tables OB and OC. 



Table OA . Probe Name Ag3338 



Primers Sequences 


Length 


Start 
Position 


SEQID 
NO: 


Forward 5' - cccattcagcactgaaacag-3 ' 


20 


3021 


426 


r> JTET-5 1 -tcctgtaaatgacaaagaatcctctgaaca-3 ' - 
i torje TAI4RA 


30 


3055 


427 


Reverse J5 ' -tgcttcactgacttcagaacct-3 ' 


22 


3085 


428 



Table OB . General_screening_panel_vl.4 



- " - ■ 

Tissue Name 


Rel. E.\p.(%) 
Ag3338, Run 
215773746 


" - ' 

Tissue Name 


Rel.Exp.(%) 
Ag3338, Run 1 
215773746 


Adipose 


1.1 


R co j lea IK- 10 




Melanoma* 
Hs688(A).T 


0.0 


Bladder 


i.i j 


Melanoma* 
Hs688(B).T 


0.0 


Gastric ca. (liver met.) 
NCI-N87 


0.0 


Melanoma* M ] 4 


0.0 


Gastric ca. KA l< > III 


0.0 


Melanoma* 
LOXIMVI 


0.0 


Colon ca. SW-948 


0.0 

i 


Melanoma* SK- 
MEL-5 


0.2 


Colon ca. S W480 


0.4 


Squamous cell 
carcinoma SCC-4 


0.0 


Colon ca.* (SW480 
>nai SW620 


0.0 


Testis Pool 


100.0 


Colon ca. HT29 


0.5 


Prostate ca.* (bone 
met) PC-3 


0.0 


Colon ca. HCT-116 


0.2 


Prostate Pool 


1.0 


Colon ca. CaCo-2 


1.0 


Placenta 


0.0 


Colon cancer tissue 


0.9 ! 


Uterus Pool 


0.0 


Colon ca. SWI116 


" 0.2 | 


Ovarian ca. 
OVCAR-3 


0.9 


Colon ca. Colo-205 


0.2 


Ovarian ca. SK- 
OV-3 


0.0 




Colon ca. SW-48 


0.0 
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Ovarian ca. 
OVCAR-4 


1.2 


Colon Pool 


0.5 


Ovarian ca. 
OVCAR-5 


3.5 


Small Intestine Pool 


0.3 


Ovarian ca. 
IGROV-1 


0.0 


Stomach Pool 


0.2 


Ovarian ca. 
OVCAR-8 


0.0 


Bone Marrow Pool 


0.0 


Ovary 


0.9 


Fetal Heart 


0.8 


■ *■ — i l ..i MCI -7 


1.9 


Heart Pool 


I.I 


Breast ca. MDA- 
MB-231 


1.2 


Lymph Node Pool 


1.4 


Breast ca. BT 549 


0.2 


Fetal Skeletal Muscle 


0.2 


Breast lu. F47D 


6.7 


Skeletal Muscle Pool 


0.0 


DI Cool ta, iV1JL^/\-1N 


0 0 


opieen rooi 




Breast Pool 


0.5 


Thymus Pool 


0.0 


Trachea 


0.9 


CNS cancer 
(glio/astro) l 87-MG 


0.0 


Lung 


0.2 


CNS cancer 
(glio/astro) U-118-MG 


0.0 


Fetal Lung 


0.4 


CNS cancer 
(neuro;met) SK-N-AS 


0.0 


Lung ca. NCI-N417 


0.0 


CNS cancer (astro) SF- 

pff _ 


0.0 


Lung ca. LX-1 


0.8 


CNS cancer (astro) 
SNB-75 


0.0 


Lung ca. NCI-H146 




0.5 


CNS cancer (glio) 
SNB-19 


0.0 


Lung ca. SHP-77 


0.1 


CNS cancer (elio) si - 
295 


0.2 


Lung ca. A 549 


1.5 


Br?an (Amygdala) Pool 


0.7 


Lung ca. NCI-H526 


0.0 


Brain (cerebellum) 


0.6 


Lungca.NCI-H23 


1.5 


Brain ( fetal) 


0.9 


Lung ca. NCI-H460 


0.0 


Brain (Hippocampus) 
Pool 


0.7 


Lungca. HOP-62 


3.0 


Cerebral Cortex Pool 


0.2 


Lun° ca. NCI-H522 


0.0 


Brain (Substantia 
nigra) Pool 


0.7 


Liver 


°- 4 


Brain (Thalamus) Pool 


1.3 


Fetjl l.i* or 


0.5 


Brain (whole) 


0.0 


I i\cr c; . HepG2 


0.2 


Spinal < \.rJ Pool 


0.9 


Kidney Pool 


"6~9~"" 


Adrenal Gland 


0.0 


Fetal Kidney 


0.6 


Pituitar\ gland Pool 


0.0 


Renal ca. 786-0 


o.o 


Sal i van Gland 


0.0 
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[Renal ca. A498 | 

— -- - 


0.0 


Thyroid (female) 


0.0 


Renal ca. ACHN } 


0.0 


Pancreatic ca. f 
CAPAN2 ' 


3.4 


Renal ca. UO-31 


0.0 


Pancreas Pool ■ 


0.8 


Table OC. Panel 4D 


1 Tissue Name 


Rel. Exp.(%) 
Ag3338, Run 
165221737 


Tissue Name 


Rel. F.\p.(%) 
Ag3338, Run 
165221737 


Secondary Thl act 


0.0 


HUVEC IL-lbeta" 


0.0 


Secondarv Th2 act 


0.0 


HUVEC IFN gamma 


6.9 


Secondary Trl act 


0.0 


HUVEC TNF alpha + 
IFN gamma 


0.0 


Secondary Thl rest 


0.0 


III A EC 1 NF alpha i 
IL4 


0.0 


beconJan 1 h2 rest 


0.0 


HUVEC IL-1 1 


0.0 


Secondary Trl rest 


0.0 


Lung Microvascular EC 
none 


2.6 

1 


jPrimary Thl act 


0.0 


Lung Nlicrovascular EC 
rNFalpha i IL-lbeta 


0.0 


1 

iPrimary Th2 act 


0.0 


Microvascular Dermal 
EC none 


1.9 


{Primary Trl act 


0.0 


Microsvasular Dermal 
EC rNFalpha - II -Ibeta 


0.0 


'Primary Thl rest 


0.0 


Bronchial epithelium 
1"NF alpha - IL ibeta 


0.0 


Primary Th2 rest 


h 0.0 


Small airway epithelium 
none 


0.0 


Primary r Yf \ rest 


0.0 


Small airway epithelium 
TNFalpha + IL-lbeta 




CD45RA ( 1)4 
!> mphocyte act 


0.0 


Coronery artery SMC rest; 


0.0 


(TM5ROCD4 
1} m ph. K:\fe act 


0.0 


Coronery artery SMC 
TNFalpha + TL- lbeta 


0.0 


CDS lymphocyte act 


4.0 


Astroc) tes rest 




Secondi rv < 1 >S 
lymphocyte rest 


0.0 


Astrocytes 1 NFalpha + 
IL-lbeta 


0.0 


Secondary CDS 
lymphocyte act 


0.0 


KU-812 (Basophil) rest 


0.0 


CD4 lymphocyte none 


Q0 KL-8 12 (Basophil) 
jPMA/ionomycin 


0.0 


2ryThl/Th2/Trl anti- 
CD95 CH11 


0Q JCCD1106 

j(Keratinocytes) none 


0.0 


l-AK cells rest 


. , 46 


CCD 1 106 


0.0 
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(Keratinocytes) 
rNFalpha i IL-lbeta 




jLAK cells 11,-2 


0.0 


Liver cirrhosis 


0.0 


jLAK cells IL-2+IL-12 


0.0 


Lupus kidney 


2.4 


jLAK cells IL-2+IFN 
Igamma 


0.0 


NCI-H292 none 


0.0 


LAK cells IL-2+IL-18 


0.0 


NCI-H292 IL-4 


4.5 


LAK cells 
PMA/ionomycin 


3., 


NCI-H292 IL-9 


0.0 


NK Cells IL-2rest 


0.0 JNCI-H292 IL-13 


0.0 


Two Way \ll K , da> 


0.0 


NCI-H292 IFN gamma 


0.0 


jTwo Way MLR 5 day 


0.0 


HPAEC none 


0.0 


Two Way MLR 7 day 


0.0 


HPAEC TNF alpha + IL- 
1 beta 


0.0 


PBMC rest 


14.0 


Lung libroblast none 


(1.0 


PBMC PWM 


0.0 


I, ung fibroblast TNF 
alpha + 1L-1 beta 


0.0 


I>|?\IC PHA-L 


0.0 


Lung fibroblast IL-4 


0 0 


Ramos > B cell) none 


3.0 


! ung fibroblast II -9 


... 


Ramos (B cell) 
lionomycin 


0.0 


Lung fibroblast IL-13 


0.0 


B lymphocytes PWM 


0.0 


Lung fibroblast IFN 
gamma 


0.0 


B lymphocytes CD40L 
and IL-4 


0.0 


Dermal fibroblast 
CCD 1 070 rest 


0.0 


EOL-1 dbcAMP 


0.0 


Dermal fibroblast 
CCD 1070 TNF alpha 


0.0 1 

— — — — — A 


EOL- 1 dbc AMP 
PM \ ionomvein 


4.7 


Dermal fibroblast 
CC D1070 II -I beta 


0.0 


Dendritic cells none 


0.0 


Dermal fibroblast IFN 
gamma 


— -™-H 

0.0 

M J 


Dendrit c cells L PS 


13.9 


Dermal fibroblast IL-4 


0.0 J 


Dendritic cells anti- 
CD40 


6.0 


IBD Colitis 2 


0 0 


Monocytes rest 


100.0 


IBD Crohn's 


. 

0.0 


Monocytes LPS 


0.0 


Colon 


15.2 ~~ ^ ! 


Macrophages test 


1.3 


Lung 


4.0 


Macrophages 1 PS 


0.0 


Thymus 


0.0 


I II IVEC none 


no 


Kidney 


3.1 


HU\K' carved 


0.0 





CNSneurodegenerationvl.O Summary: Ag3338 - Expression of this gene is 
low/undetectable (CTs > 35) across all of the samples on this panel (data not shown). 
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General_screening_panel_vl.4 Summary: Ag3338 - Expression of this gene is highest 
in testis (CT=29.4). Therefore, expression of this gene could be used to distinguish this 
sample from others on the panel. 

There is also low expression in pancreatic cancer cell line CAPAN2, lung cancer 
cell line HOP-62, breast cancer cell line T47D, and ovarian cancer cell line OVCAR-5. 
Thus, expression of this gene could be used to differentiate these samples from other 
samples on this panel. 

Panel 4D Summary: Ag3338 - Significant expression of this gene is seen only in resting 
monocytes (CT=32.3) Therefore, expression of this gene can be used to differentiate 
between this sample and others on this panel. 

P. CG57789-01 and CG57789-02: RAS-LIKE PROTEIN RRP22-like 

Expression of gene CG57789-01 and variant CG57789-02 was assessed using the 
primer-probe set Ag3333, described in Table PA. Results of the RTQ-PCR runs are shown 
in Tables PB, PC and PD. 



Table PA . Probe Name Ag3333 



Primers 


Sequences 


Length 


Start 
Position 


SEQID 
NO: 


Forward 


5 ' -tcgactttccacccatcag- 3 ' 


19 


181 


429 


Probe 


TET-5 ' -cttccctgtcaatacgctccaggagt-3 ' - 
TAMRA 


26 




203 


430 


Reverse 


5 1 -aggatgtaggcgtggacact-3 ' 


20 


258 


431 



Table PB . CNS_neurodegeneration_vl.O 



Tissue Name 


Rel. Exp.(%) Ag3333, 
Run 210146459 


Tissue Name 


Rel. Exp.(%) Ag3333, 
Run 210146459 


AD 1 Hippo 


22.2 


Control (Path) 3 
Temporal Ctx 


7.5 


AD 2 Hippo 


18.8 


Control (Path) 4 
Temporal Ctx 


21.6 


AD 3 Hippo 


17.9 


AD 1 Occipital Ctx 


29.7 


AD 4 Hippo 


8.7 


AD 2 Occipital Ctx 
(Missing) 


0.0 


AD 5 Hippo 


100.0 


AD 3 Occipital Ctx 




AD 6 Hippo 


42.9 


AD 4 Occipital Ctx 


24.7 
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25.9 


A P\ ^ CV-r»tt-\i+ci1 rvv 

t\u j uccipiuu i^ix 


90 1 


Control 4 1 [ippo 


12.1 


\D 6 Occipital Ctx 


1 6.3 


Control (Path) 3 
Hippo 


13.4 


Control 1 Occipital 
Ctx 


4.2 


AD 1 Temporal 
Ctx 


21.3 


Control 2 Occipital 
Ctx 


74.7 


AD 2 Temporal 
Ctx 


29.1 


Control 3 Occipital 
Ctx 


14.5 


AD 3 Temporal 
Ctx 


13.3 


Control 4 Occipital 
Ctx 


4.5 


AD 4 Temporal 
Ctx 


15.8 


Control (Path) 1 
Occipital Ctx 


47.3 


AD 5 Inf Temporal 
Ctx 


92.0 


Control (Path) 2 
Occipital Ctx 


,3.5 


AD 5 Sup 
Temporal Ctx 


43.2 


Control (Path) 3 
Occipital Ctx 


4.1 


AD 6 Inf Temporal 
Ctx 


26.4 


Control (Path) 4 
Occipital Ctx 


— „ 

14.6 


AD 6 Sup 
Tempoi al Ctx 


31.6 


Control 1 Parietal 
Ctx 


7.6 


Control 1 
Temporal Ctx 


5.8 


Control 2 Parietal 
Ctx 


39.2 


Control 2 
Temporal Ctx 


51.8 


Control 3 Parietal 
Ctx 


21.9 


Control 3 
Temporal Ctx 


14.5 


Control (Path) 1 
Parietal Ctx 


56.3 


Control 3 
Temporal Ctx 


R . JControl (Path) 2 
jParietalCtx 


20.2 


Control (Path) 1 
Temporal Ctx 


39.2 


Control (Path) 3 
Parietal Ctx 


6.2 


Control (Path) 2 
Temporal Ctx 


40.9 


Control (Path) 4 
Parietal Ctx 


24.5 



Table P C. General_screening_panel_vl.4 



Tissue Name 


Rel. Exp.(%) 
Ag3333, Run 
216516940 


Tissue Name 


Rel. Exp.(%) 
Ag3333, Run 
216516940 


Adipose 


4.4 


Renal ca. TK-10 


40.1 


Melanoma* 
Hs688(A).T 


0.9 


Bladder 


» ! 


Melanoma* 
Hs688(B)T 


1.8 


Gastric ca. (liver met.) 
NCI-N87 


4.5 


Mclano im VIM 


„ - 7 „ 


Gastric ca KATO III 


20.0 


Melanoma* 


0.3 


Colon ca. SW-948 


0.0 i 
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LOXIMVI 






IIMelanoma* SK- 
|MEL-5 


0.9 IColon ca. SW480 


100.0 


1 Squamous cell 
carcinoma SCC-4 


0.1 


Colon ca.* (SW480 
met) SW620 


33.0 


i estis P'ool 


2.1 


Colon ca. HT29 


5.0 


Prostate: ca.* (bone 
met) PC-3 


2.4 


Colon ca. HCT-116 


0.1 


Prostate Pool 


0.5 


Colon ca. CaCo-2 


ZZJ7-I 


Placenta 


5.7 


Colon cancer tissue 




Uterus Pool 


0.3 


Colon ca. S\\ 1116 


16.2 


Ovarian ca. 
( >\ CAR-3 


52.9 


Colon ca. Colo-205 


0.2 


Ovarian ca. SK- 

|OV-3 


0.6 IColon ca. SW-48 


0.2 


|Ovarian ca. 
loVPAR-4 


17.9 


Colon Pool 


2.2 


Ovarian ca. 
OVCAR-5 


4.5 


Small Intestine Pool 


_ „ 

1.0 


Ov&ri&n Cel. 
IGROV-l 


0.9 


Stomach Pool 


0.9 


Ovarian ca. 
( iVCAR-8 


15.4 


Bone Marrow Pool 


1.8 


Ovary 


4.2 I elal Heart 


10.9 


(Breast ca. MCF-7 


0.7 [Heart Pool 


2.8 


(Breast ca. MDA- 
jMB-231 


0.4 (Lymph Node Pool 


4.4 

1 


iBreast ca. BT 549 


42.ii Fetal Skeletal Muscle 


1.1 


'Breast ca. T47D 


1 3 .< i Skeletal Muscle Pool " 


46.7 


^Breast ca. MDA-N 


0.1 Spleen Pool 




[Breast Pool 


2.4 Thymus Pool 


2.3"""" 


Trachea 


? 4 jCNS cancer 

(glio/astro) U87-MG 


0.9 


Lung 


f. j ICNS cancer 

|(glio/astro) U-118-MG 


0.3 


Fetal Lung 

i-™- — — — — — — * 


r^, NS cancer 

j(neuro;met) SK-N-AS 


69.7 


Lung ca. NCI-N417 


, - , CNS cancer (astro) SF- 
1/1 |539 


2.2 


Lung ca. LX-1 


1.1 


CNS cancer (astro) ! 
SNB-75 


15.9 


Lung ca. NCI-H146 


14.5 


CNS cancer (glio) 
SNB-19 


0.6 


Lung ca. SHP-77 


37.6 


CNS cancer (glio) SF- 
295 


6.0 
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Lung ca. A549 [ 0.4 


Btain (Ann gdala) Pool 


28.5 


Lungca.NCI-H526j 23.5 


Brain (cerebellum) 




Lungca. NCI-H23 


8.2 


Brain (fetal) 


21.3 


Lung ca. NCI-H460 


14.3 


Brain (Hippocampus) 
Pool 


27.7 


Lung ca. HOP-62 


17 


Cerebral Cortex Pool 


36.1 


Lung ca. NCI-H522 


86.5 


Brain (Substantia 
nigra) Pool 


40.1 


Liver 


1.6 


Brain {'1 halamus) Po^l 


37.6 


Fetal Li ver 


07 


Brain (whole) 


59.5 


Liver ca. HepG2 


6.2 


Spinal ( ora Pool 


12.3 


Kidney Pool 
Fetal Kidney 


3.8 [Adrenal Gland 
7.4 ; Pituitary gland Pool 


4.7 

3.7 " 


Renal ca. 786-0 


0.2 jSalivary Gland 


48.0 


Renal ca. A 498 


20.9 (Thyroid (female) 


1.1 


Renal ca. ACHN 


8 _ jPancreatic ca. 

CAPAN2 


0.0 


Renal ca. UO-31 


3.0 fPancreas Pool 


4.0 



Table PP . Panel 4D 



Tissue Name 


Rel. Exp.(%) 
Ag3333, Run 
165084139 


j Tissue Name 


Rel. Exp.(%) 
Ag3333, Run 
165084139 


Secondary Thl act 


0.8 


HUVEC II -Ibeta 


0.0 


Secondary Th2 act 


3.0 


JHUVEC IFN gamma 


0.5 


Secondary Trl act 


0.6 


[HUVEC TNF alpha + 
IFN gamma 


0.8 


Secondary Thl rest 


0.0 


IHUVEC TNF alpha + 
(lL4 


0.0 


Secondary Th2 rest 


0.5 


HUVEC 11-1 1 


0.3 


Secondary Trl rest 


0.0 


Lung Microvascular EC 
jnone 


0.6 


Primary Thl act 


5.7 


jLung Microvascular EC 
iTNFalpha + IL- lbeta 


0.4 


Primary Th2 act 


9.8 


(Microvascular Dermal 
|EC none 


0.0 


Primary Trl act 


3.8 


(Microsvasular Dermal 
JEC TNFalpha + IL- lbeta 


0.4 


Primary Thl rest 


0.0 


(Bronchial epithelium 
jTNFalpha + ILlbeta 


1.1 


Primary Th2 rest 


0.4 


•Small airway epithelium 
jnone 


1.9 


Primary Trl rest 


0.6 


jSmall airway epithelium 


1.4 
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|TNFalpha + IL-lbeta 


|CD45RA CD4 
1 lymphocyte set 


4.1 jCoroneiy artery SMC rest 


1.7 


Ed45RO CD4 
1 lymphocyte act 


1 „ Coronery artery SMC 
ITNFalpha + IL-lbeta 


1.2 


|CD8 lymphocyte act 


1 .4 Astrocytes rest 


100.0 


jSecondary CD8 
j lymphocyte rest 


7 . lAstrocytes TNF alpha + 
jlL-lbeta 


59.9 


[Secondary CD8 
j lymphocyte act 


0.0 IKU-8 1 2 (Basophil) rest 


2.0 


CD4 lymphocyte none 


0 8 KU-812 (Basophil) 
|PMA/ionomycin 


4.1 


2ryThl/Th2/Trl_anti- 
CD95 CHI 1 


Q5 1CCD1106 

^J(Keratinocytes) none 


12.5 


LAK cells rest 


CCD 1106 
U .5 (Keratinocytes) 

(TNFalpha + IL-lbeta 


6.2 


jLAK cells IL-2 


0.3 iLiver cirrhosis 


0 ( ) 


[LAK ceils IL-2 HI .- 12 


0.5 iLupus kidney 


3.9 


[LAK cells IL-2+IFN 
[gamma 


0.0 [NCI-H292 none 


29.3 


ILAK cells IL-2+ IL-18 


0.6 ! NCT-H292 11,-4 


39.5 


LAKcells 
f'MA ionom\cin 


0.3 INCI-H292 IL-9 


23.3 


NK Cel ls IL-2 rest 


0.0 NCI-II292 IL-13 


21 9 


Two W xy MLR ?, dav 


0.8 NCI-J 1292 IFN gamma 


14 5 


Two Way MLR 5 day 


0.9 HPAEC none 


0.5 


Two Way MLR 7 day 


^ __ ^ ^ 
1 beta 


0.0 


PBMC rest 


0.0 Lung fibroblast none 


4.5 


PBMC PWM 


8.1 


Lung fibroblast TNF 
alpha + IL-1 beta 


2.2 


PBMC PHA-L 


11.6 


1 uisg fibroblast IL-4 


12.9 


Ramos (B cell) none 


0.0 


I ung fibroblast IL-9 


9.2 


Ramos (B cell) 
ionomycin 


0.0 


Lung fibroblast IL-13 


8.5 


B lymphocytes PWM 


154 jLung fibroblast IFN 
j gamma 


8.4 


B lymphocytes CD40L 
and IL-4 


? . Dermal fibroblast 
!CCD1070rest 


40.6 


EOL-1 dbcAMP 


n 0 .Dermal fibroblast | 
CCD 1070 TNF alpha J | 


EOL-1 dbcAMP 
PMA/ionomycin 


0 0 Dermal fibroblast ! 

|CCD1070 IL-1 beta j 19-3 


Dendritic cells none 


0.0 Dermal fibroblast IFN 1.8 







gamma 




Dendritic cells LPS 


0.5 


Dermal fibroblast IL-4 


3.8 


Dendritic cells anti- 
CD40 


0.0 jlBD Colitis 2 


0.0 


Monocytes rest 


0.0 JIBD Crohn's 


2.5 


Monocytes LPS 


0.0 ;Colon 


4.2 


Macrophages rest 


0.0 Lung 


9.1 


Macrophages LPS 


0.0 (Thymus 


11.3 ! 


HUVEC none 


0.4 Kidne\ 


2.6 


HUVEC starved 


0.6 | 





CNS_neurodegeneration_vl.O Summary: This panel confirms the expression of this 
gene in the brain in an independent group of individuals. However, no differential 
expression of this gene was detected between Alzheimer's diseased postmortem brains and 
those of non-demented controls in this experiment. Please see Panel 1 .4 for a discussion of 
the potential utility of this gene in treatment of central nervous system disorders. 



General_screening_panel_vl.4 Summary: Ag3333 This gene is expressed at moderate 
to low levels in many of the samples on this panel, with the highest expression in colon 
cancer cell line SW480 (CT=27.8). Expression is significantly lower in SW680, a cell line 
derived from a metastasis of the primary tumor represented by SW480. Thus, expression of 
this gene could be used to differentiate between these two cell lines and potentially 
between primary colon cancer and its metastases. 

Based on expression in this panel, this gene may be involved in gastric, brain, 
colon, renal, lung, breast, ovarian and prostate cancer as well as melanomas. Thus, 
expression of this gene could be used as a diagnostic marker for the presence of these 
cancers. Furthermore, therapeutic inhibition using antibodies or small molecule drugs 
might be of use in the treatment of these cancers. 

This gene product is also expressed in adipose, pancreas, adrenal, thyroid, pituitary, 
skeletal muscle, heart, and liver. This widespread expression in tissues with metabolic 
function suggests that this gene product may be important for the pathogenesis, diagnosis, 
and/or treatment of metabolic and endocrine diseases, including obesity and Types 1 and 2 
diabetes 

This gene is expressed at low levels throughout the CNS, including in amygdala, 
substantia nigra, thalamus, cerebellum, cerebral cortex, and spinal cord. Therefore, this 
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gene may play a role in central nervous system disorders such as Alzheimer's disease, 
Parkinson's disease, epilepsy, multiple sclerosis, schizophrenia and depression. 

Panel 4D Summary: Ag3333 The CG57789-01 gene is expressed at moderate to 
low levels in several samples on this panel, with the highest expression in resting astrocytes 
(CT=28.4). Moderate expression of this gene is seen in treated and untreated dermal and 
lung fibroblasts and the airway epithelial tumor line NCI-H292 cells. Thus, the transcript or 
the protein it encodes may be involved in pathological and inflammatory skin and lung 
conditions, including psoriasis, asthma, allergy, emphysema, and COPD. 

Q. CG57758-01 and CG57758-02: SODIUM/LITHIUM-DEPENDENT 
DICARBOXYLATE TRANSPORTER 

Expression of gene CG57758-01, a splice variant of CG57758-02, and CG57758-02 
was assessed using the primer-probe sets Ag3326 and Ag3692, described in Tables QA and 
QB. Results of the RTQ-PCR runs are shown in Tables QC, QD, QE and QF. 



Table O A. Probe Name Ag3326 



Primers 


Sequences 


Length 


Start 
Position 


seqid] 

_ NO: ! 


Forward 


5 ' -ccatttactggtgcacagaagt-3 ' 


?J 


149 


432 • 


Probe 


TET- 5 1 -atc:ctctggctgtcacctctctcat-3 ' - 
TAMRA 


26 


172 


433 "1 


Reverse 


5 ' -ggagtccagaatctggaagagt-3 1 


22 


216 


434 I 


Table OB. Probe Name Ae3692 


Primers 


Sequences 


Length; 


Start 
Position 


SEQ ID [ 

NO: 1 


Forward 

■ — < 


5 1 -ccatttactggtgcacagaagt-3 ' 


22 


149 




Probe 


TET-5'-atc :tctggctgtca tctctcat 
TAMRA 


26 | 172 


436 | 


Reverse 


5 ' -ggagtccagaatctggaagagt-3 ' 


22 


216 


437 



Table Q C. CNS_neurodegeneration_vl.O 
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l Tissue 
I Name 


ExpJ/o) 
Ag3326, 
Run 
210144197 


Exp.(%) 
Ag3692, 
Run 
211145262 


Exp.(%) 
Ag3692, 
Run 
224337942 


Tissue 
Name 


Rel. 
Exp.(%) 
Ag3326, 
Run 
210144197 


Rel. 
Exp.(%) 
Ag3692, 
Run 
211145262 


Rel. 

Exp.(%) 
Ag3692, 
Run 
224337942 


|AD 1 
(Hippo 


2.1 


4.3 


1.0 


Control 
(Path) 3 
Tempora 
ICtx 


8.5 


15.3 


12.0 


•AD 2 
jHippo 


20.9 


28.3 


25.0 


Control 
(Path) 4 
Tempora 
1 Ctx 


31.2 


36.6 


52.1 


AD 3 
jHippo 

jAD4 


0.0 


0.9 


0.6 


AD 1 

Occipital 

Ctx 


2.7 


3.0 




0.0 


2.1 


7.1 




2.6 


AD 2 

Occipital 

Ctx 

(Missing 
> 


0.0 


0.0 


0.0 


(AD 5 
(hippo 


72.7 


97.9 


85.3 


AD 3 

Occipital 

Ctx 


1.5 


r ^ 

7.2 


1.3 


\D 6 

jHippo 


13.7 


18.3 


5.5 [ 


AD 4 

Occipital 

Ctx 


71.7 


35.6 


30.6 


Control 2 


14.5 


20.2 


15.2 


AD 5 

Occipital 

Ctx 


25.3 


31.9 


12.4 


(Control 4 
(Hippo 


11.7 


7.4 


5, 


AD 6 
Occipital 

Ctx 


17.2 ! 


19.1 


11.2 


Control 
j(Path) 3 
jHippo 

t 


6.7 


4.4 


(Control 
1 

4 5 

Occipital 
Ctx 


7.0 


9.0 


8.1 


AD 1 
jTemporal 

(Ctx 


4.0 


1.7 


(Control 

(Occipital 
[Ctx 


33.2 


44.8 


26.1 


AD 2 

Temporal 

Ctx 


80.7 


50.7 


37.4 


Control 
3 

Occipital 
Ctx 


30.1 


37.6 


21.9 


AD 3 

ITemporal 
Ctx 


3 () 


0.0 


1.1 


Control 
4 

Occipital 


l(» 3 


12.6 


8.2 
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f 








Ctx 








•AD 4 

jTemporal 

Ctx 


19.5 


30.6 


15.2 


Control 
(Path) 1 
Occipital 
Ctx 


42.0 




55.9 


52.9 


AD 5 Inf 
Temporal 
Ctx 


100.0 


100.0 


99.3 


Control 
(Path) 2 
Occipital 
Ctx 


6.7 


13.0 


7.7 


AD 5 
SupTemp 
oral Ctx 


32.8 


29.1 


33.2 


Control 
(Path) 3 
Occipital 
Ctx 


8.7 


6.6 


5.4 


[AD 6 Inf 
ITemporal 
Ctx 


27.7 


21.3 


jControl 

26.6 H-> 4 , 
lOccipital 

Ctx 


8.1 


9.0 


7.4 


AD 6 Sup 
(Temporal 
Ctx 


41.8 


53.6 


jControl 

17.0 I . , , 
Parietal 

Jctx 


21.2 


23.0 


15.3 


jControl 1 
iTemporal 
Ctx 


12.0 


33.9 


18.3 


Control 
2 

Parietal 
Ctx 


48.6 


38.2 


22.1 


|Control 2 
ITemporal 
Ctx 


30.1 


49.3 


44.4 j 


Control 
3 

Parietal 
Ctx 


28.3 


34.4 


32.8 


Icontrol 3 
ITemporal 
Ctx 


38.7 


39.5 


Control 

33 4 (Path) 1 
Parietal 

(Ctx 


78.5 


97.3 


100.0 

1 


Control 4 
JTemporal 
Ctx 


17.6 


25.2 


sControl 
241 (P^h) 2 
jParietal 

(Ctx 


50.7 


50.7 


37.9 


Control 
(Path) 1 
Temporal 
Ctx 


69.7 


70.7 


JControl 
497 (Path) 3 
Parietal 
jCtx 


10.7 


10.1 

j 


9.6 


Control 
(Path) 2 
Temporal 
Ctx 


35.4 

1 


50.7 


Control 
33 4 (Path) 4 
Parietal 

(Ctx 


30.6 ! 




24.5 


40.9 



Table QD . General_screening_panel_vl.4 



Tissue Name 


Rel. Exp.(%) 
Ag3326, Run 
215678613 


Rel. Exp.(%) 
Ag3692, Run 
?171^11Q1 

AX f 1J1171 


Tissue Name 


Rel. Exp.(%) 
Ag3326, Run 


Rel. Exp.(%) 
Ag3692, Run 
inni ioi 


Adipose 


0.0 


0.0 


Renal ca. TK- 10 


11.4 


12.0 


Melanoma* 
Hs688(A).T 


0.0 


0.0 


Bladder 


0.0 


0.1 


Melanoma* 
Hs688(B).T 


0, 


0.0 


Gastric ca. (liver 
met ) N< 1-N87 


0.0 


0.0 


Melanoma* 
M14 


0.0 


0.0 


Gastric ca. 
KATO III 


0.0 


0.0 


Melanoma* 
F OXTMVI 


0.0 


0.0 


Colon ca. SW- 
948 


0.0 


0.0 


Melanoma* 
SK-MEL-5 


0.0 


0.0 


Colon ca. 
SW480 


0.0 


0.0 


Squamous 
cell 

carcinoma 
SCC-4 


0.9 

— ----- — 


0.7 


Colon ca.* 
(SW480 met) 
SW620 


0.0 


0.0 


1 estis Pool 





0.2 


Colon ca. HT29 


0.0 


0.0 


Prostate ca.* 
(bone met) 
PC-3 


0.0 

- 


0.0 


Colon ca. HCT- 
116 


0.0 


0.0 


Prostate Pool 


0.0 


0.0 


Colon ca. CaCo- 
2 


0.0 


0.0 


Placenta 


0.0 


0.0 


Colon cancer 
tissue 


0.1 


0.0 


Uterus Pool 


0.0 


0.0 


Colon ca. 
SW1116 


0.0 


0.0 


Ovarian ca. 
OVCAR-3 


0.0 


0.0 


Colon ca. Colo- 
205 


0.0 


0.0 


Ovarian ca. 
SK-OV-3 


0.0 


0.0 


Colon ca. SW-48 


0.0 


0.0 


Ovarian ca. 
OVCAtl-4 


0.1 


0.0 


Colon Pool 


0.6 


0.0 

— — - — J 


Ovarian ca. 
OVCAR-5 


0.0 


0.0 


Small Intestine 


0-1 


0.0 1 


Ovarian ca. 
I(iR()\-l 


0.0 


°-° 


Stomach Pool 


0.0 


00 \ 


Ovarian ca. 
OVCAR-8 
( )var\ 
Breast ca. 
MCF-7 


2.8 
0.7 

_____ 
0.0 


2.2 
0.6 
0.0 


_ 

Bone Marrow 
Pool 

Fetal Heart 
Heart Pool 


0.0 
0.0 
0.0 


0.1 I 

0.0 J 

0.0 


Breast ca. 

MDA-MB- 

231 


0.0 


0.0 


Lymph Node 
Pool 


0.1 


0.0 
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Breast ca. BT 
549 


0.6 


0.8 


Fetal Skeletal 
Muscle 


0.0 


0.0 


Breast ca. 
T47D 


0.0 


0.0 


Skeletal Muscle 
Pool 


0.0 


0.0 


Breast ca. 
MDA.-N 


0.0 


0.0 


Spleen Pool 


0.4 


0.2 


Breast Pool 




0.0 


0.1 


Thymus Pool 


0.0 


0.0 


Trachea 


0.2 


0.1 


CNS cancer 
(glio/astro) U87- 
MG 


0.0 


0.0 


Lung 


0.0 


jCNS cancer 
0.0 (glio/astro) U- 
118-MG 


0.0 


0.0 


Fetal Lung 


0.2 


jCNS cancer 
0.1 |(neuro;met) SK- 

Jn-as 


0.0 


0.0 


Lung ca. 
NCI-N417 


0.0 

— — — 


n () 1CNS cancer 
((astro) SF-539 


0.0 


0.0 


Lung ca. LX- 
] 


0.0 


„ n jCNS cancer 

[(astro) SNB-75 


0.0 


0.0 


Lung ca. 
NCI-H146 


0.0 


n „ 1CNS cancer 
(glio) SNB-19 


0.0 


0.0 


Lung ca. 
SHP-77 


0.0 


„ „ CNS cancer 
|(glio) SF-295 


0.1 


0.1 


i ung ca. 

A 5 49 


0.0 


o i l Brain 

(Amygdala) Pool 


0.4 


0.4 


Lung ca. 
NCI-H526 


2.0 


oo Brain 

(cerebellum) 


1.4 


1.0 


Lung ca. 
NCI-1 123 


0.7 


0.6 [Brain (fetal) 


0.7 


0.4 


Lung ca. 
NCI-H460 


0.0 


IBrain 

0.0 (Hippocampus) 
IPool 


0.5 


0.7 


Lung ca. 
HOP-6" 




0.1 


0.2 


Cerebral Cortex 
Pool 


1.4 


1.5 


Lung ca . 
NCI-H522 


" - 1 

0.0 


0.0 


Brain (Substantia 
nigra) Pool 


1.4 


1.4 


Liver 


28.7 


24.1 


Brain 

{ ! li.ihiuius) Poi ! 


1.1 


0.9 


i efal Liver 


100.0 


100.0 


Brain (whole) 




........ „. . , 


Liver ca. 
HepG2 


29.5 


26.2 


Spinal Cord Pool 


0.1 


0.2 


Kidney Pool 


0.0 


0.0 Adrenal Gland 


2.6 




Fetal Kidnev 




„ . Pituitarv aland 
UJ Pool 


0.0 




Renal ca. 


0.0 


0.0 Salivary Gland 


40.9 


35 : ! 1 



!786-0 












! Rena! ca. 
(A498 
■Renal ca. 
JACHN 


0.0 
0.0 


0.0 
0.0 


Thyroid (female) 

Pancreatic ca. 
CAPAN2 


0.0 
0.5 


0.0 
0.8 


iRenal ca. 
IUO-31 


0.0 


0.0 


Pancreas Pool 


0.0 


0.0 



Table O E. Panel 4. ID 



] Rel. Exp.(%) 
Tissue Name j Ag3692, Run 
169987356 


Tissue Name 


Rel. Exp.(%) 
Ag3692, Run 
169987356 


Secondary Thl act 0.0 


HUVEC II -lbeta 


0.0 


Secondary Th2 act 


0.0 


HUVEC IFN gamma 


0.0 


Secondary Trl act 


0.0 


HUVEC TNF alpha + 
IFN gamma 


0.0 


Secondary Thl rest 


0.0 


HUVEC TNF alpha + 
IL4 


0.0 


Secondary Th2 rest 


0.0 


HUVEC IL-11 


o'o 


Secondary Trl rest 


q q Tung Microvascular EC 
|none 


0.0 


Primary Thl act 


„ „ Lung Microvascular EC 
TNFalpha I 11,-1 beta 


O.o 


Primary Th2 act 


q q iMicrovascular Dermal 
EC none 


11.3 


Primary Trl act 


. _ (Microsvasular Dermal 

L( TOFalpha + IL- lbeta 


00 
u.u 


Primary Thl rest 


„ 'Broii'. hij 1 epithelium 
{TNFalpha + TL1 beta 




Primary Th2 rest 


q q | Small airway epithelium 
Inone 




Primary Trl rest 


„ n (Small airway epithelium 
TNFalpha HL-lbeta 


0.0 


CD45RA CD4 
lymphocyte act 


3.9 Co ronery artery SMC rest 


0.0 


CD45RO CD4 
lymphocyte act 


„ n Coronery artery SMC 
T M alpha II -lbeta 


0.0 


CDS lymphocyte act 


0.0 Astrocytes rest 


0.0 


Secondary CD8 
lymphocyte rest 


0 „ (Astrocytes TNFalpha + 
IL-lbeta 


0.0 


Secondary CD8 
lymphocyte act 


0.0 


KU-812 (Basophil) rest 


3.6 


CD4 lymphocyte none 


0.0 


KU-812 (Basophil) 
PMA/ionomycin 


4.3 
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2ryThl/Th2/Trl anti- 
CD95 CHI 1 


0.0 


CCD1106 

(Keratinocytes) none 


10.7 


LAK cells rest 


0.0 


CCD 11 06 
(Keratinocytes) 
LNFalpha + IL-1 beta 


0.0 


LAK cells IL-2 


0.0 


Liver cirrhosis 


94.0 


LAK cells IL-2+IL-12 


0.0 


NCI-H292 none 


0.0 


LAK cells IL-2+IFN 
gamma 


0.0 1NCI-H292 IL-4 


0.0 


LAK cells IL-2+IL- 18 


0.0 


NCI-H292 IL-9 


0.0 


LAK cells 
PMA/ionomycin 


0.0 


NCI-H292 IL-1 3 


0.0 


NK Cells IL-2 rest 


0.0 


NCI-H292 IFN gamma 


0.0 


Two Way MLR 3 day 


I o.o 


HPAEC none 


0.0 


Two Way MLR 5 day 


3 2 iHPAEC TNF alpha + IL- 
11 beta 


0.0 


Two Way MLR 7 day 


0.0 jLung fibroblast none 


0.0 


PBMC rest 


q q (Lung fibroblast TNF 
(alpha + IL-1 beta 


0.0 


PBMC PWM 


0.0 Lung fibroblast II -4 


0 0 


t) D A AC DU A T 

rriJVlL- rhlA-JL 


0.0 Lung fibroblast IL-9 


0.0 


Ramos (B cell) none 


0.0 Lung fibroblast IL- 1 3 


0.0 


Ramos (B cell) 
ionomycin 


Q0 Lung broblastlFN 
Jgamma 


0.0 


B lymphocytes PWM 


n 0 [Dermal fibroblast 
ICCD1070 rest 


0.0 


r> lyrnpnocyLcs ^i^uju 
and IL-4 


n n iDermal fibroblast 

jCCDl 070 TNF alpha 


_ 

0.0 


EOL-1 dbcAMP 


„ „ IDermal fibroblast 
CCD1070TL-1 beta 


0.0 


EOL-1 dbcAMP 
PMA/ionomycin 


00 Dermal fibroblast IFN 
gamma 


0.0 


Dendritic cells none 


0.0 Dermal fibroblast IL-4 


0.0 


1 >endritic cells 1 PS 


0.0 Dermal Fibroblasts rest 


0.0 


Dendritic cells anti- 
CD40 


0.0 (Neutrophils TNFa+LPS 


0 0 


Monoc} tes rest 


0.0 Neutrophils rest 


0.0 


Monocytes LPS 


0.0 Colon 


0.0 


Macrophages rest 


0.0 Lung 


0.0 


Macrophages LPS 


0.0 Thymus 


2.4 


HUVEC none 


0.0 kidney 


100.0 


HUVEC starved 


.... 





Table OF . Panel 5 Islet 
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Tissue Name 


Rel.Exp.(%) : 

Ag3326, Run Tissue Name 
242385365 j 


Rel. Exp.(%) 
Ag3326, Run 
242385365 


y /4o / Jratient- 
0 'v;.' adipose 


i 

0.0 94709JDonor 2 AM - Aadipose 


0.2 


97476_Patient- 
07sk_skeletal muscle 


0.0 94710 Donor 2 AM - B_adipose 


0.0 


97477_Patient- 
()7u1 uterus 


0.0 [9471 l_Donor 2 AM - C_adipose 


0.0 


97478_Patient- 
07pljplacenta 


0.0 |94712_Donor 2 AD - A_adipose 


0.0 


99167 Bayer Patient 
1 


0.3 9471 3_Donor 2 AD - B adipose 


0.0 


97482JPatient- 
08ut utsrus 


0.0 |94714_Donor 2 AD - C_adipose 


0.0 


97483JPatient- 
08pl_placenta 


0 Q 94 /42 Donor 3 U - 

A r [e - ^ lymal Stem Cells 


0.0 


97486_Patient- 
09sk_skeletal muscle 


oo 94743 - Donor ? 1 ' 

JB Mesenchymal Stem Cells 


0.0 


97487JPatient- 
09ut uterus 


0.0 |94730_Donor 3 AM - A_adipose 


0.0 


97488_Patient- 
09pl_pl iccnta 


0.0 19473 1 JDonor 3 AM - B_adipose 


0.0 


97492 'atient- 
lOut uterus 


0.0 }94732_Donor 3 AM - C_adipose 


0.0 


97493JPatient- 
lOplplacenta 


0.0 94733_Donor 3 AD - A_adipose 


0.0 


97495. Patient- 
llgoadipose 


0.0 ]94734_Donor 3 AD - B_adipose 


0.0 


97496_Patient- 

1 1 sk_skeletal muscle 


0.0 19473 5_Donor 3 AD - C_ad ipose 


0.0 


97497JPatient- 
llututerus 


0.0 |77138_Liver_HepG2untreated 


100.0 


97498JPatient- 
11 pi placenta 


q q j73556_Heart^Cardiac stromal j ^ 
Icells (primary) 


97500_Patient- 
12go_adipose 


1 \ 
0.1 8 1 73 5_Small Intestine j 39.5 


9750i~?atient- 

1 2sk_skeletal muscle 


0% "2 l09JCidney_Proximal | 

C 01 \ .iluU 1 Hihifk i * ! 


97502j 5 at[ent- 
12ut_uterus 


0.0 P^-»f 0.0 

UntestineDuodenum 1 I 


97503J'atient- 
12pl placenta 


q q |90650_Adrenal_Adrenocortical ^ ^ 
iadenoma J 


94721 ;)onor2U- 
AMesenchymal 
Stem Cells 


0.0 J72410 Kidney HRCE 

i 


0.0 
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94722_Donor2U- 
B_Mesenchymal 
Stem Cells 



0.0 



72411_Kidney_HRE 



0.0 



94723_Donor 2 U - 
C_Mesenchymal 
Stem Cells 



0.0 



73139_Uterus_Uterine smooth 
muscle cells 



0.0 



CNS_nieurodegeneration_vl.O Summary: Ag3326/Ag3692 - Three experiments done 
with two primer pairs (same sequence) are in excellent agreement. This panel confirms the 
expression of this gene at low levels in the brain in an independent group of individuals. 
However, no differential expression of this gene was detected between Alzheimer's 
5 diseased postmortem brains and those of non-demented controls in this experiment. Please 
see Panel 1 .4 for a discussion of the potential utility of this gene in treatment of central 
nervous system disorders. 

General_screening_panel_vl.4 Summary: Ag3326/Ag3692 Two experiments with the 
smae probe and primer set produce results that are in excellent agreement. This gene is 

10 highly expressed in fetal liver (CT=26.5-27.0) and moderately expressed in adult liver 

(CT=28.5-28.8) and liver cancer cell line HepG2 (CT=28.4-28.8). This result agrees with 
the results seen in Panel 5 (expression in HepG2 (CT=29.2). These results are in agreement 
with published data that show a novel sodium dicarboxylate transporter in brain, choroid 
plexus kidney, intestine and liver. Thus, expression of this gene could be used to 

15 differentiate between these samples and other samples on this panel and as a marker for 
liver derived tissue. 

This gene is expressed at low levels throughout the CNS, including in amygdala, 
substantia nigra, thalamus, cerebellum, and cerebral cortex. Therefore, this gene may play a 
role in central nervous system disorders such as Parkinson's disease, epilepsy, multiple 
20 sclerosis, schizophrenia and depression. 

Low but significant levels of expression are also seen in the adrenal gland. Thus, 
this gene product may also be involved in metabolic disorders of this gland, including 
adrenoleukodystrophy and congenital adrenal hyperplasia. 

References: 
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1 . Pajor AM, Gangula R, Yao X. Cloning and functional characterization of a high- 
affinity Na(+)/dicarboxylate cotransporter from mouse brain. Am J Physiol Cell Physiol 
2001 May;280(5):C1215-23. 

2. Chen XZ, Shayakul C, Berger UV, Tian W, Hediger MA. Characterization of a 
rat Na+--dicarboxylate cotransporter. J Biol Chem 1998 Aug 14;273(33):20972-81. 

Panel 4.1D Summary: Ag3692 Significant expression of this gene is seen only in kidney 
and a liver cirrhosis sample (CTs=34.0). These results confirm that this gene is expressed 
in liver derived samples. The presence in the kidney is also in agreement with published 
results. Please see Panel 1.4. This gene product may be involved in maintaining or 
restoring normal function to the kidney during inflammation. 

Panel 4D Summary: Ag3326 Results from one experiment are not included. The amp 
plot ind icates that there were experimental difficulties with this run. 

Panel 5 Islet Summary: Ag3326 - The highest expression of this gene is in liver cancer 
cell line HepG2 (CT=29.2). There is also moderate expression in the small intestine 
(CT=30.5). These results compare well with previously published reports of sodium 
dicarboxylate transporter expression in mouse and rat (see discussion Panel 1 .4). 

R. CG57758-04 and CG57758-05: Sodium:sulfate symporter 

Expression of gene CG57758-04 and CG57758-05, both splice variants of 
CG577584-01, was assessed using the primer-probe sets Ag3326, Ag3692 and Ag5818, 
described in Tables RA, RB and RC. Results of the RTQ-PCR runs are shown in Tables 
RD, RE, RF, RG and RH. 



Table RA . Probe Name Ag3326 



1 

Primers 

) 




Sequences 


Length 


Start 
Position 


SEQID 
NO: 


Forwarc 


5 ' -ccatttactggtgcacagaagt-3 ' 


22 


138 


438 


Probe 


TET-5 1 -atccctctggctgtcacctctctcat-3 ' - 
TAMRA 


26 


161 


439 


Reverse 


5 ' -ggagtccagaatctggaagagt-3 ' 


22 


205 


440 



Table RB . Probe Name Ag3692 
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Primers 


Sequences 


Length 


Start 
Position 


SEQID 
NO: 


Forw arc! 


5 ' -ccatttactggtgcacagaagt-3 ' 


22 


138 1 441 


Probe 


TET-5 1 -atccctctggctgtcacctctctcat-3 1 - 
TAMRA 


20 


161 ! 442 


Reverse 


5 ' -ggagtccagaatctggaagagt-3 1 


22 


205 443 


Table RC. Probe Name As58 1 8 


Primers 


Sequences 


Length 


Start 
Position 


SEQID 
NO: 


Forw arcl 


5 ' -ccatcaccttgatcttgtcc-3 1 




1 34 1 


444 


Probe 


hi -l-.tr ^ C . ".V. ,• ::t -1 

3 ' -TAMRA 


29 


1429 


445 


Reverse 


5 ' -cagaagactccaattatgttca-3 ' 




145S 


446 



Table RD . CNS_neurodegeneration_vl.O 



Tissue 
Name 


Rel. 

T? v „ /O/ \ 

JhiXp.(voJ 
Ag3326, 
Run 
210144197 


Rel. 
Exp.(%) 
Ag3692, 

Run 
211145262 


Rel. 

Exp.(%) i 

Ag3692, * 1SSUe 
Name 

Run J 
224337942 | 


Rel. 
Exp.(%) 
Ag3326, 
Run 
210144197 


Rel. 

Exp.(%) 

Ag3692, 

Run 
211145262 


Rel. 
Exp.(%) 
Ag3692, 
Run 
224337942 


AD 1 

Hippo 

..... 


2.1 


4.3 


(Control 
1 0 ((Path) 3 
(Tempora 
|l Ctx 


8.5 


,53 


12.0 


! 

AD 2 
Hippo 


20.9 


28.3 


[control 

25.0 |< Path > 4 
Tempora 

;l Ctx 


31.2 


— — 

36.6 


52.1 


AD 3 
Hippo 


0.0 


0.9 


IAD 1 
0.6 [Occipital 
Ctx 


2.7 


3.0 


0.0 


AD 4 
Hippo 


2.1 


7.1 


IAD2 
Occipital 

2.6 Ctx 

((Missing 
) 


0.0 


— 

0.0 


0.0 


AD 5 
hippo 

~ 


72.7 


97.9 


AD 3 
85.3 lOccipital 

(ctx 


1.5 


7.2 


1.3 


AD 6 
Hippo 


13.7 


18.3 


[AD 4 
5.5 (Occipital 

'Ctx 


7,7 


35.6 


30.6 


Control 2 
Hippo 


14.5 


20.2 


15 2 1 AD5 

(Occipital 


2 ,3 


31.9 




12.4 



( l\ 



Control 4 

"""" " 


11.7 


7.4 


|AD6 
5.1 jOccipital 
jCtx 


17.2 


19.1 


11.2 


Control 
(Path) 3 
Hippo 


6.7 


4.4 


IControl 

^ (Occipital 
Ctx 


7.0 


9.0 


8.1 


AD 1 

Temporal 

Ctx 


4.0 


1.7 


IControl 

2.8 . ., , 
jOccipital 

|Ctx 


33.2 


44.8 


26.1 


AD 2 
Temporal 

Ctx 


80.7 


50.7 


jControl 

37.4 E 

Occipital 

JCtx 


30.1 


37.6 


21.9 


AD 3 

Temporal 

Ctx 


3.6 


0.0 


iControl 

ii I 4 

(Occipital 
JCtx 


16.3 


12.6 


8.2 


AD 4 

Temporal 

Ctx 


19.5 


30.6 

— — 


JControl 
152 l(Path)l 
(Occipital 
Ctx 


42.0 


55.9 


52.9 


AD 5 Inf 
Temporal 
Ctx 


100.0 


100.0 


"(Control 
99 3 j(Path) 2 
JOccipital 
Ctx 


6.7 


13.0 


__ 
7.7 


AD 5 
SupTemp 
oral Ctx: 


32.8 


29.1 


IControl 
33 2 |(Path) 3 
jOccipital 

|Ctx 


8.7 




6.6 


5.4 


AD 6 Inf 
Temporal 

Ctx 


27.7 


21.3 


IControl 

26 6 | (Path)4 
jOccipital 

JCtx 


8.1 


9.0 


7.4 


AD 6 Sup 
Temporal 
Ctx 


41.8 


53.6 


jControl 

17.0 11 . + , 
iPanetal 

jCtx 


21.2 


23.0 


15.3 


Control 1 
Temporal 
Ctx 


12.0 


33.9 


JControl 

12 

jParietal 
Ctx 


48.6 


38.2 


22.1 1 


Control 2 
Temporal 
Ctx 


30.1 


49.3 


IControl 

44 4 P 

jParietal 


28.3 


34.4 


32.8 | 

i 



Control 3 
Temporal 
Ctx 


38.7 


39.5 


(Control 
334 (Path)l 
|Parietal 
|Ctx 


78.5 


97.3 


100.0 


Control 4 
Temporal 
Ctx 


17.6 




25.2 


jControl 
241 (Path) 2 
IParietal 
Ctx 


50.7 


50.7 


37.9 


"r " — 

Control 

(Path) 1 

Temporal 

( (\ 


69.7 

— — 


70.7 


[Control 
497 (Path) 3 
[Parietal 
|Ctx 


10.7 


10.1 


9.6 


Control 
(Path) 2 
Temporal 
Ctx 


35.4 


50.7 


(Control 
334 !(Path)4 
IParietal 
-Ctx 


30.6 


__ 
24.5 


40.9 



Table RE . General_screening_panel_vl.4 



j 

Tissue Name 

! 


Rel. Exp.(%) 
Ag3326, Run 
215678613 


Rel. Exp.(%) 
Ag3692, Run 
217131191 


Tissue Name 


Rel. Exp.(%) 
Ag3326, Run 
215678613 


RpI F*n (%\ 
Ag3692, Run 
217131191 


Adipose 


0.0 


0.0 


Renal ca. TK-10 


11.4 


12.0 


' Melanoma* 
Hs688(A>.1 


0.0 


0.0 


Bladder 


0.0 


0.1 


Melanoma* 
Hs688(B) 1' 


0.! 


0.0 


Gastric ca. (liver 
met.)NCI-N87 


0.0 


0.0 


VIel inona* 
M14 


0.0 


0.0 


Gastric ca. 
KATOIII 


0.0 


0.0 


Melanoma* 
LOXIMVI 


0.0 


n n JColon ca. SW- 
UU 1948 


0.0 


0.0 


Melanoma* 
SK-MEL-5 


0.0 


0.0 


Colon ca. 
SW480 


0.0 


0.0 


i iiiamous 
cell 

carcinoma 
Si C-4 


0.9 


0.7 


Colon ca.* 
(SW480 met) 
SW620 


0.0 


0.0 


Testis Pool 


0.1 j 0.2 iColon ca. 1 I I 29 


0.0 


0.0 


Prostate ca.* 
(bone met) 
PC-3 


0.0 


0.0 


Colon ca. HCT- 
116 


0.0 


0.0 


Prostate Pool 


0.0 


0.0 


Colon ca. CaCo- 
2 


0.0 


0.0 


Placenta 




0.0 


0.0 


Colon cancer 
tissue 


0, 


0.0 


Uterus P«vl 


0.0 


0.0 


Colon ca. 


0.0 


0.0 
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L 




SW111 6 






Ovarian ca. 
OVCAR-3 


0.0 


Colon ca. Colo- 

|205 


0.0 


0.0 


Ovarian ca. 
SK-OV-3 


0.0 


0.0 


Colon ca. SW-48 


0.0 


0.0 


Ovarian ca. 
OVCAR-4 


0.1 


0.0 


Colon Pool 


0.6 


0.0 


Ovarian ca. 
OVCAR-5 


0.0 


0.0 


Small Intestine 
Pool 


0.1 


0.0 


Ovarian ca. 
IGROV-1 


0.0 


0.0 .Stomach Pool 


0.0 


0.0 


Ovarian ca. 
OVCAR-8 


2.8 


7 _ Bone Marrow 
I Pool 


0.0 


0.1 


Ovary 


0.7 


0.6 


Fetal Heart 


0.0 


0.0 


Breast ca. 
MCF-7 


0.0 


0.0 


Heart Pool 


0.0 


0.0 


Breast ca. 

MDA-MB- 

231 


0.0 


0.0 


Lymph Node 
Pool 


0.1 


0.0 


Breast ca. BT 
549 


0.6 


Q g Total Skeletal 
(Muscle 


0.0 


0.0 


Breast ca. 
1471) 


0.0 


ft „ Skeletal Muscle 
U Pool 


0.0 


„ — ™, „ , 

0.0 


Breast ca. 
MDA-N 


0.0 


0.0 Spleen Pool 


0.4 


0.2 


Breast Pool 


0.0 


04 riivimis Pool 


0.0 


0.0 


Trachea 


0.2 


JCNS cancer 
0.1 (glio/astro) U87- 

|mg 


0.0 


0.0 


Lung 




0.0 




0.0 


CNS cancer 
(glio/astro) U- 
118-MG 


0.0 


0.0 


Fetal Lung 


0.2 


0.1 


CNS cancer 
(neuro;met) SK- 
N-AS 


0.0 


0.0 


Lung ca. 
NCI-N417 


0.0 


„ „ CNS cancer 

(astro) SF-539 


0.0 


0.0 


Lung ca. LX- 
1 


0.0 


o o ! CNS cancer 0 0 

(astro) SNB-75 UU 


0.0 


Lung ca. 
NCI-III46 


0.0 


0 0 ? CNS cancer 
(glio)SNR-19 




Lung ca. 
SHP-77 


0.0 


_ n CNS cancel 
(ulu.i SF-295 


0.1 




Lung ca. 
A549 


0.0 


0 1 Brain 

(Amygdala) Pool 


0.4 




Lung a . 


2.0 


0.0 Brain 


1.4 


1.0 
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Ml l-l [ISO 




i(cerebellum) } 




Lung ca. 

ML l-l 1 _ j 


0.7 


0.6 jBrain (fetal) 


0.7 


0.4 


Lung ca. 
NCI-H460 


0.0 


jBrain 

0.0 l(Hippocampus) 
IPool 


0.5 


0.7 


Lung ca. 
HOP-62 


0.1 


q 2 |Cerebral Cortex 
JPool 


1.4 


1.5 


Lung ca 
NCI-H522 

— 


0.0 


n „ .Brain (Substantia 
jnigra) rool 


1.4 


1.4 


Liver 


1 

28.7 


2^ j iBrain 

( Thalamus) Pod! 


1.1 


0.9 




100.0 


LOO.O Brain (whole) 


4.1 




Liver ca. 
HepG2 


29.5 


26.2 JSpinal Cord Pool 


0.1 


0.2 


kidne\ Pool 


0.0 


0.0 Adrenal v 1 fan J 


2.6 


1.9 


Fetal Kidney 


0.1 


n . jPituitary gland 
jPool 


0.0 


0.2 


Renal ca. 
786-0 


0.0 


0.0 


Salivary Gland 


40.9 


35.1 


Renal ca. 
A498 


0.0 

h — \ 


0.0 


Thyroid (female) 


0.0 


0.0 


Renal ca. 
ACHN 


0.0 


0.0 


Pancreatic ca. 
C \PAN2 


0.5 


0.8 


Renal ca. 
UO-31 


0.0 


0.0 


Pancreas Pool 


0.0 


0.0 



Table RF . General_screening_panel_vl.5 



Tissue Name 


Rel. Exp.(%) 
Ag5818, Run 
245382899 


Tissue Name 


Rel. Exp.(%) 
Ag5818, Run 
245382899 


Adipose 

Melanoma* 

Hs688(A).T 


0.0 
0.0 


Renal ca. TK-10 
Bladder 


13.4 
0.0 


Melanoma* 
Hs688(B).T 


0.0 


Gastric ca. (liver met.) 
NCI-N87 


0.0 


Melanoma* Ml 4 


0.0 


Gastric ca. K A TO III 


0.0 


Melanoma* 
LOXIMYI 


0.0 


Colon ca. SW-948 


0.0 


Melanoma* SK- 
MEL-5 


0.0 


Colon ca. SW480 


0.0 


Squamous cell 
carcinoma SCC-4 


1.4 


Colon ca.< (SW480 
met) SW620 


0.0 


Testis Pool 


0.5 


Colon ca. HT29 


0.0 
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Prostate ca.* (bone 
met) PC-3 


0.0 


Colon ca. HCT-116 


0.0 


Prostate Pool 


0.0 


Colon ca. CaCo-2 


0.4 


Placenta 


0.0 




0 0 


Lterus Pool 


0.0 


Colon ca. SW1116 


0.0 


Ovarian ca. 
OVCAR-3 


0.0 


Colon ca. Colo-205 


0.0 


Ovarian ca. SK- 
OV-3 


0.1 


Colon ca. SW-48 


0.0 


Ovarian ca. 


0.0 


Colon Pool 


0.0 


Ovarian ca. 
OVCAR-5 


0.0 


Small Intestine Pool 


— 

0.0 


Ov3.ri3.nt C3. 

IGROV-l 


0.0 


Stomach Pool 


0.0 


Ovarian ca. 
OVCAR-8 


1.9 


Bone Marrow Pool 


0.0 


Ovary 


0.3 


Fetal Heart 


0.0 


Breast ca. MCF-7 


0.0 


Heart Pool 


_ 0.0 


Breast ca. MDA- 
MB-23 1 


0.0 


Lymph Node Pool 


0.0 


Breast ca. BT 549 


0.4 


Fetal Skeletal Muscle 


0.0 


Breast ca. T47D 


0.0 


Skeletal Muscle Pool 


0.0 


Breast ca. MDA-N 


0.0 


Spleen Pool 


0.7 


Breast Pool 


0.0 


Thymus Pool 


0.0 


Trachea 


0.2 


( NS cancel 
(glio/astro) U87-MG 


0.0 


Lung 


n „ jCNS cancer 

fglio/astro) U-118-MG 


0.0 


Fetal Lung 


„ ~ jCNS cancer 

|(neuro;met) SK-N-AS 


0.0 


Lung ca.NCI-N417 

— — — — — 


0.0 


CNS cancer (astro) SI - 
539 


0.0 


Lung ca. LX-1 


0.0 


CNS cancer (astro) 
SNB-75 


0.0 


Lung ca. NCI-H146 


ft n jCNS cancer (glio) 
!SNB-19 


0.0 


Lung ca. SHP-77 


nn ICNS cancer (glio) SF- 
295 


0.0 


Lunge A:4 ! > 


0.2 Brain ( Vmygdala) Pool 


0.7 


1 ungca. NCI-H526 


0.0 jBrain (cerebellum ) 


1.1 


Lung ca.NCI-H23 


1.5 JBrain (fetal) 


0.8 


Lung ca. NCI-H460 


n „ jBrain (Hippocampus) 
U-U jPool 


0.6 
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jLung ca. HOP-62 


0.0 Cerebral Cortex Pool 


1.7 


IT linn ~- X 1 C T I KOI 

JLung ca. JNCl-HDzz 


q q jBrain (Substantia 
|nigra) Pool 


— 

1.2 


[Liver 


40.3 'Brain (Thalamus) Pool 


1.3 


Fetal 1 iver 


100.0 |Brain (whole) 


5.6 


(Liver ca. HepG2 


33.2 iSpinal Cord Pool 


0.3 


|Kidney Pool 


0.0 j Adrenal Gland 


6.0 


petal Kidney 


0.0 Pituitary gland Pool 


0.2 


|Renal ca. 786-0 


0.0 iSalivary Gland 


67.4 


ptodca. A498 


0.0 


Thyroid (female) 


0.0 


|Renal ca. ACHN 


0.0 


Pancreatic ca. 
CAPAN2 


0.7 


Renal ca. UO-31 


0.0 


Pancreas Pool 


0.0 



Table RG . Panel 4. ID 



Tissue Name 


Rel. 

Exp.(%) 
Ag3692, 
Run 
169987356 


Rel. 

Exp.(%) 
Ag5818, 
Run 
246920287 


Tissue Name 


Rel. 
Exp.(%) 
Ag3692, 

Run 
169987356 


Rel. 

Exp.(%) 
Ag5818, 

246920287 


: Secondary Thl act 


0.0 


0.0 


HUVEC II -Ibeta 


0.0 


0.0 


Secondary Th2 act 


0.0 


00 IhWecifn 

gamma 


0.0 


0.0 


Secondary Trl act 


0.0 


0.0 


HUVEC TNF 
alpha + IFN 
gamma 


0.0 


0.0 


Secondary Thl rest 


0.0 


0.0 


HUVEC TNF 
alpha + IL4 


0.0 


0.0 


Secondary Th2 rest 




------»- ... 


HUVEC I L- 11 


00 




Secondary Trl rest 


0.0 


0.0 


Lung 

Microvascular EC 
none 


0.0 


0.0 


Primary Thl act 


0.0 


0.0 


Lung 

Microvascular EC 
TNFalpha + IL- 
lbeta 


0.0 


0.0 


Primary Th2 act 0.0 


q 0 (Microvascular 
Dermal EC none 


II 3 


0.0 


Primary Trl act 


4.2 


0.0 


Microsvasular 
Dermal EC 
TNFalpha + IL- 
lbeta 


0.0 


0.0 


Primary Thl rest 


0.0 


0.0 


Bronchial 
epithelium 


28.5 


0.0 
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TNFalpha + 
ILlbeta 




Primary Th2 rest 


0.0 


q q j Small airway 

epithelium none 


5.7 


0 0 


Primary Trl rest 


0.0 




Small airway 
epithelium 
TNFalpha + IL- 
lbeta 


0.0 


0.0 


CD45R \ ( D4 
Kmphocyte act 


3.9 

, 


0.0 


Coronery artery 
SMC rest 


0.0 


0.0 


CD45RO CD4 
lymphocyte act 


0.0 


0.0 


Coronery artery 
SMC TNFalpha + 
IL-1 beta 


0.0 


0.0 


CD8 lymphocyte 
act 


0.0 


0.0 


Astrocytes rest 


0.0 


0.0 


Secondary CD8 
lymphocyte rest 


0.0 


0.0 


Astrocytes 
TNFalpha + IL- 
lbeta 


0.0 


0.0 


Secondary CD8 
lymphocyte act 


0.0 


0 0 F" 812 

(Basophil) jest 


3.6 


24.3 


CD4 lymphocyte 


0.0 


ijKU-812 
0.0 j(Basophil) 

PMA/ionomycin 


4.3 


0.0 


2ry 

Th 1 /Th2/Tr 1 anti- 
CD95 CHI 1 


0.0 


CCD1 106 

0.0 j(Keratinocytes) 
jnone 


10.7 


0.0 


LAK cells rest 


0.0 


CCD 1 106 

ft „ j(Keratinocytes) 

iTNFalpha + IL- 1 
jlbeta 


0.0 


0.0 1 


LAK cells IL-2 


0.0 


0.0 Liver cut hos^ 


l M 0 


— 

27.5 j 


LAK cells IL- 
2+IL-12 


0.0 


0.0 jNCI-l 1292 none 


0.0 


0.0 | 


LAK cells IL- 
2 U N «amma 


0.0 


0.0 |NCI-H292 IL-4 


0.0 


0.0 | 


T A V ,,1 1 ^ TT n i 

JLAK cells 1L-Z+ 
IL-18 


0.0 


0.0 NCI-H292 IL-9 


0.0 


0.0 


LAK cells 
PMA/ionomycin 


0.0 


0.0 [NCI-H292IL-13 


0.0 


0.0 


NK Cells IL-2 rest 




(ni \< i-i r<p i fn 

Ltamma 


0.0 


0.0 1 


Two Way MLR 3 
day 


0.0 


0.0 HP A EC none 


0.0 


o.o 1 


Two Way MLR 5 ! 
day 


3.2 


on HPAECTNF 

alpha + IL-1 beta 


0.0 


0.0 


Iwo Way Ml R 7 


0.0 


0.0 Lung fibroblast 


0.0 


0.0 
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day 






none 






JrJtJML- rest 


0.0 


0.0 


Lung fibroblast 
TNF alpha + IL-1 
beta 


0.0 


— 

0.0 


PBMC PWM 


0.0 


„ n |Lung fibroblast 
JIL-4 


0.0 


0.0 


PBMC PHA-L 


0.0 


0.0 


Lung fibroblast 
IL-9 


0.0 


0.0 


Ramos (B cell) 
none 


0.0 


0.0 


Lung fibroblast 
IL-13 


0.0 


0.0 


Ramos (B cell) 
ionomycin 


0.0 


0.0 


Lung fibroblast 
IFN gamma 


0.0 


0.0 


B lymphocytes 
PWM 


0.0 


0.0 


Dermal fibroblast 
CCD 1070 rest 


0.0 


0.0 


B lymphocytes 
CD40L and IL-4 


0.0 


0.0 


Dermal fibroblast 
CCD 1070 TNF 
alpha 


0.0 


0.0 


EOL-1 dbcAMP 


0.0 


' 


Dermal fibroblast 
CCD 1070 IL-1 
beta 


0.0 


0.0 


EOL-1 dbcAMP 
PMA/ionomycin 


0.0 


0.0 


Dermal fibroblast 
IFN yam ma 


0.0 


0.0 


Dendritic cells 
none 


0.0 


0.0 


Dermal fibroblast 
IL-4 


0.0 


0.0 


Dendritic cells LPS 


0.0 


0.0 


Dermal 

Fibroblasts rest 


0.0 


0.0 


Dendritic cells 
anti-CD40 


0.0 


0.0 


Neutrophils 
TNFa+LPS 


0.0 


o o ! 


Monocytes rest 


0.0 


0.0 


Neutrophils rest 


0.0 


0.0 I 


Monocytes LPS 


0.0 


0.0 


Colon 


0.0 


0.0 1 


Macrophages rest 


().() 


0.0 


Lung 0.0 


0.0 j 


Macrophages LPS 


0.0 


0.0 


Thymus f 2.4 


0.0 


HUVEC none 


0.0 


0.0 


Kidney | 


100.0 


100.0 


HUVEC starved 


0.0 


0.0 







Table RH . Panel 5 Islet 



Tissue Name 


Rel. Exp.(%) 
Ag3326, Run 
242385365 


Tissue Name 


Rel. Exp.(%) 
Ag3326, Run 
2423S5365 


97457_Patient- 
02go_adipose 


0.0 


94709_Donor 2 AM - A adipose 


0.2 


97476_Patient- 

07 >k_sk :k t il m isc It 


0.0 


94710_Donor 2 AM - B_adipose 


0.0 


97477_Patient- 


00 


9471 l_Donor 2 AM C_.adipose 


0.0 
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07ut_uterus 




97478 P.itient- 
07pl_placenta 


0.0 


94712_Donor 2 AD - A adipose 


0.0 


99 1 67 Bayer Patient 
1 


0.3 


94713_Donor 2 AD - B adipose 


0.0 


y /4oi_i ducni- 
08ut uterus 


0.0 


94714_Donor 2 AD - C_adipose 


0.0 


97483 Patient- 
08pl_placenta 


0.0 




94742_Donor 3 U - 
A_Mesenchymal Stem Cells 


0.0 


QHASttk Putiont 

y/H-oo j aiienx- 
0 ( >sl._ skeletal niuidc 


0 0 l94743_Donor 3 U - 

jB_Mesenchymal Stem Cells 


0.0 


y /^fo / i aueni- 
( 9u1 uterus 


0.0 94730 Donor 3 AM -A adipose 

i 


0.0 


5/ /tooj. dtienx- 
09pl_placenta 


0.0 9473 l_Donor 3 AM - Badipose 


0.0 


9 749 2_Patient- 
10ut_uterus 


0.0 


94732_Donor 3 AM - C_adipose 


0.0 


y / 4y 3_ratient- 
10pl_placenta 


0.0 


94733_Donor 3 AD - A adipose 0.0 


y / 4y j J atient- 
1 1 go adipose 


0.0 


94734_Donor 3 AD - B adipose 


0.0 


y / 4y o_j atient- 

1 Iskskeletal muscle 


0.0 94735_Donor 3 AD - C_adipose 


0.0 


97497_Patient- 
1 1 ut_uterus 


0.0 77 1 3 8_Liver_HepG2untreated 


100.0 


y / tfo j aiieni- 
1 lpl_placenta 


q q 73556_Heart_Cardiac stromal 
icells (primary) 


0.0 


97500 Patient- 
12go_adipose 


0.1 


81735_Small Intestine 


39.5 


97501_Patient- 

1 2sk_skeletal muscle 


0.3 


72409 JCidneyJProximal 
Convoluted Tubule 


0.0 


97502_Patient- 
i 2 Lit uterus 


0.0 


82685_Small 
intestineDuodenum 


0.0 


97503_Patient- 
12pl_placenta 


q q 19065 0_Adrenal_Adrenocortical 
jadenoma 


0.0 


'"72! _l)onor2U- 
A_Mesenchymal 
Stem Cells 


0.0 172410 Kidney HRCE 

1 


0.0 


94722JDonor 2 U - 
B Mesenchymal 
Stem Cells 


0.0 


72411_Kidney_HPvE 


0.0 


- 1 1 '23 Donor 2~U~ 
C Mesenchymal 
Stem Cells 


0.0 


73139_Uterus_Uterine smooth 
muscle cells 


0.0 
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CNS_neurodegeneration_vl.O Summary: Ag3326/Ag3692 - Three experiments done 
with two primer pairs (same sequence) are in excellent agreement. This panel confirms the 
expression of this gene at low levels in the brain in an independent group of individuals. 
However, no differential expression of this gene was detected between Alzheimer's 
5 diseased postmortem brains and those of non-demented controls in this experiment. Please 
see Panel 1 .4 for a discussion of the potential utility of this gene in treatment of central 
nervous system disorders. Ag5818 Results from one experiment are not included. The amp 
plot indicates that there were experimental difficulties with this run. 

General_screening_panel_vl.4 Summary: Ag3326/Ag3692 Two experiments with the 
10 same probe and primer set produce results that are in excellent agreement. This gene is 
highly expressed in fetal liver (CT=26.5-27.0) and moderately expressed in adult liver 
(CT=28 .5-28.8) and liver cancer cell line HepG2 (CT=28.4-28.8). This result agrees with 
the results seen in Panel 5 (expression in HepG2 (CT=29.2). These results are in agreement 
with published data that show a novel sodium dicarboxylate transporter in brain, choroid 
15 plexus kidney, intestine and liver. Thus, expression of this gene could be used to 

differentiate between these samples and other samples on this panel and as a marker for 
liver derived tissue. 

This gene is expressed at low levels throughout the CNS, including in amygdala, 
substantia nigra, thalamus, cerebellum, and cerebral cortex. Therefore, this gene may play a 
20 role in central nervous system disorders such as Parkinson's disease, epilepsy, multiple 
sclerosis, schizophrenia and depression. 

Low but significant levels of expression are also seen in the adrenal gland. Thus, 
this gene product may also be involved in metabolic disorders of this gland, including 
adrenoleukodystrophy and congenital adrenal hyperplasia. 

25 References: 

1. Pajor AM, Gangula R, Yao X. Cloning and functional characterization of a high- 
affinity Na(+)/dicarboxylate cotransporter from mouse brain. Am J Physiol Cell Physiol 
2001 May;280(5):C1215-23. 



30 



2. Chen XZ, Shayakul C, Berger UV, Tian W, Hediger MA. Characterization of a 
rat Na+ -dicarboxylate cotransporter. J Biol Chem 1998 Aug 14;273(33):20972-81. 
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General_screening_panel_vl.5 Summary: Ag5818 Results using this primer pair are in 
excellent agreement with the results seen in panel 1.4. See Panel 1.4 for discussion. 

Panel 4.1D Summary: Ag3692 Significant expression of this gene is seen only in kidney 
and a liver cirrhosis sample (CTs=34.0). These results confirm that this gene is expressed 
5 in liver derived samples. The presence in the kidney is also in agreement with published 
results. Please see Panel 1 .4. This gene product may be involved in maintaining or 
restoring normal function to the kidney during inflammation. 

Panel 4D Summary: Ag3326 Results from one experiment are not included. The amp 
plot indicates that there were experimental difficulties with this run. 

10 Panel 5 Islet Summary: Ag3326 The highest expression of this gene is in liver cancer cell 
line HepG2 (CT=29.2). There is also moderate expression in the small intestine (CT=30.5). 
These results compare well with previously published reports of sodium dicarboxylate 
transporter expression in mouse and rat (see discussion Panel 1.4). 

S. CG57732-01 and CG57732-02 and CG57732-03: CA2+/CALMODULIN- 
1 5 DEPENDENT PROTEIN KINASE IV KINASE 

Expression of gene CG57732-01 and full length clones CG57732-02 and CG57732- 
03, was assessed using the primer-probe set Ag3317, described in Table SA. Results of the 
RTQ-PCR runs are shown in Tables SB, SC and SD. Please note CG57732-03 represents a 
splice variant of CG57732-01 . 

20 Table SA . Probe Name Ag33 1 7 



Primers 

L 


Sequences 


Length 


Start 
Position 


SEQID I 
NO: 


Forward 


5 ' -ggcctacaacgaaagtgaaga- 3 ' 


21 


451 


447 J 


Probe 


TET-5 ' -cagacactatgcaatgaaagtcctttcca- 
3 ' -TAMRA 


29 


472 


448 


Reverse 


5 1 -ggaaagccatactgcttcagta-3 ' 


22 


510 


449 ) 



Table S B. CNS_neurodegeneration_vl.0 



Tissue Name 


Rel. Exp.(%) Ag3317, 
Run 210144081 


Tissue Name 


Rel. Exp.(%) 
Ag3317, Run 
2 101 441(81 


\l ) 1 1 lippo 


10.7 


Control i Path) 3 


4.1 
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Temporal Ctx 




AD 2 Hippo 


23.7 


Control (Path) 4 
|Temporal Ctx 


42.6 


AD 3 Hippo 


4 5 


]AD 1 Occipital Ctx 


12.5 


AD 4 Hippo 


7.5 


AD 2 Occipital Ctx 
1 (Mussing) 


0.0 


AD 5 hippo 


97.9 


AD 3 ( fccipital Ctx 


5.4 


AD 6 Hippo 


25.7 


AD 4 Occipital ( 't\ 


1 8.4 


Control 2 Hippo 


24.8 


(AD 5 Occipital Ctx 


21.8 


Control 4 Hippo 


4.3 


J AD 6 Occipital Ctx 


58.6 


Control (Path) 3 
Hippo 


2.8 


[Control 1 Occipital 

]Ctx 


1.5 


AD 1 Temporal Ctx 


10.4 


(Control 2 Occipital 
Ctx 


94.0 


AD 2 Temporal Ctx 


35.8 


Control 3 Occipital 
Ctx 


21.5 


AD 3 Temporal Ctx 


5.8 


[Control 4 Occipital 

ICtx 


2.6 


AD 4 Temporal Ctx 


23.2 


[Control (Path) 1 
[Occipital Ctx 


100.0 


A!) 5 fi t lemporal 
Ctx 


88.9 


Control (Path) 2 
(Occipital Ctx 


13.8 


AD 5 SupTemporal 
Ctx 


26.6 


[Control (Path) 3 
Occipital Ctx 


0.9 


AD 6 Inf Temporal 

Ctx 


39.5 


Control (Path) 4 
Occipital Ctx 


™ 

19.6 


AD 6 Sup Temporal 
Ctx 


47.3 


Control 1 Parietal 

(Ctx 


4.9 


Control 1 Temporal 
Ctx 


4.4 


(Control 2 Parietal 

ICtx 


33.0 


Control 2 Temporal 
Ctx 


63.3 


IControl 3 Parietal 
(Ctx 


— - „„,,,,, ,„„, m ..„. T ..„. rm . 

27.4 


Control 3 Temporal 
Ctx 


20.4 


[Control (Path) 1 
jParietal Ctx 


95.9 


Control 4 Temporal 
Ctx 


8.7 


jControl (Path) 2 
iParietal Ctx 


24.5 


Control (Path) 1 
Temporal Ctx 


77.4 


[Control (Path) 3 
JParietal Ctx 


2.0 


Control (Path) 2 
Temporal Ctx 


— 

38.7 


Control (Path) 4 
Parietal Ctx 


51.8 


Table SC. General screening_panel vl.4 


Tissue Name I 


Rel. Exp.(%) 
Ag3317, Run 


Tissue Name 


Rel. Exp.(%) 
Ag3317, Run 
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215678602 




21*678602 


Adipose 


2.4 


Renal Tk"-1 ft 
IvClIdl La. 1 rv 1 V 




Melanoma* 
[Hs688(A).T 


6.2 


Bladder 


10.5 


[Melanoma* 
[Hs688(B).T 


7.9 


Gastric ca. (liver met.) 
NCI-N87 


22.2 


]VIcl3.iiom3.* Mil 4 


18.2 


OflQtnV rn K A TO TTT 
VJabLIie La. rv_r\ 1 111 


23 0 


Melanoma* 
LOXIMVI 


9.4 


Colon ca. SW-948 


11.1 


^Melanoma* SK.- 
1mEL-5 _ 


9.8 


— - • — — 

Colon ca. SW480 


1 " — 

20.9 


Squamous cell 
[carcinoma SCC-4 


1.6 


Colon ca.* (SW480 
met) SW620 


21.6 


Testis Pool 


13.1 


Colon ca. HT29 


11.3 


iProstate ca.* (bone 
met) PC-3 


6.4 


Colon ca. HCT-116 


27.0 


[Prostate Pool 


3.1 


Colon ca. CaCo-2 


1.6 : 


Placenta 


1.0 


l oluii cancer tissue 


i i.j 


[Uterus Pool 


3.9 


Colon ca~SW1116 


9.7 "~ 


Ovarian ca. 
iOVCAR-3 


11.6 


Colon ca. Colo-205 


1.7 


Ovarian ca. SK- 
OV-3 


18.7 


Colon ca. SW-48 


8.8 


Ovarian ca. 

OVf AT? A 


1 

3.4 

.. , 


Colon Pool 


17.1 


Ovarian ca. 
OVCAR-5 


17.2 


_______ 

Small Intestine Pool 


21.2 


Ov3ri3.Il C3.. 

IGROV-l 


6.2 


Stomach Pool 


5.3 


Ovarian ca. 
OVCAR-8 


4.7 


Bone Marrow Pool 


5.1 


Ovary 


2.9 


Fetal Heart 


6.8 


Breast ca. MCF-7 


6.1 


Heart Pool 


5.4 


Breast ca. MDA- 
MB-231. 


20.3 


Lymph Node Pool 


13.4 


Breast ca. BT 549 




Fetal Skeletal Muscle 


2.6 


Breast ca. T47D 


37.9 


Skeletal Muscle Pool 


2.3 


Breast ca. MDA-N 


9.0 


Spleen Pool 


2.8 


Breast Pool 


12.0 (Thymus Pool 


9.0 


Trachea 


, - j jCNS cancer 

|(glio/astro) U87-MG 


66.4 


Lung 


0.7 


CNS cancer 

(glio: astro) U-l 18-MG 


53.2 


Fetal Lung 


6.0 


CNS cancer 


4.6 
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1 


I(neuro;met) SK-N-AS 




Lung ca.NCI-N417 


16.5 


» Nn cancer (astro) SF- 


17.2 


Lung ca. LX-1 


20.9 


CNS cancer (astro) 
SNB-75 


21.5 


Lung ca. NCI-H146 


7.0 


CNS cancer (glio) 
SNB-19 


5.1 


Lung cel. SHP-77 


23.0 


CNS cancer (glio) SF- 
295 


12.2 


ILung ca. A549 




23.7 


Brain (Amygdala) Pool 


46.3 


[Lung ca. NCI-H526 




4.4 


Brain (cerebellum) 


92.7 


|Lung ca. NCI-H23 




5.8 


Brain (fetal) 


25.7 


Lung ca. NCI-H460 




10.3 


Brain (Hippocampus) 
Pool 




42.9 


Lung ca.HOP-62 




7.0 


Cerebral Cortex Pool 




100 0 


r~ - • -i 

jjLung ca. in^i-Hjzz 


2.9 


Brain (Substantia 
nigra) Pool 


76.3 


(Liver 


0.1 


Brain ( 1 lialamus) Pool 


"" 63.7 


Fetal Liver 


1.3 " 


Brain (whole) 


56.6 


Liver ca. HepG2 


14 


Spinal Cord Pool 


9.3 


Kidney Pool 




26.2 


\dren ii ulai.d 




16.2 1 


Fetal KiJne\ 




3.5 


Pituitary gland Pool 




\6A ! 


Renal ca. 786-0 




26.4 ;Salivary Gland 




13 J ' i 


Renal ca. 




14.2 (Thyroid (female) 


6.3 


Renal ca. ACHN 




33.2 


Pancreatic ca. 
CAPAN2 


2.6 


Renal ca. UO-3 1 




4.3 


Pancreas Pool 


19.6 


Table SD. Panel 4D 


Tissue Name 


Rel. Exp.(%) 
Ag3317, Run 
164683049 


Tissue Name 


Rel. Exp.(%) I 
Ag3317, Run [ 
164683049 


Secondary Thl act 


21.6 


HUVEC IL-lbeta 


3.8 


Secondary Th2 act 


23.2 


HUVEC IFN gamma 


12.5 


Secondary Trl act 


22.8 


HUVEC TNF alpha + 
IFN gamma 


2.9 


Secondary Thl rest 


12.7 


HUVEC TNF alpha + 
IL4 




9.0 1 


Seconders- Th2 rest 


9.3 


HUVEC IL-11 




4.0 


Secondary Trl rest 


33.7 


Lung Microvascular EC 
none 




24.3 


Primary Thl act 


44.1 


Lung Microvascular EC 
TNFalpha + IL-lbeta 


11.3 



605 



b 



——- — — ~ — — — 
Primary Th2 act 


49 3 (Microvascular Dermal 
1 (.' none 


41.5 


Primary Trl act 


„. ™ Microsvasular Dermal 

EC TNFalpha i IL-lbeta 


17.2 


Primary Thl rest 


- (Bronchial epithelium 
TNFalpha + 1L1 beta 


31.2 


Primary Th2 rest 


^ ^ Small airway epithelium 


8.0 


Primary Trl rest 


- 0 „ (Small airway epithelium 

iTXTFnlnlin -1- TT 1 hf=tn 
1 1 fNr dipna J.JL- I DCld 


11.1 


CD45RA CD4 
lymphocyte set 


41.2 jCoronery artery SMC rest 


20.6 


CD45RO CD4 
lymphocyte act 


? - f> Coronery artery SMC 
i in r aipna ^ ll,- i oeta 


19.6 


CD8 lymphocyte act 


17.8 Astrocytes rest 


14.7 


Secondary CD8 
lymphocyte rest 


0 , o {Astrocytes TNFalpha + 
IlL-lbeta 


11.0 


Secondary CD8 
1} mphocyte act 


7.4 Kl 1-812 (Basophil) resl 


2.1 


CD4 lymphocyte none 


, ! g K.U-812 (Basophil) 
iPMA/ionomycin 


12.9 


2ry Thl/Th2/Trl_anti- 

PFJOS PHI 1 


- s CCD1106 

j(Keratinocytes) no no 


. ..... P 

30.6 




LAK cells rest 


|CCD1 106 
51.1 j (Keratinocytes) 

INI alpha i IL-lbeta 


23.7 


LAK cells IL-2 


7.0 Liver cirrhosis 


0.8 


jLAK cells IL-2-HL- 12 


25.5 Lupus kidney 


2.2 


LAK cells IL-2+IFN 
(gamma 


35.4 


NCI-H292 none 


33.7 


[LAK cells IL-2+ IL-18 


28.7 


NCI-H292 IL-4 


43.5 


LAK cells 
[PMA/ionomycin 


20.6 


NCI-H292 IL-9 


36.3 


Nk Cells IL-2 rest 


13.5 NCI-H292IL-13 


35.6 


Two Way MLR 3 day 


33.0 {NCI-H292 IFN gamma 


24.3 


Two Way MLR 5 day 


9.6 HPAEC none 


22.8 


Two Way MLR 7 day 


1 II' \ECTNF alphas 11 - 
1U U jl beta 


8.3 


PBMC rest 


12.0 JLung fibroblast none 


11.8 


PBMC PWM 


24 ? Lung fibroblast TNF 
lalpha + IL-1 beta 


1.2 


reMc PLI \-l 


32.5 1 ung fibroblast IL-4 


19.2 


Ramos [ B cell) none 


1 .5 Lung fibroblast IL-9 


12.1 


Ramos (B cell) 
ionomycin 


' 0.0 jLung fibroblast IL-1 3 


14.8 
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B lymphocytes PWM 


41 2 Lung fibroblast IFN 

gamma j 


B lymphocytes CD40L 
and IL-4 


. . c ; Dermal fibroblast ! + „ 
145 |cCD1070rest 1000 


EOL-1 dbcAMP 


9nn jDermal fibroblast j 
CCD 1070 TNF alpha j 


l ; OI - 1 f-i \ VIP 

PMA/ionomycin 


60.3 


Dermal fibroblast 
CCD1070IL-1 beta 


14.2 


Dendritic cells none 


55.5 


Dermal fibroblast IFN 
gamma 


24.1 


Dendritic cells LPS 


26.1 


Dermal fibroblast 1 1 I 


39.0 


Dendritic cells anti- 
CD40 


74.7 


IBD Colitis 2 


1.6 


Monoc) tes rest 


4X o IBD Crohn's 


2.7 


Monocytes LI'S 


15.4 (Colon 


19.1 


Maci ,>phages rest 


98.6 (Lung 


14.4 


Mach .phages LPS 


5.6 Thymus 


10.5 


HUV1 Cnone" 


27.9 Kidney 


100.0 


HUVEC starved ~ j 


27.0 





CNS_neurodegeneration_vl.O Summary: Ag3317 - This panel does not show 
differential expression of this gene in Alzheimer's disease. However, this expression profile 
confirms the presence of this gene in the brain. Please see Panel 1.4 for discussion of utility 
of this gene in the central nervous system. 



5 General_screening_panel_vl.4 Summary: Ag33 17 - There is low to moderate 

expression this gene across all samples on this panel. This gene is expressed at moderate 
levels throughout the CNS, including in amygdala, substantia nigra, thalamus, cerebellum, 
and cerebral cortex. Highest expression is observed in the cerebral cortex (CT=29.0). This 
gene encodes a calmodulin-dependent protein kinase IV homolog, which is known to play 
1 0 a role in Ca2+ signaling in the CNS that controls neuronal growth, differentiation, and 
plasticity. Mice deficient in calmodulin-dependent protein kinase IV were found to have 
cerebellar defects. Therefore, this gene may play a role in central nervous system disorders 
such as Parkinson's disease, epilepsy, multiple sclerosis, schizophrenia and depression. 

This gene product is also expressed in adipose, pancreas, adrenal, thyroid, pituitary, 
15 skeletal muscle, heart, and liver. This widespread expression in tissues with metabolic 

function suggests that this gene product may be important for the pathogenesis, diagnosis, 
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and/or treatment of metabolic and endocrine diseases, including obesity and Types 1 and 2 
diabetes. 

Based on expression in this panel, this gene may be also be involved in gastric, 
pancreatic, brain, colon, renal, lung, breast, ovarian and prostate cancer as well as 
5 melanomas. Thus, expression of this gene could be used as a diagnostic marker for the 
presence of these cancers. Furthermore, therapeutic inhibition using antibodies or small 
molecule drugs might be of use in the treatment of these cancers. 

References: 

1. Okuno S, Kitani T, Fujisawa H. Evidence for the existence of Ca2+/calmodulin- 
10 dependent protein kinase IV kinase isoforms in rat brain. J Biochem (Tokyo) 1996 

Jun; 11 9(6): 11 76-81. 

2. Ribar TJ, Rodriguiz RM, Khiroug L, Wetsel WC, Augustine GJ, Means AR. 
Cerebellar defects in Ca2+/calmodulin kinase IV-deficient mice. J Neurosci 2000 Nov 
15;20(22):RC107. 

15 Panel 4D Summary: Ag3317 - This gene was found to have low expression across almost 
all the samples on this panel, with the highest level of expression seen in kidney and resting 
dermal fibroblasts (CTs=32). Expression of Ca2+/calmodulin-dependent kinase type IV in 
thymocytes has been found in mice, where it plays a role in Ca2+-dependent gene 
transcription. 

20 Reference 

1. Raman V, Blaeser F, Ho N, Engle DL, Williams CB, Chatila TA. Requirement 
for Ca2+/calmodulin-dependent kinase type IV/Gr in setting the thymocyte selection 
threshold. J Immunol 2001 Dec 1;167(1 1):6270-8. 

T. CG57709-01: Novel mitochondrial protein 

25 Expression of gene CG57709-01 was assessed using the primer-probe set Ag3323, 

described in Table TA. Results of the RTQ-PCR runs are shown in Tables TB, TC and TD. 

Table TA . Probe Name Ag3323 
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Primers 


Sequences 


Length 


Start 
Position 


SEQID 
NO: 


Forward 


5 ' -atgtgcagaggatacgcatg-3 ' 


~~ 20~*" 


589 


450 


Probe 


TET-5 1 -tgcaaaacaggaagacaaaggaaggg-3 1 - 
TAMRA 


26 


626 


451 


Reverse 


5 ' -tggttctggcattctagacg-3 ' 


20 


665 


452 



Table TB . CNSneurodegenerationvl.O 



Tissue INjuiic 


Rel.Exp.(%)Ag3323,| 

Run210144152 I issue Name 


ReL Exp.(%) Ag3323, 
Run 210144152 


— — — ■ — — 

AD 1 Hippo 


22 5 (Control (Path) 3 
[Temporal Ctx 


5.2 


AD 9 Hmnn 


29.5 


Control (Path) 4 
Temporal Ctx 




f\LJ j raippo 


6.9 


AD 1 Occipital 
Ctx 


18 6 


API A T 1 \rw\t ^ 


? 4 |AD 2 Occipital 
Ctx (Missing) 


0 0 

"... 




8 - a jAD 3 Occipital 

jCtx 


7.6 


AD 6 Hippo 


66.4 


AD 4 Occipital 
Ctx 


17 8 

L ' ™ 


Control 2 Hippo 


27.5 


AD 5 Occipital 
Ctx 


30.8 


Control 4 Hippo 


, . Q AD 6 Occipital 
11 y Ctx 


48.6 


Control (Path) 3 
Hippo 


„ . IControl 1 Occipital 

jCtx 


4.0 


AD 1 Temporal Ctx 


0 , (Control 2 Occipital 
186 Ctx 


58.2 


AD 2 Temporal Ctx 


g [Control 3 Occipital 
Ctx 


14.2 


! ™ 

AD 3 Temporal Ctx 


7 , IControl 4 Occipital 

[Ctx 


6.6 


AD 4 Temporal Ctx 


91 . IControl (Path) 1 
(Occipital Ctx 


70.7 


AD 5 Inf Temporal 
Ctx 


100.0 Con^KPa^ 
sOccipital Ctx 


12.6 


AD 5 SupTemporal 
Ctx 


42 6 IControl (Path) 3 
JOccipital Ctx 


2.4 


AD 6 Inf Temporal 

Ctx 


4g 6 'Control (Path) 4 
Occipital Ctx 


14.6 


AD 6 Sup Temporal 

Ctx 


„_ _ IControl 1 Parietal 
420 Ctx 


6.5 
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Control 1 Temporal j , , Control 2 Parietal 

Ctx _ j b - 3 ]Ctx 


48.0 


Control 2 Temporal 
Ctx 


_ Q _ jControl 3 Parietal 

JCtx 


~ T 

19.6 


Control 3 Temporal 
Ctx 


Ul IControl (Path) 1 
Parietal Ctx 


61.1 1 


Control 4 Temporal 
Ctx 


8.9 


Control (Path) 2 
Parietal Ctx 


19.3 


Control (Path) 1 
Temporal Ctx 


53.6 


Control (Path) 3 
Parietal Ctx 


3.8 


Control (Path) 2 
Temporal Ctx 


34.2 


Control (Path) 4 
Parietal Ctx 


42.6 



Table TC . Panel 1.3D 



Tissue Name 


Rel. Exp.(%) 
Ag3323, Run 
165678151 


Tissue Name 


Rel. Exp.(%) 
Ag3323, Run 
165678151 


Liver adenocarcinoma 


25.0 


Kidney (fetal) 


'■' 


Pancreas 


12. S 


Renal ca. 786-0 


14.3 


Pancreatic ca. CAP AN 

2 


24.5 


Renal ca. A498 


34.2 


Adrenal gland 


12.2 


Renal ca". RXFl93~ 


14 2 


Thyroid 


6.9 


Renal ca. ACHN 


12.9 


Salivarv gland 


14.0 


Renal ca. I K )-3 1 


48.6 


Pituitary gland 


10.1 


ul «..! I k-10 


7.2 


Brain (fetal) 


13.7 


Liver 


20.2 


(.{rain (v. hole) 


29.7 


Liver (fetal) 


22.1 


Brain (amygdala) 


21.3 


Liver ca. 

(hepatoblast) HepG2 


21.3 


Brain (cerebellum) 


24 7 


Lung 


6.7 


Brain (hippocampus) 


25.7 


Lung (fetal) 


14.8 


Brain (substantia nigra) 


20.0 


Lung ca. (small cell) 
LX-1 


39.8 


Brain (thalamus) 


27.2 


Lung ca. (small cell) 
NCI-H69 


25.0 


Cerebral Cortex 


33.0 


Lung ca. (s.cell var.) 
SHP-77 


42.3 


Spinal cord 


16.5 


Lung ca. (large 
cell)NCI-H460 


25.7 


glio/astro U87-MG 


8.9 


Lung ca. (non-sm. 
cell) A549 


12.0 


glio/astro U-118-MG 


100.0 


Lung ca. (non-s.cell) 
NCI-H23 


9.1 
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astrocytoma SW1 783 


14.6 


Lung ca. (non-s.cell) 
HOP-62 


9.5 


neuro*; met SK-N-AS 


43.2 


Lung ca. (non-s.cl) 


10.7 


astrocytoma SF-539 


13.9 


Lung ca. (squam.) 
o w yuvj 


12.4 


astrocytoma SNB-75 


29.7 


Lung ca. (squam.) 

IN V. 1 1 1 ^' yyy 


59.0 


gllUIilct o J/N £5-1:/ 


13 5 


Mammary gland 


10 6 


glioma U251 


43.8 


Breast ca.* (pl.ef) 
MCF-7 


46.3 


glioma SF-295 


17.7 


Breast ca.* (pl.ef) 
MDA-MB-231 


31.6 


Heart (fetal) 


22.7 


Breast ca.* (pl.ef) 
T47D 


15.1 


Heart 


14.5 


Breast ca. BT-549 


54.0 


Skeletal inn >t !e ( ic f il ) 




Breast ca. MDA-N 




(Skeletal muscle 


55.5 


Ovary 


8.7 


I — 

Bone marrow 


10.7 


Ovarian ca. 
OVCAR-3 


26.2 


Thymus. 


5.5 


Ovarian ca. 
OVCAR-4 


21.6 


Spleen 


13.3 


Ovarian ca. 

U V LAK-j 


20.9 


Lymph node 


24.8 


Ovarian ca. 
OVCAR-8 


12.6 


Colorectal 


8.8 


Ovarian ca. IGROV- 
1 


4.4 


| Stomach 


15.1 


Ovarian ca.* 
(ascites) SK-OV-3 


23.5 


[Small intestine 


28.3*" 


Uterus 


14.3 


jv^oion ca. 5W48U 




Placenta 




Colon ca.* 

o WOZUI v o W H-oU Ul&l) 


17.6 


Prostate 


9.5 


Colon ca. HT29 


14.6 


Prostate ca.* (bone 
metJrC-3 


17.7 


|Colonca.HCT-116 


43.2 


Testis 


10.1 


i Colon ca. CaCo-2 


12.7 


Melanoma 
Hs688(A).T 


7.6 


i Colon ca. 
tissue(OD03866) 


22.5 


Melanoma* (met) 
Hs688(B).l 


6.6 


Colon ca. HCC-2998 


25.2 


Melanoma UACC- 
62 


19.6 


Gastric ca.* (liver met) 
NCI-N87 


29.5 


Melanoma Ml 4 


39.2 
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Bladder 


. . (Melanoma LOX 
6A IMVI 


13.4 


Trachea. 


1 ~ ? Melanoma* (met) 
3 - >k-MEL-5 


21.2 


kldlKV 


15.6 | Adipose 


« 



Table TP . Panel 4D 



Tissue Name 




Rel. Exp.(%) 
Ag3323, Run 
[65296416 


Tissue Name 


Rel. Exp.(%) 
Ag3323, Run 
165296416 


Secondary Fhl act 


32.3 ~ ' 


HUVEC IL-lbeta " 


3.8 


Secondary Th2 at l 


22.S 


HUVlciFN"gamma 


12.0 


Secondary Trl act 


29.9 


HUVEC TNI • alpha v 
IFN gamma 


8.1 


Secondary Thl rest 


1HUVECTNF alpha + 
II 1 


11.1 


Secondary Th2 rest 


4.3 HUVEC IL- 11 


84 


Secondary Trl rest 


, ,. Lung Microvascular EC 
6.0 1 

none 


7.6 


Primary Thl act 


n 1 ung Microvascular EC 
TNFalpha + IL-lbeta 


6.9 


Primary Th2 act 


0 . 0 Microvascular Dermal 
EC none 


1 

14.7 


Primary Trl act 


. . \ I Lrosvasular Dermal 

EC TNI 'alpha + IL-lbeta 


7.6 


Primary Thl rest 


17 „ {Bronchial epithelium 
: TNFalpha . ILlbeta 


17.3 


Primary Th2 rest 


j j g jSmall airway epithelium 
Inone 




Primary Trl rest 


1 _ n Small airway epithelium 
TNFalpha + IL-lbeta 


18.4 


CD45RA CD4 

lymphocyte act 

________ 


15.0 Coronery artery SMC rest 


9.9 


lymphocyte act 


?4 _ Coronery artery SMC 
1 (TNFalpha + IL-lbeta 


6.5 


CD8 lymphocyte act 


19.3 Astrocytes rest 


5.1 


Secondary CD8 
lymphocyte rest 


22.7 


Astrocytes TNFalpha + 
IL-lbeta 


3.9 


Secondary CD? 
lymphocyte act 


12.9 


KU-812 (Basophil) rest 


14.0 


CD4 lymphocyte none 


2.9 


Kl 1-812 (Basophil) 
PMA/ionomycin 


22.1 


2ry Thl/Th2/Trl anti- 
CD95 CHI 1 


5.4 


CCD1106 

(keratinoc_\tes) none 


16.0 
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LAK cells rest 


|CCD1106 
7.2 jj(Keratinocytes) 

TNFalpha + lL-lbeta 


8.1 


LAK cells JL-2 


17.2 Liver cirrhosis 


1.7 


LAK cells IL-2+IL-12 


15.1 


Lupus kidney 


1.0 


LAK cells IL-2+IFN 
gamma 


27.9 


NCI-H292 none 


30.1 


LAK cells IL-2+ IL-18 


17.7 NC1-I I292 IL-4 


49.0 


LAK cells 
PMA/ionomycin 


1.9 |NCI-H292 IL-9 


33.2 


NK Cells JL-2 rest 


8.4 (NCI-H292IL-13 


26.2 | 


Two Way MLR 3 day 


v.v ;[nL1-I \lvl IIIN gamma 


26.6 


Two Way MLR 5 day 


18.4 HPAECnone 


11 7 1 

., J 


Two Way MLR 7 day 


8Q HPAECTNF alpha + IL- 
™ 1 beta 


7.5 


PBMC rest 


3.8 (Lung fibroblast none 


8.0 


PBMC PWM 


<-q ^ |Lung fibroblast TNF 
alpha + II - 1 beta 


5.5 


PBMC PI 1 \ 1 


29.3 Lung fibroblast IL-4~ 




Ramos (B cell) none 


33.9 Lung fibroblast IL-9 


15.3 J 


Ramos (B cell) 
ionomycin 


83.5 Lung fibroblast IL-13 


11.4 


B lymphocytes PWM 


ioo o 1 un ~ 1,1 ' 1 1 !l N 

jgamma 


16.5 


B lymphocytes CD40L 
and IL-4 


. iDermal fibroblast 
[CCDl 070 rest 


28.9 


EOL-1 dbcAMP 


lf) „ ; Dermal fibroblast _ 1 _ 
JCCD1070 TNF alpha J 31-2 


iiU.L-1 QDCAMr 
PMA/ionomycin 


- 7 Dermal fibroblast 

CC Dl 070 IL-1 beta 


11.3 


Dendritic cells none 


9 9 Dermal fibroblast IFN 

igamma 


" "1 


Dendrit c cells 1 PS 


6.3 Dermal fibroblast IL-4 


12.3 1 


jDendritic cells anti- 
jCD40 


7.3 1IBD Colitis 2 0.7 


]Monoc>tes rest 


7.0 IBD Crohn's 


1.0 


Monocvtes LPS 


1.4 Colon 


8.8 


Macrophages rest 


13.5 iLung 


5.8 


Macrophages LPS 


3.5 sThymus 


17.3 


III IVEC none 


11.9 Kidney 


12.3 


|5UVEC starved 


24.8 ! j | 



CNS_neurodegeneration_vl.0 Summary: Ag3323 This panel does not show differential 
expression of the CG57709-01 gene in Alzheimer's disease. However, this expression 
613 



profile confirms the presence of this gene in the brain. Please see Panel 1 .3D for discussion 
of utility of this gene in the central nervous system. 

Panel 1.3D Summary: Ag3323 - This gene is expressed at moderate levels in all samples 
on this panel, with highest expression in a brain cancer cell line. Expression is also seen in 
5 all the cancer cell lines on this panel. Thus, expression of this gene could be used to 

differentiate between this brain cancer cell line sample and other samples on this panel and 
as a marker for brain cancer. 

Among tissues with metabolic function, this gene is expressed at moderate to low 
levels in pituitary, adipose, adrenal gland, pancreas, thyroid, and adult and fetal skeletal 
1 0 muscle, heart, and liver. This widespread expression among these tissues suggests that this 
gene product may play a role in normal neuroendocrine and metabolic and that disregulated 
express! on of this gene may contribute to neuroendocrine disorders or metabolic diseases, 
such as obesity and diabetes. 

This molecule is also expressed at moderate to low levels in the CNS and may be a 
15 small molecule target for the treatment of neurologic diseases such as Alzheimer's disease, 
Parkinson's disease, epilepsy, schizophrenia, stroke and multiple sclerosis. 

Panel 4D Summary: Ag3323 - This gene is expressed at high to moderate levels in all 
samples, on this panel, with highest expression in B lymphocytes stimulated with polkweed 
mitogen (CT=24.5). In addition, this gene is expressed at higher levels in ionomycin- 

20 activated Ramos B lymphocytes. The highl levels of expression in activated B lymphocytes 
suggests that therapies that antagonize the function of this gene product may reduce or 
eliminate the symptoms in patients with autoimmune and inflammatory diseases in which 
B cells play a part in the initiation or progression of the disease process, such as lupus 
erythematosus, Crohn's disease, ulcerative colitis, multiple sclerosis, chronic obstructive 

25 pulmonary disease, asthma, emphysema, rheumatoid arthritis, or psoriasis. 

U. CG57700-01: HYDROXY ACYLGLUTATHIONE HYDROLASE 
(GLYOXALASE II) 

Expression of gene CG57700-01 was assessed using the primer-probe set Ag331 1, 
described in Table UA. Results of the RTQ-PCR runs are shown in Table UB. 
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Table UA . Probe Name Ag33 1 1 
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Primers 


Sequences 


Length 


Start 
Position 


SEQ ID NO: 


Forward 
Probe 


5 ' -acgcttagcaacctggagtt-3 ' 


20 




h" ; 


TET-5 ' -accacgtgagagccaagctgtcct-3 1 - 

TAMRA 


24 


582 


454 


Reverse 


5 ' -gtcatcctcatccctcttctg-3 ' 


21 


611 
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Table UB . Panel 4D 



Tissue Name 


Rel. Exp.(%) 
Ag3311,Run 
164682845 


Tissue Name 


Rel. Exp.(%) 
Ag3311,Run 
164682845 


Secondary Th 1 act 


10.2 


HUVEC IL-lbeta 


0.0 


Secondary Th2 act 


3.8 


HUVEC I FN gamma 


0.0 


Secondary Trl act 


0.0 


HUVEC TNF alpha + 
IFN gamma 


0.0 


Secondary Thl rest 


HUVEC TNF alpha + 

m 


0.0 


Secondary Th2 rest 


0.0 111 VLX IL-1 1 


0.0 


Secondary Trl rest 


q q Lung Microvascular EC 

(none 


0.0 


Primary Thl act 


„ n iLung Microvascular EC 
,TNFalpha + IL-1 beta 


0.0 


„ . ™ „ . _ n (Microvascular Dermal 
Primary Th2 act 0.0 L _ 

IX none 


0.0 


„ . m i , , jMicrosvasular Dermal 
Primary Trl act 1.6 L_ „.„ . , , TT , 
i ^EC TNFalpha + IL-lbeta 


0.0 


Primary Thl rest 


0.0 


Bronchial epithelium 
TNFalpha - ILIbeta 


5.1 


Primary Th2 rest 


0.0 


Small airway epithelium 
none 


0.0 


Primary Trl rest 


0.0 


Small airway epithelium 
1 Nl alpha + IL-lbeta 


0.0 


CD45RACD4 
lymphocyte act 


0.0 [Coronery artery SMC rest 


0.0 


CD45RO CD4 
lymphocyte act 


„ n JCoronery artery SMC 
TNFalpha + IL-1 beta 


0.0 


C'D.S lymphocy te act 


0.0 Astrocj ics rest 


0.0 


Secondary CI)<S 
lymphocyte rest 


„ „ lAstrocytes TNFalpha + 
|IL-lbeta 




Secondary CD8 
lymphocyte act 


0.0 KU-8 12 (Basophil) rest 

1 — - — 


0.0 


CD4 lymphocyte none 


00 KU-8 12 (Basophil) 
PMA/ionomyciii 


0.0 


2n rhiah2Trl anti- 


0.0 CCD 1106 


4.2 
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(Keratinocytes) none 










CCD1106 






LAK cells rest 


0.0 


(Keratinocytes) 
TNFalpha + IL-lbeta 


2.7 




LAK cells IL-2 


0.0 


Liver cirrhosis 


0.0 




LAK cells IL-2+IL-12 


0.0 


Lupus kidney 


0.0 




LAK cells IL-2+IFN 
gamma 


0.0 


NCI-H292 none 


0.0 




LAK cells IL-2+IL-18 


0.0 NCI-H292 IL-4 


0.0 




LAK cells 
PMA/ionomycin 


0.0 


NCI-H292 IL-9 


0.0 




NK Cells IL-2 rest 


0.0 


NCI-H292 IL-13 


0.0 


s 


Two Way MLR 3 day 


4.5 


NCI-H292 IFN gamma 


1.9 




Two Way MLR 5 day 


0.0 I IP ABC none 


0.0 


yy 


Two Way MLR 7 day 


0 0 


HPAEC TNF alpha + IL- 
1 beta 


0.0 


a 


I'BMC rest 


0.0 


Lung fibroblast none 


0.0 


D 


PBMC PWM 


3.7 

— 


Lung fibroblast TNF 
aipLa - ! 11 - ': beta 


0.0 


hi 


PBMC PHA-L 


0.0 


1 ung 1 1 1 - . i| i i 11 -4 


0.0 




Ramos IB lcII) none 


0.0 


1 i ii n , i| i ,t IL 


14.1 


n 


Ramos (B cell) 
ionomycin 


0.0 


Lung fibroblast IL-13 


4.3 




B lymphocytes PWM 


0.0 


Lung fibroblast IFN 
gamma 


00 




B MnpiocUe CP40I 


0.0 


Dermal fibroblast 


0.0 




and IL-4 


CCD 1070 rest 


IeOL-1 dbcAMP 


0.0 


Dermal fibroblast 
C( D1070 TNF alpha 


^ 

0.0 




EOL-1 dbcAMP 


16 


Dermal fibroblast 


0.0 




PMA/ionomycin 


CCD 1070 IL-1 beta 




Dendritic cells none 




Dermal fibroblast IFN 
gamma 


0.0 




Dendritic cells 1 PS 


0.0 


1 )enn;il librvbk si || - 1 


? 0 




Dendritic cells anti- 
CD40 


2.5 


IBD Colitis 2 

K - - 


0 0 




Monocytes rest 


I °° 


IBD Crohn's 


0 0 




Monocytes LPS 


0.0 


Colon 


100.0 




Macrophages rest 


0.0 


Lung 






Maeuf tiiLe^ LPs 


0.0 


Thymus 


0.0 




1 11 VIC none 


0.0 


Kidney 


2.4 




HUVEC starved 


0.0 
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Alcomprehensive panelvl.O Summary: Ag331 1 - Expression of this gene is 
low/uncletectable (CTs > 35) across all of the samples on this panel (data not shown). 



CNSneurodegenerationvl.O Summary: Ag331 1 - Expression of this gene is 
low/uncletectable (CTs > 35) across all of the samples on this panel (data not shown). 

General_screening_panel_vL4 Summary: Ag331 1 - Expression of this gene is 
low/undetectable (CTs > 35) across all of the samples on this panel (data not shown). 

Panel 4D Summary: Ag331 1 - Significant expression of this gene is seen only in colon 
(CT=33.9). Therefore, expression of this gene can be used to distinguish between this 
sample and the others on the panel and between healthy and inflammed bowel. Since 
expression is not detectable in samples derived from Crohn's and colitis patients, 
therapeutic modulation of the expression or function of this gene may be useful in the 
treatment of inflammatory bowel disease. 

V. CG58553-01: vasolpressin receptor 

Expression of gene CG58553-01 was assessed using the primer-probe set Ag3372, 
described in Table VA. Results of the RTQ-PCR runs are shown in Tables VB and VC. 



Table VA . Probe Name Ag3372 



Primers 


Sequences 


Length 


Start 
Position 


SEQID 
NO: 


(Forward 


5 ' -cggatctggtcatcacaca-3 ' 


19 


1983 


457 


jProbe 


TET-5' -ccacccacaacctcccaaggaact- 3 ' - 
TAMRA 


24 


2017 


458 


(Reverse 


5 ' -agcctcagaaggtcgagatg-3 ' 


20 


2041 


459 



Table V B. Panel 1.3D 



Tissue Name 


Rel. Exp.(%) 
Ag3372, Run 
165524269 


Tissue Name 


Rel. Exp.(%) 
Ag3372, Run 
165524269 


Liver adenocarcinoma 


0.7 


Kidney (fetal) 


0.0 


Pancreas 


0.2 


Renal ca. 786-0 


0.0 


Pancreatic ca. CAP AN 
2 


0.0 


Renal ca. A498 


0.1 


Adrenal gland 


0.0 


Renal ca. RXF 393 


0.0 


Thyroid 


°-> 


Renal ca. ACHN 


0.0 
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Salivary gland 


0.1 Renal ca.UO-31 


0.0 


Pituitary gland 


0.2 iRenalca TK 10 


0.0 


Brain (fetal) 


0.0 1 i\i 


2.1 


Brain (whole) 


0.3 jLiver (fetal) 


0.0 


Brain (amygdala) 


o o | L,ver ca - 

(hepatoblast) HepG2 


0.2 


Brsin (cerebellum) 


0.1 iLung 


2.4 


Brain (hippocampus) 


0.5 Lung (fetal) 


0.2 


Brain (substantia nigra) 


n „ iLung ca. (small cell) 
UZ LX-1 


0.0 


Brain (thalamus) 


„ „ iLung ca. (small cell) 
(NCI-H69 


0.0 


Cerebral Cortex 


n n ILung ca. (s.cell var.) 
U ' U SIIP-77 


0.1 


Spinal cord 


. n 1 iingca. (large 
|cell)NCI-H460 


0.0 


glio/astro U87-MG 


„ „ Lung ca. (non-sm. 
ccllj~A549 


0.1 


glio/astroU-118-MG 


n „ ILung ca. (non-s.cell) 
NCI-H23 


0.6 


astrocytoma SW1783 


"jL un g ca . (non-s.cell) 
UU |HOP-62 


0.1 


neuro*; met SK-N-AS 


n „ Lung ca. (non-s.cl) 
JNCI-H522 


0.0 


astrocytoma SF-539 0.0 ^"^a. (squam) 

SW V00 


0.0 


astrocytoma SNB-75 


0.1 


Lung ca. (squam.) 
NCI-H596 


- ™ 

0.0 


glioma SNB-19 


0.4 


Mammary gland 


0.7 


glioma U251 


0.2 


Breast ca.* (pl.ef) 
MCF-7 


0.0 


glioma SF-295 


0.0 


Breast ca.* (pl.ef) 
MDA-MB-231 


0.0 


Heart (fetal) 


n o iBreast ca.* (pl.ef) I 

T47D 1 u-i 


Heart 


0.0 


Breast ca. BT-549 


0.0 


S k e 1 e ta 1 muscle ( fet a ! ) 


0.0 


Breast ca. MDA-N 


0.0 


Skeletal muscle 


0.0 


Ovary 


0.0 


Bone marrow 


1.6 


Ovarian ca. 
OVCAR-3 


0.2 


Thymus 


1.7 


Ovarian ca. 
OVCAR-4 


0.0 


Spleen 


2.8 


Ovarian ca. 
OVCAR-5 


0.2 


; , n ph node 


5.5 


Ox arian ca. 


0.2 
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OVCAR-8 




Colorectal 


0.2 


Ovarian ca. IGROV- 
i 


0.0 


Stomach 


1.2 




Ovarian ca.* 
(ascites) SK-OV-3 


0.0 


Small intestine 


100.0 








0.0 


Colon ca. SW480 


0.0 




Placenta 


0.8 


Colon ca.* 

SW620I SW480 met) 


0.0 


Prostate 


0.1 


Colon ca. HT29 


0.0 


Prostate ca.* (bone 
met)PC-3 


0.0 


Colon ca.HCT-1 16 


0.0 


Testis 


1.4 


Colon ca. CaCo-2 


0.3 




Melanoma 

Hs688(A).T 


0.0 


Colon ca. 
tissue(OD03866) 


0.7 




Melanoma* (met) 
Hs688(B).T 


0.0 


Colon ca. HCC-2998 


3.8 


JMelanoma UACC- 
62 


0.0 


Gastric ca.* (liver met) 
NCI-N87 


1.0 


Melanoma M14 


0.2 


Bladder 


0.0 


Melanoma LOX 
IMVI 


0.2 


Trachea 


0.1 


Melanoma* (met) 
SK-MEL-5 


0.4 


Kidney 


0.6 


Adipose 


1.3 


Table VC. Panel 4D 


Tissue Name 


Rel. Exp.(%) 
Ag3372, Run 
165296616 


Tissue Name 


Rel. Exp.(%) 
Ag3372, Run 
165296616 


Secondary Hi I ai t 


1.4 


HUVEC IL-lbeta 


0.0 


Secondan rh2 act 


1 1 


HUVEC IFN gamma 


0.0 


Secondary Trl act 


2.9 


HUVEC TNF alpha + 
IFN gamma 




0.0 


Secondary Thl rest 


5.4 


HliVLC l\h alpha + 
II 1 




0.0 




Secondary Th2 rest 


M - 


HUVEC IL-1 1 




0.0 


Secondary Trl rest 


12.0 


Lung Microvascular EC 
none 


0.0 


Primary Thl act 

— — ■— — — 


11.4 


Lung Microvascular EC 
TNFalpha + IL-lbeta 


_ 


Primary Th2 act 


18.9 


Microvascular Dermal 
EC none 




o.o 1 


Primary Trl act 


27.0 


Microsvasular Dermal 




0.0 
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EC TNFalpha + lL-lbeta 




Primary Thl rest 


27.5 


Bronchial epithelium 
TMdlplK II .1 beta 


0.1 


Primary Th2 rest 


13.6 


Small airway epithelium 
none 


0.0 


Primary Trl rest 


, 9 „ (Small airway epithelium 
|TNFalpha + IL-lbeta 


0.0 


CD45RA CD4 
lymphocyte set 


3.0 Coronery artery SMC rest 


0.0 


CD45RO CD4 
lymphocyte act 


_ ^ (Coronery artery SMC 
TNFalpha + IL-lbeta 


0.0 


CD8 lymphocyte act 


5.8 Astrocytes rest 


0.0 


Secondary CD8 
lymphocyte rest 


„ Astrocytes rNFalpha + 
|IL-lbeta 


0.0 


Seconckry CD8 
lymphocyte act 


2.9 


KU-812 (Basophil) rest 


0.0 


CD4 lymphocyte none 


4.7 


KU-812 (Basophil) 
PMA/ionomycin 


0.1 


2ry Thl/Th2/Trl_anti- 


7.5 


CCD1106 

(Keratinocytes) none 


0.0 


LAK cells rest 


1.8 


CCD 1106 
(Keratinocytes) 
TNFalpha - IL-lbeta 


- . — 

0.0 


LAK cells IL-2 


5.8 iLiver cirrhosis 


1 .5 


LAK cells IL-2+IL-12 


2.3 Lupus kidney 


0.6 


I \K cells IL-2+IFN 
gamma 


5.5 INCI-H292 none 


2.5 


LAK cells IL-2+IL-18 


5.5 


NCI-H292 IL-4 


1.8 


LAK cells 
PMA/ionomycin 


2.7 


NCI-H292 IL-9 


5.9 


NK Cells IL-2 rest 


6.0 


NCI-H292 IL-13 


2.3 


Two Way MLR 3 day 


2. 1 NCI-H292 IFN gamma 


3.0 


I wo W a\ MLR 5 da} 


0.9 HPAECnone 0.0 


Two Way MLR 7 day 


1 HI PA EC INI alpha + II.- ()() 
|l beta 


PBMC rest 


1 .5 I ung fibroblast none 


0. ' 


PBMC PWM 


5.6 


Lung fibroblast TNF 
alpha + IL-1 beta 


0.0 


PBMC PHA-L 


0.9 


L uiu fibi )Ula,t IL-4 


0.0 


Ramos i B cell) none 


0.0 


1 nn<4 fibroblast 1 1 -* 


0.0 


Ramos (B cell) 
ionomycin 


0.2 Lung fibroblast IL- 13 


0.0 


B lymphocytes PWM 


2 2 Lung fibroblast IFN 
gamma 


0.0 


B lymphocytes CD40L 


3.7 iDermal fibroblast 


0.0 
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and IL-4 


CCD! 070 rest 




EOL-1 dbcAMP 


1.0 


(Dermal fibroblast 
\ A l ) I U/U 1 iNr alpna 


5.2 


EOL-1 dbcAMP 
PMA/ionomycin 


0.4 


IDermal fibroblast 
iCCD1070 11,-1 beta 


0.0 


Dendritic cells none 


0.2 


jDermal fibroblast IFN 
fgamma 


0.1 


Dendritic cells IPS 


0.0 


jDermal fibroblast IL-4 


0.0 


Dendritic cells anti- 
CD40 


0.0 


|lBD Colitis 2 


0 4 

0.4 


Monocytes resl 


0.3 


JlBD Crohn's 


8.4 


Monocles LPS 


0.2 


Colon 


100.0 


Macrophages rest 


0.7 


(Lung 


0.9 


MauopkuK , LPS 


0.0 


iThymus 


8.1 


HUVECnone 


0.0 


(Kidney 


6.8 


HUVEC starved 


0.0 j 


Zl 



Panel 1.3D Summary: Ag3372 Highest expression of the CG58553-01 gene is seen in the 
small intestine sample (CT=26.8). This gene encodes a novel vasopressin gene that plays a 
role in regulating electrolyte transport in the colon. Therefore, regulation of the transcript 
or the protein it encodes could be important in maintaining normal cellular homeostasis and 
in the treatment of Crohn's disease and ulcerative colitis. 



Among tissues with metabolic function, this gene is expressed in liver and adipose. 
Thus, this gene product may be involved in disorders that affect these tissues, such as 
obesity and type II diabetes. 

Low, but significant expression is also seen in the hippocampus. The hippocampus 
is critical for learning and memory. Thus, this gene product may have utility treating CNS 
disorders involving memory deficits, including Alzheimer's disease and aging. 

References: 

I . Sato Y, Hanai H, Nogaki A, Hirasawa K, Kaneko E, Hayashi H, Suzuki Y. Role 
of the vasopressin V(l) receptor in regulating the epithelial functions of the guinea pig 
distal colon. Am J Physiol 1999 Oct;277(4 Pt l):G819-28. 

Panel 4D Summary: Ag3372 In agreement with the results seen in panel 1 .4, the highest 
level of expression of this gene is in the colon sample (CT=27.5). Interestingly, the 
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expression is significantly lower in the IBD colitis 2 (CT>35) and IBD Crohn's 
(CT=30.9)samples. Therefore, alterations in the expression of this gene may be used in the 
treatment of Crohn's disease and ulcerative colitis. 

In addition, the expression of the CG58553-01 gene in several preparations of T 
5 lymphocytes suggests that small molecule antagonists, therapeutic antibodies specific for 
this mol ecule, or the extracellular domain of this protein, may be useful to reduce or 
eliminate the symptoms of Crohn's disease, ulcerative colitis, multiple sclerosis, chronic 
obstructive pulmonary disease, asthma, emphysema, rheumatoid arthritis, lupus 
erythematosus, or psoriasis. 

W. CG58626-01: Phospholipase 

Expression of gene CG58626-01 was assessed using the primer-probe set Ag3386, 
described in Table WA. Results of the RTQ-PCR runs are shown in Tables WB, WC and 
WD. 

Table WA . Probe Name Ag3386 



Primers Sequences 

_ 1 


Length 


Start 
Position 


SEQ II) 

NO: 


Forward 


5 1 -agtggcggtcaaaacttactct- 3 1 


22 


1386 


460 


Probe 


TET-5 ' -tggagacactgttgattccattactcctg- 
3 1 -TAMRA 


29 


1411 


461 


Reverse 


5 t g ft 1 3 1 1 a t 1 1 a - 3 ' 


22 


1455 


4U2 



1 5 Table WB . CNS neurodegeneration vl .0 



Tissue Name 


Rel. Exp.(%) Ag3386, 
Run 210154893 


Tissue Name 


Rel. Exp.(%) Ag3386, 
Run 210154893 


AD 1 Hippo 


6.4 


Control (Path) 3 
Temporal ( tx 


4.2 


AD 2 Hippo 


21.5 


Control (Path) 4 
Temporal Ctx 


25.0 


AD 3 Hippo 


5.0 


AD 1 Occipital 
Ctx 


14.4 


AD 4 Hippo 


5.3 


AD 2 Occipital 
Ctx (Missing) 


0.0 


AD 5 hippo 


95.3 


AD 3 Occipital 
Ctx 


3.3 


AD 6 Hippo 


51.1 


AD 4 Occipital 


18.7 
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L_ "i ' ck 




Control 2 Hippo 1 26.4 

1 


AD 5 Occipital 
Ctx 




Control 4 Hippo 


4.5 


AD 6 Occipital 
Ctx 


52.5 


Control (Path) 3 
Hippo 


4.0 


Control 1 Occipital 
Ctx 


2.4 


AD 1 Temporal Ctx 


-. - , Control 2 Occipital 
Ctx 


56.3 


AD 2 Temporal Ctx 


ry. s (Control 3 Occipital 

_ _ |Ctx 


11.0 


AD 3 Temporal Ctx 


3.8 


Control 4 Occipital 
C't\ 


4.3 


AD 4 Temporal Ctx 


18.9 


( ontrol (Path) 1 
Occipital Ctx 


100.0 

} 1 


AD 5 Inf Temporal 
Ctx 


95.9 


Control (Path) 2 
Occipital Ctx 


8.8 


AD 5 SupTemporal 
Ctx 


37.6 


Control (Path) 3 
Occipital Ctx 


1.7 


AD 6 Inf Temporal 
Ctx 


52.5 


Control (Path) 4 
Occipital Ctx 


12.3 


AD 6 Sup Temporal 
Ctx 


,, _ (Control 1 Parietal 
63J Ctx 


5.4 | 


Control 1 Temporal 
Ctx 


4.9 


Control 2 Parietal 
Ctx 


39.5 


Control 2 Temporal 
Ctx 


38.4 


Control 3 Parietal 
Ctx 


11.3 


Control 3 Temporal 
Ctx 


2 2 |Control (Path) 1 
JParietal Ctx 


77.4 1 


Control 4 Temporal 
Ctx 


g Q [Control (Path) 2 
Parietal Ctx 


20, | 


Control (Path) 1 
Temporal Ctx 


,„ Control (Path) 3 
JParietal Ctx 


2.7 | 


Control (Path) 2 
Temporal Ctx 


, 7 f [Control (Path) 4 
JParietal Ctx 


45.4 1 



Table WC . General screeningjpanel v 1 .4 



Tissue Name 


Rel. Exp.(%) 
Ag3386, Run 
217043839 


Tissue Name 


Rel. Exp.(%) 
Ag3386, Run 
217043839 


\dipose 


12.2 


Renal ca. TK-10 


10.7 


Melanoma* 
Hs688(A).T 


26.4 


Bladder 


18.4 


Melanoma* 


30.4 


Gastric ca. (liver met.) 


26.6 
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Hs688(B).T 




NCI-N87 




Melanoma* Ml 4 


13.1 


Gastric ca KATO III 


0.0 


Melanoma* 
LOXIMVI 


22.8 


Colon ca. SW-948 


10.8 


Melanoma* SK- 
MEL-5 


22.7 


Colon ca. SW480 


40.9 


Squamous cell 
carcinoma SCC-4 


1 1.2 


Colon ca.* (SW480 
met) SW620 


20.4 


1 cslis Pool 


47.0 


Colon ca. HT29 


5.2 


Prostate ca.* (bone 
met) PC-3 


80.1 


Colon ca. HCT-116 


— r „„ — „ 

100.0 


Prostate Pool 


7.1 


Colon ca. CaCo-2 


13 8 




3.2 


Colon cancer tissue 




Uterus Po 4 


6.4 


Colon ca. SW1116 


10.2 


Ovarian ca. 
OVCAR-3 


22.8 


Colon ca. Colo-205 


1.8 


Ovarian ca. SK- 
OV-3 


94.0 


Colon ca. SW-48 


2.4 


Ovarian ca. 
OVCAR-4 


4.7 


Colon Pool 


27.7 


OVCAR-5 


- 

29.3 


______________ 

Small Intestine Pool 


14.6 


IGROV-1 


12.7 


Stomach Pool 


12.2 


Ovarian ca. 
OVCAR-8 


11.1 


Bone Marrow Pool 


6.9 


Ovary 


11.6 


Fetal Heart 


8.1 ! 


Breast ca. MCF-7 


36.9 


Heart Pool 


6.3 


Breast ca. MDA- 
MB-231 


39.5 


Lymph Node Pool 


13.9 


Breast ca. BT 549 


28.5 


Fetal Skeletal Muscle 


3.6 


Breast ca '147D 52.9 


*4 cLtal Muscle Poo! 


6.7 


Breast ca. MDA-N 


11.3 


Spleen Pool 


17.1 


Breast Pool 


28.1 


1 'IV 1111. I' 1 


26.1 


Trachea 


11.0 


CNS cancer 

(glk astro) I 87-MG 


33.2 


Lung 


6.0 


CNS cancer 
(glio/astro) U-118-MG 


44.1 


Fetal Lung 


39.2 


CNS cancer 
(neuro;met) SK-N-AS 


44.4 


Lung ca. NCI-N417 


6.3 


CNS cancer (astro) SF- 
539 


10.4 


Lung ca. LX-1 


33.9 


CNS cancer (astro) 
SNB-75 


27.7 
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Lung ca. NCI-H146 


|CNS cancer (glio) 
143 SNB-19 


' ' ~ 

10.2 


Lung ca. SHP-77 


0 CNS cancer (olio) SI - 

73 2 295 

■ 


28.7 


Lung ca. A549 


25.3 (Brain (Amygdala) Pool 


23.2 


Lung ca. NCI-H526 




5 . 8 jBrain (cerebellum) 


19.8 


Lung ca. NCI-H23 


30.1 


Brain (fetal) 


35.6 


Lung ca. NCI-H460 


20.2 


Brain (Hippocampus) 
Pool 


25.2 


Lung ca. HOP-62 


11.9 


(. erebral ( ortex Pool 


39.2 _j 




20.7 


Brain (Substantia 
nigra) Pool 


23.0 


Liver 


0.7 


Brain (Thalamus) Pool 


45.7 


Fetal i i ver 


29.5 


Brain (whole) 


24.0 


Liver ca. HepG2 


ID 1 


spinal ( !ord Pool 


22.5 


Kidney Pool 




21.3 


Adrenal Gland 


8.5 


Fet \\ Kidney 


19.5 


Pitun ir\ eland Po-4 


7.0 


Renal ca. 786-0 


15.9 


Sal i van Gland 


1.9 


Renal ca. A498 




Thyroid (female) 


3.2 


Renal ca. ACHN 


8.0 


Pancreatic ca. 
CAPAN2 


3.7 


Renal ca. UO-31 


12.2 


Pancreas Pool 


18.2 


Table WD. Panel 4D 


Tissue Name 


Rel. Exp.(%) I 

Ag3386, Run j Tissue Name 
165296474 J 


Rel. Exp.(%) 
Ag3386, Run 
165296474 


Secondary Thl act 


30.4 jHUVEC IL-lbeta 


2.0 


Secondares lh2 act 


35 ft 'HI'VECIFN gamma 


3.3 


Secondary Trl act 


27.9 


HUVEC TNF alpha + 
IFN gamma 


3.8 


Secondary Thl rest 


8.9 


HUVEC TNF alpha + 
II I 


3.3 


Secondary Th2 rest 


8.0 


HI \ 1 ( II -1 1 


1.5 


Secondary Trl rest 


11.3 


Lung Microvascular EC 
none 


5.5 


Primary Thl act 


57.4 


Lung Microvascular EC 
TNFalpha + IL-lbeta 


4.8 


Primary Th2 act 


g Microvascular Dermal 
i EC none 


3.7 


Primary Trl act 


. iMicrosvasular Dermal 

EC TNFalpha + IL-lbeta 


33 1 


Primary Thl rest 


54.0 Bronchial epithelium 


5 - 5 . ! 
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jTNFalpha + ILlbeta 




Primary Th2 rest 


! s s Small airway epithelium 
Jnone 


... 

2.1 




Primary Trl rest 


9 . _ Small airway epithelium 
INFalpha + lL-lbeta 


6.1 


CD45RA CD4 
lymphocyte set 


12.4 


Coronery artery SMC rest 


4.6 


CD45RO CD4 
lymphocyte act 


33.9 


Coronery artery SMC 
l'NTalpha v [L-lbeta 


2.4 


CDS IvmpIuKvlc act 


29.3 


A.stroc>tes rest 


3 0 


Secondary CD8 
lymphocyte rest 


26.1 


Astrocytes TNF alpha + 
IL-lbeta 


3.0 


Secondary CD8 
lymphocyte act 


20.7 


KU-812 (Basophil) rest 


12.1 


CD4 lymphocyte none 


1.1 


KU-812 (Basophil) 
PMA/ionomy c i n 


27.7 


2ry Thl/Th2/Trl_anti- 
LsiJyj CM J 1 


12.8 


CCD 1106 
1 z\ itinoc; Li s) none 


2.7 


LAK cells rest 


12.0 


CCD1106 

(Keratinocytes) 

1 NFalpha v IL-lbeta 


0.8 


LAK cells IL-2 


24.3 l.i\er cirrhosis 


0 3 


LAK cells IL-2+IL-12 


28.7 Lupus kidney 


h~r 7 - 


LAK cells IL-2+IFN 
gamma 


42.0 


NCI-H292 none 


™ 

8.1 


LAK cells IL-2+ IL-18 


45.1 


NCI-H292 IL-4 


9.5 


LAK cells 
PMA/ii ii' imycin 


r 

8.8 


NCI-H292 IL-9 


8.5 


jNK Cells IL-2 rest 


21.8 


NCI-H292 IL-13 


4.5 


Two Way MLR 3 da\ 


18.7 


NCI-H292 IFN gamma 


3.6 


Two Way MLR 5 day 


11.0 


HPAEC none 


2.5 


JTwo Way MLR 7 day 


10.9 


HPAEC TNF alpha + IL- 
1 beta 


3.0 


PBMC rest 


4.5 "Lung fibroblast none 


4.6 


PBMC PWM 


^ q |Lung fibroblast TNF 
(alpha + IL-1 beta 


3.3 


PRA/IP pr I a T 
rtSMC rtlA-JL 


17.9 l imy fibroblast 11 -4 


UA 


Ramos (B cell) none 


26.1 Lung fibroblast IL-9 


12.3 


Ramos [B cell) 
ionomycin 


100.0 Lung fibroblast IL- 1 3 


6.7 


B lymphocytes PWM 


ss q Lung fibroblast 1FN 

Igamma 


16.6 


B Knipiocues CD40I 
and IL-4 


4Q „ iDermal fibroblast 
)CCD 1070 rest 


8.2 

1— -- 


EOl -1 dbcAMP 


13.0 (Dermal fibroblast 


37.1 
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I C C D 1 070 TNF alpha J 1 


CHI 1 .^U/^AA/TD 

c^JJL- 1 UDCAIVIr 
PMA/ionomycin 


q - I Dermal fibroblast 
CCD1070IL-1 beta 


4.4 


Dendritic cells none 


g g Dermal fibroblast IFN 
gamma 


6.0 


Dendritic cells LPS 


5.4 Dermal fibroblast 1L-4 


12.1 


Dendritic cells anti- 
CD40 


4.3 


IBD Colitis 2 


0.7 

_ _J 


Monocytes rest 


4.7 


IBD Crohn's 


0.5 


Monocytes LPS 


2.6 IColon 


3.4 




X.X Lung 


4.9 


Macrophages LPS 


2.8 (Thymus 


4.1 


HUVECnone 2.8 IKidney 


13.0 


HUVEC starved f 6.4 j 





CNSneurodegenerationvl.O Summary: Ag3386 This panel confirms the expression of 
this gene at moderate to low levels in the brain in an independent group of individuals. 
However, no differential expression of this gene was detected between Alzheimer's 
diseased postmortem brains and those of non-demented controls in this experiment. Please 
5 see Panel 1 .4 for a discussion of the potential utility of this gene in treatment of central 
nervous system disorders. 

General_screenmg_panel_vl.4 Summary: Ag3386 This gene is moderately expressed in 
most of the samples on this panel. Based on expression in this panel, this gene may be 
involved in gastric, pancreatic, brain, colon, renal, lung, breast, ovarian and prostate cancer 
10 as well as melanomas. Thus, expression of this gene could be used as a diagnostic marker 
for the presence of these cancers. Furthermore, therapeutic inhibition using antibodies or 
small molecule drugs might be of use in the treatment of these cancers. 

This gene product is also expressed in adipose, pancreas, adrenal, thyroid, pituitary, 
skeletal muscle, heart, and liver. This widespread expression in tissues with metabolic 
15 function suggests that this gene product may be important for the pathogenesis, diagnosis, 
and/or treatment of metabolic and endocrine diseases, including obesity and Types 1 and 2 
diabetes. 

In addition, this gene is expressed at moderate levels in the CNS. Therefore, this 
gene may play a role in central nervous system disorders such as Alzheimer's disease, 
20 Parkinson's disease, epilepsy, multiple sclerosis, schizophrenia and depression. 
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Panel 4D Summary: Ag3386 The CG58626-01 transcript is expressed ubiquitously in 
this panel. Highest expression of this transcript is seen in activated Ramos cells and 
activated B cells (CTs=27). The expression of this transcript in activated lymphoid cells 
when compared to non activated cells suggests that the CG5 8626-0 1 gene may be 
important for the diagnosis or pathogenesis of immune mediated diseases. Therefore, 
modulation of the expression and/or activity of this gene product might important for the 
treatment of autoimmune diseases, allergy, and delayed type hypersensitivity. 

X. CG57597-01: Hypothetical protein 

Expression of gene CG57597-01 was assessed using the primer-probe set Ag3293, 
described in Table XA. 

Table X A. Probe Name Ag3293 



Primers 


Sequences 


Length 


Start 
Position 


SEQ ID 
NO: 


Forward 


5 ' -cagaaacctgtgaactctgcat-3 ' 


22 


40 


463 


Probe 


TET-5 ■ -atgcaccaccactcctggctaatttt-3 ' - 

TAMRA 


26 


69 


464 


Reverse 


5 1 -ataaaaggtttgagccggatt-3 ' 


21 


115 





CNSneurodegenerationvl.O Summary: Ag3293 - Expression of this gene is 
low/undetectable (CTs > 35) across all of the samples on this panel (data not shown). 



General_screening_panel_vl.4 Summary: Ag3293 - Expression of this gene is 
low/undetectable (CTs > 35) across all of the samples on this panel (data not shown). 

Panel 4D Summary: Ag3293 - Expression of this gene is low/undetectable (CTs > 35) 
across all of the samples on this panel (data not shown). 

Y. CG57804-01:talin 

Expression of gene CG57804-01 was assessed using the primer-probe set Ag3337, 
described in Table YA. Results of the RTQ-PCR runs are shown in Tables YB, YC and 
YD. 



Table YA . Probe Name Ag3337 
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Primersj Sequences 


Length 


Start j SEQID 
Position NO: 


Forward 


5 1 -ggatttcaagcccagatacaat-3 1 


22 


781 


466 


Probe 


TET-5 1 -tggacctcatgtggaacataaacaca-3 ' - 
TAMRA 


26 


804 


467 


Reverse 


5 1 -ggcaggaattccttcagatc-3 ' 


20 


844 


468 



Table YB . CNS_neurodegeneration_vl.O 



• 

Tissue Name 


Re!. Exp.(%) Ag3337, 

Run 210138775 UssueName 


Run 210138775 


AD 1 Hippo 


6.8 


(Control (Path) 3 
1 emporal Ctx 


3.6 


AD 2 Hippo 


25.3 




Control (Path) 4 
Temporal Ctx 


22.4 


AD 3 Hinno 


3.6 




AD 1 Occipital Ctx 


5 6 


AD 4 Hippo 


5.7 


AD 2 Occipital Ctx 
(Missing) 


0.0 


AD 5 Hippo 


78.5 


AD 3 Occipital Ctx 


- — 2 ~ 2 


AD 6 Hippo 


27.5 


AD 4 Occipital Ctx 


14.7 


Control 2 Hippo 


27.4 


AD 5 Occipital Ctx 


44.1 


Control 4 Hippo 


8.1 


AD 6 Occipital Ctx 


16.6 


Control (Path) 3 
Hippo 


4.3 


Control 1 Occipital 
Ctx 


1.6 


AD 1 Temporal 
Ctx 


7.6 


Control 2 Occipital 
Ctx 


67.8 


AD 2 Temporal 

Ctx _ 


24.5 


jControl 3 Occipital 
( i\ 


11.9 


AD 3 Temporal 
Ctx 


3.3 


jControl 4 Occipital 
Ctx 


3.0 


AD 4 Temporal 
Ctx 


15.3 




Control (Path) 1 
Occipital Ctx 


89.5 


AD 5 In f Temporal 
Ctx 


89.5 


jControl (Path) 2 
lOccipital Ctx 


8.2 


AD 5 Sup 
Temporal Ctx 


35.8 




Control (Path) 3 
Occipital Ctx 




0.6 


AD 6 Inf Temporal 
Ctx 


27.4 




Control (Path) 4 
Occipital Ctx 


10.3 


AD 6 Sup 
Temporal Ctx 


32.8 


Control 1 Parietal 

( IX 


3.6 


Control 1 
Temporal Ctx 


3.4 


Control 2 Parietal 
Ctx 


23.7 


Control 2 
Temporal Ctx 


47.6 




Control 3 Parietal 
Ctx 


14.1 


Control 3 


12.4 


(Control (Path) 1 


100.0 
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[Temporal Ctx 


Parietal C tx 




(Control 3 
[Temporal Ctx 


5.8 


Control (Path) 2 
Parietal Ctx 


21.9 


|ControT(Path) 1 
[Temporal Ctx 


64.2 


Control (Path) 3 
Parietal Ctx 


2.0 


(Control (Path) 2 
(Temporal Ctx 


42.0 


Control (Path) 4 
Parietal Ctx 


39.2 



Table YC . General_screening_panel_vl.4 



r 

1 Tissue Name 




Rel. Exp.(%) 
Ag3337, Run 
215773748 


Tissue Name 


Rel. Exp.(%) 
Ag3337, Run 
215773748 


jAdipose 


20.2 


Renal cii. 1 K-10 


22.1 


Melanoma* 
|Hs688(A).T 


58.6 


Bladder 


14.2 

" — - " 


(Melanoma* 
Hs688(B).T 


22.8 


Gastric ca. (liver met.) 
NCI-N87 


16.2 


Melanoma* M14 


5.7 


Gastric ca. KATO III 


100.0 


(Melanoma* 
1 ( >XIMV1 


5.5 


Colon ca. SW-948 


16.3 


Melanoma* SK- 
?MEL-5 


3.4 


Colon ca. SW480 


4.2 


[Squamous cell 
carcinoma SCC-4 


4.4 


Colon ca.* ISO 
met) SW620 


2.6 


]Testis Pool 


5.1 


Ccicn ca. HT29 


0.7 


Prostate ca.* (bone 
imet) PC -3 


6.4 


Colon ca. HCT-116 


7.6 


prostate Pool 


3.4 


Colon ca. CaCo-2 


81.8 


Placenta 


1.6 


Colon cancer tissue 


1.7 


Uterus Pool 


2.1 


Colon ca. SW1 1 16 


1.6 


lOvarian ca. 
< JVCAR-3 


8.9 


Colon ca. Colo-205 


0.1 


(Ovarian ca. SK- 
lOV-3 


32.1 


Colon ca. SW-48 


3.2 


< >\ ii ian ca. 

OVCAR-4 


7.2 


Colon Pool 


8.0 




lOvarian ca. 
OVCAR-5 


21.0 


Small Intestine Pool 


7.9 


Ovarian ca. 
jlGROV-1 


23.5 


Stomach Pool 




Ovarian ca. 

loycAR-8 


5.4 


Bone Marrow Pool 


3.8 


(Ovary 


117 


Fetal 1 leart 


24.8 


(Breast ca. MCF-7 


?;! 


Heart Pool 


10.2 
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Breast ca. MDA- 
MB-231 


19.5 


Lymph Node Pool 


10.6 


Breast ca. BT 549 


11.7 


1 etal Skeletal Muscle 


30.1 


Breast ca. T47D 


30.8 


Skeletal Muscle Pool 


24.8 


Breast C3. NIDA-N 


5.0 


Spleen Pool 




Breast Pool 


6.9 


Thymus Pool 


0.0 


Trachea 




10.3 


CNS cancer 
(glio/astro) U87-MG 


10.7 


Lung 


2.2 


CNS cancer 
(glio/astro) U-118-MG 


68.3 


Fetal Lung 


10.8 


CNS cancer 
(neuro;met) SK-N-AS 


23.5 


Lung ca.NCI-N417 


0.8 

— 


CNS cancer (astro) SF- 


21.5 


Lung ca . LX-1 

I , _ 


1.7 


. ________ 

CNS cancer (astro) 
SNB-75 


40.3 


Lung ca.NCI-H146 


0.4 


CNS cancer (glio) 
SNB-19 


27.7 


Lung ca. SHP-77 


11.9 


CNS cancer (glio) SF- 
295 


38.2 


Lung ca. A549 


13.6 


Bulla { 4 imgdakn Pool 


28.7 


Lung ca. NCI-H526 


7.6 


Brain (cerebellum) 


38.7 


Lung ca. NCI-H23 


10.2 jBrain (fetal) 


58.6 


Lung ca. NCI-H460 


5.1 


Brain (Hippocampus) 
Pool 


r " 

25.7 


Lung ca. HOP-62 


3.8 


Cerebral Cortex Pool 


59.0 


Lung ca. rsL-1-rljzz 


10.1 


Brain (Substantia 
nigra) Pool 


39.2 


1 iver 


0.3 


Brans ( 1 halamu .1 Pu<>1 


51.4 


Fetal Liver 


15.3 


Brain (whole) 


58.2 


Liver ca. HepG2 




Spinal Coul Pool 


18.6 


Kidney Pool 


18.4 


Adrenal Gland 


11.1 


Fetal Kidney 


114 


Pituitan gland Pool 


-.! 


Renal "Kn-0 


31.0 Salivary Gland 


14.0 


Renal ca. A498 


0.7 ■ Thyroid (female i 


4.9 


Renal ca. ACHN 


_» _ IPancreatic ca. 
203 |CAPAN2 


j 


Renal ca. 1 U >-3 1 


8. 1 Pancreas Pool 


9.4 | 



Table YD . Panel 4D 



Rel. Exp.(%) 
Ag3337, Run 
165725932 



Tissue Name 



Rel. Exp.(%) 
Ag3337, Run 
165725932 
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Secondary Thl act 


0.0 


III VEC IL-1 beta 


0 7 




0.0 


III VEC IFN samma 




Secondary Trl act 


Q 4 IHUVEC TNI- alpha + 
jlFN gamma 


I _ 


Secondary Thl rest 


HUVECTNFalpha + 
|IL4 


0.6 


Secondary Th2 rest 


0.0 IIUVEC IL-1 1 


_. _ 

— 0.3 


Secondary Trl rest 


q 2 Lung Microvascular EC 
jjnone 


2.1 


Primary Thl act 


ft ~ Lung Microvascular EC 
Uo rNFalpha • IL-1 beta 


5.1 


Primary Th2 act 


^ ^ Microvascular 1 )ennal 
i EC none 


16.4 

9.8 


Primary Trl act 


0 f IMicrosvasular Dermal 

'EC TNFalpha + IL-1 beta 


Primary Thl rest 


1.3 


Bronchial epithelium 
TNFalpha + IL1 beta 


1.2 


Primary Th2 rest 


0.6 


Small airway epithelium 
none 


1.3 


Primary Trl rest 


0.3 


Small airway epithelium 
TNFalpha + IL-1 beta 


2.1 


( D45RA CD4 
lympnocyte act 


18.7 


Coronery artery SMC rest 


9.9 


CD45RO CD4 
K ipphi > j\te act 


0.6 


Coronery artery SMC 
TNFalpha + IL-1 beta 


3.5 


CD8 lymphocyte act 


12 


Astrocytes rest 


100.0 


Secondary CD8 
lymphocyte rest 


0.9 


Astrocytes TNFalpha + 
IL-1 beta 


65.5 


Secondary CD8 
l\ mphoc> te act 


0.2 [KU-812 (Basophil) rest 


11.7 


CD4 lymphocyte none 


Q 0 KU-812 (Basophil) 
sPMA/ionomycin 


8.5 


2ry Thl/Th2/Trl_anti- 
CD95 CHI 1 


Q3 SCCD1106 

(Keratin cytc. 1 1 none 


-v. - 


LAK cells rest 


.CCD 1106 
4.0 l(Keratinocytes) 

rNFalpha + IL-1 beta 


2.0 


LAK cells IL-2 


1 .2 (Liver cirrhosis 


3.6 


LAK cells IL-2+IL-12 


0.4 Lupus kidney 


13.6 


LAK cells IL-2+IFN 
gamma 


2.1 'NCI-H292 none 

J 


11.0 


LAK cells IL-2+IL-18 


1.2 JNCI-H292 IL-4 


25.0 


LAK cells 

I'M \ ionnmycin 


2.0 1nCI-H292 IL-9 

^ , L 


15.6 


\K Cells II -2 rest 


0.0 |NCI-H292IL-13 


12.5 



632 



Two Way MLR 3 day 


5.2 NCI-H292 1FN gamma 


4.6 


rwo Waj MLR 5 da) 


2.7 HPAEC none 


1.5 


Two Way MLR 7 day 


R HPAEC TNF alpha - IL- 
• 5 |l beta 


2.5 


PBMC rest 


0.2 Lung fibroblast none 


80.1 


PBMC PWM 


IT una fihrnhlaQt TT\TF 
j ^ iJuUllg lluluuia^L 1 INF 

j alpha + IL-1 beta 


22.7 


PR\1< ' PI 1 \-l 
I I) lvi j. 1 1 / \ - 1 j 


u.j |i_.ung iiuroDiasi ii^-^t 




Ramos (B cell) none 


2.4 Lung fibroblast II 


47.6 


Ramos (B cell) 
ionomycin 


2.1 |Lung fibroblast IL-1 3 


81.8 


B lymphocytes PWM 


Q7 Lung fibi hLtsi [FN 
gamma 


50.7 


B lymphocytes CD40L 

anr\ TT A 


^ , iDermal fibroblast 
ICCD 1070 rest 


94.6 


EOL-1 dbcAMP 


. q jDermal fibroblast .„ ? 

CCD 1070 TNF alpha | 3 ' 


bUL-1 GDCAJVLr 

PMA/ionomycin 


1.2 


Dermal fibroblast 
CCD1070 IL-1 beta 


31.2 


Dendritic cells none 


12.8 


Dermal fibroblast IFN 
gamma 


14.2 


Dendritic cells LPS 


1.3 


Dermal fibroblast IL-4 


C) 5 q 


Dendriti c cells anti- 
CD40 


11.4 


IBD Colitis 2 


1.2 


Monocytes rest 


0.5 


11-51) C rohn's 


9.1 


Monocytes LPS 


1.3 


Colon 


(»0.7 


Macrophages rest 


13.6 1 


Lung 


8.0 i 


M iciop lages 1 PS 


2.8 Thvinus 


39.2 j 


HI VL( none 


1.0 


Kidney 


10.4 


HUVEC starved 


1 f» 







CNS_neurodegeneration_vl.O Summary: Ag3337 - This panel confirms the expression 
of this gene at low to moderate levels in the brain in an independent group of individuals. 
However, no differential expression of this gene was detected between Alzheimer's 
diseased postmortem brains and those of non-demented controls in this experiment. Please 
5 see Panel 1 .4 for a discussion of the potential utility of this gene in treatment of central 
nervous system disorders 

Generall_screening_panel_vl.4 Summary: Ag3337 - This gene is expressed in almost all 

samples on this panel. This gene is expressed at moderate levels in the CNS. Therefore, this 

gene may play a role in central nervous system disorders such as Alzheimer's disease, 

10 Parkinson's disease, epilepsy, multiple sclerosis, schizophrenia and depression. 
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In addition, this gene is also expressed in adipose, pancreas, adrenal, thyroid, 
pituitary, skeletal muscle, heart, and liver. This widespread expression in tissues with 
metabolic function suggests that this gene product may be important for the pathogenesis, 
diagnosis, and/or treatment of metabolic and endocrine diseases, including obesity and 
Types 1 and 2 diabetes. 

Panel 4D Summary: Ag3337 This gene is most highly expressed in resting astrocytes 
(CT=28.9). In addition, this gene is highly expressed in a cluster of treated and untreated 
samples derived from lung and dermal fibroblasts. Thus, therapeutic modulation of the 
expression or function of this gene may be effective in the treatment of pathological and 
inflammatory lung and skin diseases, such as psoriasis, asthma, emphysema, and allergies. 

Z. CG57551-01: NAC-1 Like Gene 

Expression of gene CG57551-01 was assessed using the primer-probe set Ag3282, 
described in Table ZA. Results of the RTQ-PCR runs are shown in Tables ZB, ZC and ZD. 

Table ZA . Probe Name Ag3282 



Primers 


Sequences 


Length 


Start 
Position 


SEQID 
NO: 


FuiW ltd j it ct agctt tg taca-3' 


22 


269 J 469 


Probe 


TET-5 ' -accagttcctgctcatgtacacggct-3 1 - 

TAMRA 


26 ; 


318 


470 


Reverse 


5 ' -atctcctggatctgcaggaa-3 ' 


"!0 


347 


471 



Table ZB . CNSneurodegenerationvl.O 



Tissue Name 


Rel. Exp.(%) Ag3282, 
Run 210060482 


Tissue Name 


Rel. Exp.(%) Ag3282, 
Run 210060482 


AD 1 Hippo 


22.8 


Control (Path) 3 
Temporal Ctx 


9.7 


AD 2 Hippo 


49.0 


Control (Path) 4 
Temporal Ctx 


24.3 


AD 3 Hippo 


1 , , IAD 1 Occipital 

JCtx 


16.5 


AD 4 Hippo 


12.3 


AD 2 Occipital 
Ctx (Missing) 


0.0 


AD 5 hippo 


66.9 


AD 3 Occipital 
Ctx 


10.2 


AD 6 Hippo 


59.9 


AD 4 Occipital 


18.2 
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_ 


Ctx 





Control 2 Hippo 


dQ , IAD 5 Occipital 

jCtx 


9.5 


Control 4 Hippo 


, 0 _ AD 6 Occipital 
18J JCtx 


41.5 


Control (Path) 3 
Hippo 


g 3 jControl 1 Occipital 


6.8 


AD 1 Temporal Ctx 


1 q ? IControl 2 Occipital 
1VZ jCtx 


91.4 


AD 2 Temporal Ctx 


4n ^ Control 3 Occipital 
W ' 3 jCtx 


16.3 


AD 3 Temporal Ctx 


1 4 _ jControl 4 Occipital 

jCtx 


12.2 


AD 4 Temporal Ctx 


183 iControl (Path) 1 
^Occipital Ctx 


100.0 


AD 5 Inf Temporal 
Ctx 


66 Q jControl (Path) 2 
■Occipital Ctx 


9.2 


AD 5 SupTemporal 
Ctx 


4 Control (Path) 3 
jOccipital Ctx 


5.3 


AD 6 Inf Temporal 
Ctx 


36 j iControl (Path) 4 
jOccipital Ctx 


15.8 


AD 6 Sup Temporal 
Ctx 


^ 4 4 jControl 1 Parietal 
Ctx 


11.7 


Control 1 Temporal 
Ctx 


jq q jControl 2 Parietal 

jCtx 


34.2 


Control 2 Temporal 
Ctx 


74 7 jControl 3 Parietal 
JCtx 


23.3 


Control 3 Temporal 
Ctx 


Control (Path) 1 
jParietalCtx 


72.7 


Control 4 Temporal 
Ctx 


15.5 


Control (Path) 2 
Parietal Ctx 


21.6 


Control (Path) 1 
1 emporal Ctx 


74.2 


Control (Path) 3 
Parietal < tx 


5.5 


Control (Path) 2 
Temporal Ctx 


31.2 


Control (Path) 4 
Parietal Ctx 


35.8 



Table ZC . General_screening_panel_vl.4 



Tissue Name 


Rel. Exp.(%) 
Ag3282, Run 
216512995 


Tissue Name 


Rel. Exp.(%) 
Ag3282, Run 
216512995 


Adipose 


1.8 


Renal ca. TK-10 


22.7 


Melanoma* 
Hs688(A).T 


16.3 


Bladder 


6.3 


Melanoma* 


25.0 


Gastric ca i liver met.) 


47.0 
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Hs688(B).T 




1N\^1-1N O / 




jfylelanoima* Nil 4 




UaSLnC Ca. Iv/\.1L/ 111 




Melanoma* 
LOX1MV] 


21.6 

.., 


Colon ca. SW-948 


19.3 


Nlelanoima* SK- 
MEL-5 


17.0 


Colon ca. SW480 


50.3 


Squamous cell 
carcinoma SCC-4 


24.7 


Colon ca.* (SW480 
met) SW620 


25.9 


Testis Pool 


6.1 


Colon ca. HT29 


17 7 


Prostate ca.* ( bone 
met) PC-3 


67.8 


Colon ca. HCT-116 


™ 

100.0 


Prostate Pool 


3.5 


Colon ca. Ca( o-2 




Placenta 




Colon cancer tissue 


14.0 


Uterus Pool 


0.6 


Colon ca. SW1116 


12.7 


Ovarian ca. 
OVCAR-3 


41.2 


Colon ca. Colo-205 


~. 

7.6 


Ovarian ca. SK- 
OV-3 


65.5 


Colon ca. SW-48 




Ovarian ca. 
UVLAK-4 


35.8 

- — — 


Colon Pool 


4.9 1 


Ovarian ca. 
OVCAR-5 


37.6 


Small Intestine Pool 


_____ 


Ovarian ca. 
IGROV-1 


28.9 


Stomach Pool 


3.3 


Ovarian ca. 
OVCAR-8 


14.2 


Bone Marrow Pool 


1.5 


Ovary 


3.9 


Fetal Heart 


3.0 | 


Breast ca. MCF-7 


42.3 


Heart Pool 


2.2 


Breast ca. MDA- 
MB-231 


69.7 


Lymph Node Pool 


5.6 


Breast ca. B 1 M" 


51.4 


Fetal Skeletal Muscle 


1.5 


Breast ca. N 7 I> 


86.5 


Skeletal Muscle Pool 


0.0 


Dreasi ca. iviu/\-in 




Spleen Pool 


ZZ .... 2 : 8 


Breast Pool 

- - — - • 


4.6 


Thymus Pool 


3.8 


Trachea 


8.7 


CNS cancer 

(glio astro) U87-MG 


60.3 


Lung 


0.2 


CNS cancer 
(glic/astro)U-l 18-MG 


100.0 


Fetal Lung 


6.3 


CNS cancer 
(neuro;met; SK-N-AS 


47.3 


Lung ca.NCI-N417 


8.4 


CNS cancer (astro) SF- 
539 


22.8 


Lung ca. LX-1 


17.3 


CNS cancer (astro) 
SNB-75 


47.3 
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Lung ca. NCI-H146 


. - , CNS cancer (glio) 


29.3 


Lung ca. SHP-77 


CNS cancer (^lio) SF 
lt0 ^ 1295 


49.3 


Lung ca. A549 


27.2 Brain ( \m\ gdala) I" >ol 


6.9 


Lung ca. NCI-H526 


6.1 jBrain (cerebellum) 


15.1 


Lung ca. NCI-H23 


25.9 jBrain (fetal) 


9.2 


Lung ca. NCI-H460 


g q jBrain (Hippocampus) 
JPool 


8.9 


Lung ca. HOP-62 


1 1 .9 |Cerebral Cortex Pool 


13.4 


JLUng ca. JNL-l-hOzz 


2 j g Brain (Substantia 
Jnjgra) P°°l 


15.3 


1 iver 


1 .7 jBrain (Thalamus) Pool 


11.5 


Feral Liver 


9.8 Brain (whole) 


12.6 


Liver Cci. i IepCi2 


18.2 iSpinal Cord Pool 


7.1 


kidney Pool 


5.0 


Adrenal Gland 


5.3 


Fetal Kidney 


3.4 


Pituitary gland Pool 


I 9 — 


Renal ca. 786-0 


42.0 


Salivary Gland 


4.0 


Renal ca. A498 


16.7 


Thyroid (female) 


3.6 


Renal ca. ACHN 


13.9 


Pancreatic ca. 
CAPAN2 


15.5 


Renal ca. UG-3 1 


17.4 


Pancreas Pool 


6.6 



Table ZD . Panel 4D 



Tissue Name 


Rel. Exp.(%) 
Ag3282, Run 
164634321 


Tissue Name 


Rel. Exp.(%) 
Ag3282, Run 
104034321 


Secondary Thl act 


52.9 


HUVEC IL-lbeta 


13.6 


Secondary Th2 act 


67.8 


HUVEC IFN gamma 


42.9 


Secondary Trl act 


75.3 


HUVEC TNF alpha + 
IFN gamma 


37.1 


Secondary Thl rest 


8.4 


HUVEC TNF alpha + 
II - 


42.6 


Secondary Th2 rest 


11.4 


HUVEC IL-1 1 


25.9 


Secondary Trl rest 


12.2 


Lung Microvascular EC 
none 


41.2 


Primary Thl act 


53.6 




Lung Microvascular EC 
rNFalpha ■ IL-lbeta 


36.3 


Primary Th2 act 


44.4 


Microvascular Dermal 
EC none 


50.3 


Primary Trl act 


, n 7 jMicrosvasular Dermal 

EC TNFalpha + IL-1 beta 


33.0 


Primary Thl rest 


37.6 


Bronchial epithelium 


51.4 
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! 


TNFalpha + ILlbeta 


Primary Th2 rest 


15.8 


Small airway epithelium 
none 


23.3 


Primary Trl rest 


18.3 


Small airway epithelium 
TNFalpha + IL-1 beta 


71.7 


CD45RA CD4 
lymphocyte act 


33.0 jCoronery artery SMC rest 


43.5 


CD45RO CD4 
lymphocyte act 


s4 „ Coronery artery SMC 
rNFalpha + IL-lheta 


31.0 


CDS U mphocyte not 


42.9 Astrocytes rest 


38.4 


Secondary CD8 
lymphocyte rest 


co lAstrocytes TNFalpha + 
• U,J _ JL^lbeta _ 


37.1 


Secondary CD8 
lymphocyte act 


32.5 KU-812 (Basophil) rest 


1 


CD4 lymphocyte none 


24 KU-812 (Basophil) 
PMA/ionomycin 


90.8 


2ryThl/Th2/Trl_anti- 

UJJyD LH 1 1 


11.7 


CCD1106 

(Keratinocytes) none 


64.2 


LAK cells rest 


18.9 


CCD1106 
(Keratinocytes) 
TNFalpha + IL-1 beta 


34.4 


LAK cells IL-2 


41.2 Liver cirrhosis 


2.9 


LAK cells [L-2 1L-12 


29.5 iLupus kidney 


2.2 


LAK cells IL-2+IFN 
gamma 


36.3 


NCI-H292 none 


38.4 


LAK cells IL-2+IL-18 


34.2 


NCI-H292 IL-4 


— 66 9 


LAK cells 
PMA/ionomycin 


11.8 


NCI-H292 IL-9 


62.4 


NK Cells IL-2 rest 


29.3 


NCI-H292IL-13 


65.1 


Two Way MLR 3 da\ 


21.9 


NCI-H292 IFN gamma 




Two Way MLR 5 day 


27.7 


HPAEC none 


31.2 


Two Way MLR 7 day 


27.0 


HPAEC TNFalpha + IL- 
1 beta 


37.6 


PBMC rest 


6.5 


Lung tlbiohiaq none 


35.6 


PBMC PWM 


89.5 


Lung fibroblast TNF 
alpha + IL-1 beta 


20.7 


pniuf pxj a T 


536 


Lung fibroblast IL-4 


63.3 


Ramos (B cell) none 


40.6 


Lung fibroblast IL-9 


55.5 


Ramos (B cell) 
ionomycin 


56.3 


Lung fibroblast IL-1 3 


44.8 


B lymphocytes PWM 


100 0 p un g fibroblast IFN 
Igamma 


71.2 

1 


B lymphocytes CD40L 
and IL-4 


41.2 


Dermal fibroblast 
v i. Dl 070 rest 


78.5 


EOL-1 dbcAMP 


50.0 


Dermal fibroblast 


KX.w 
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CCD 1070 TNF alpha 




EOL-1 dbcAMP 
PMA/ionomycin 


46.3 


jDermal fibroblast 
CCD 1070 IL-1 beta 


49.7 


Dendritic cells none 


33.2 


jDermal fibroblast IFN 
igamma 


21.5 


iDendritic cells LPS 


26.1 


— 

JDermal fibroblast IL-4 


43.8 


Dendritic cells anti- 
CD40 


29.9 


i 

IBD Colitis 2 

11 J 


1.2 


Monocytes rest 


17.1 


JlBD Crohn's 


1.8 


Monocytes LPS 


14.0 


Colon 


15.4 


Macrophages rest 


59.0 


(Lung 


16.6 


Macrophages LPS 


29. 1 


iThymus 


15.6 


HUVEC none 


35.1 


jKidney 


18.9 


HUVEC starved 


62.0 





CNS_neurodegeneration_vl.O Summary: Ag3282 - This panel confirms the expression 
of this gene at low levels in the brain in an independent group of individuals. However, no 
differential expression of this gene was detected between Alzheimer's diseased postmortem 
brains and those of non-demented controls in this experiment. Please see Panel 1.4 for a 
5 discussion of the potential utility of this gene in treatment of central nervous system 
disorders. 

General_screening_panel_vl.4 Summary: Ag3282 Highest expression of this gene is 
seen in a brain cancer cell line (CT=24.3). This gene appears to be expressed more highly 
in the cancer cell lines than in the normal tissue samples on this panel and may be involved 
10 in cellular growth and proliferation. Based on this expression profile, this gene may be 

involved in gastric, pancreatic, brain, colon, renal, lung, breast, ovarian and prostate cancer 
as well as melanomas. Thus, expression of this gene could be used as a diagnostic marker 
for the presence of these cancers. Furthermore, therapeutic inhibition using antibodies or 
small molecule drugs might be of use in the treatment of these cancers. 

15 This gene is also expressed at high levels in all regions of the CNS examined. 

Therefore, this gene may play a role in central nervous system disorders such as 
Alzheimer's disease, Parkinson's disease, epilepsy, multiple sclerosis, schizophrenia and 
depression. 

In addition, this gene product is expressed in adipose, pancreas, adrenal, thyroid, 
20 pituitary, fetal skeletal muscle, heart, and liver. This widespread expression in tissues with 
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metabol ic function suggests that this gene product may be important for the pathogenesis, 
diagnosis, and/or treatment of metabolic and endocrine diseases, including obesity and 
Types 1 and 2 diabetes. 

Furthermore, this gene is more highly expressed in fetal skeletal muscle (CT=30.4) 
5 and liver (CT=27) when compared to expression in the adult skeletal muscle (CT>35) and 
liver (CT=30) may be useful for the differentiation of the fetal and adult sources of this 
tissue. 

Panel 4D Summary: Ag3282 This gene is expressed at high to moderate levels in a wide 
range of cell types of significance in the immune response in health and disease. Highest 
10 expression is seen in polkweed mitogen stimulated B lymphocytes (CT=25.7). In addition, 
expression is seen in members of the T-cell, B-cell, endothelial cell, 

macrophage/monocyte, and peripheral blood mononuclear cell family, as well as epithelial 
and fibroblast cell types from lung and skin, and normal tissues represented by colon, lung, 
thymus and kidney. This ubiquitous pattern of expression suggests that this gene product 

15 may be involved in homeostatic processes for these and other cell types and tissues. This 
pattern is in agreement with the expression profile in Panel 1 .4 and also suggests a role for 
the gene product in cell survival and proliferation. Therefore, modulation of the gene 
product with a functional therapeutic may lead to the alteration of functions associated with 
these cell types and lead to improvement of the symptoms of patients suffering from 

20 autoimmune and inflammatory diseases such as asthma, allergies, inflammatory bowel 
disease, lupus erythematosus, psoriasis, rheumatoid arthritis, and osteoarthritis. 



AA. CG57411-01: KELCH-LIKE PROTEIN KLHL3C 

Expression of gene CG5741 1-01 was assessed using the primer-probe set Ag3229, 
25 described in Table AAA. Results of the RTQ-PCR runs are shown in Tables AAB, AAC, 
AAD and AAE. 



Table AAA . Probe Name Ag3229 



Primers 


Sequences 


Length 


Start 
Position 


SEQ ID 

NO: 


Forwarc 5 1 -gcagcgagctctaccacat-3 1 


19 


287 


472 
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Probe 


TET-5 ' -aaggccttcgcgctgcagatctt- 3 1 - 
TAMRA 


23 


310 | 473 


Reverse 


5 1 -aagtcgtccttggagatgct-3 ' 


20 


364 j 474 



Table AAB . CNSneurodegenerationvl.O 



Tissue Name 


Rel. Exp.(%) Ag3229, I " " 
Run 209862301 f Aissuemme 


Ret. E\p.(%).V C 3229, 
Run 209862301 


AD 1 Hippo 


16 3 TControl (Path) 3 
Temporal ( tx 


_________ 

8.0 


AD 2 Hippo 


34.6 


Control (Path) 4 
Temporal Ctx 


35.8 


AD 3 1 lippo 


15.9 


AD 1 Occipital Ctx 


18.6 


AD 4 Hippo 


6 9 |AD 2 Occipital Ctx 
j(Missing) 


0.0 


AD 5 Hippo 


100.0 AD 3 Occipital Ctx 


1 1.7 


AD 6 Hippo 


35.4 lAD 4 Occipital Ctx 


17.7 


Control 2 ( lippo 


31.2 |AD 5 Occipital Ctx 


49.7 


Control 4 Hippo 


12.1 


AD 6 Occipital Ctx 




Control (Path ) 3 
I lippo 


6.2 


Control 1 Occipital 

( X 


3.3 


AD 1 Temporal 
Ctx 


21.6 


Control 2 Occipital 
Ctx 


69.3 


AD 2 Temporal 
Ctx 


33.0 


Control 3 Occipital 
Ctx 


26.6 


AD 3 Temporal 
Ctx 


14.1 


Control 4 Occipital 
Ctx 


7.5 


AD 4 Temporal 
Ctx 


16.8 


Control (Path) 1 
Oct ipital Ctx 


72.2 


AD 5 Inf Temporal 
Ctx 


71.7 


Control (Path) 2 
Occipital Ctx 


13.7 


AD 5 Sup 
Temporal Ctx 


32.3 

... . 


( ontrol (Path) 3 
Occipital Ctx 


6.3 


AD 6 Inf Temporal 
Ctx 


30.6 


Control (Path) 4 
Occipital Ctx 


16.8 


AD 6 Sup 
Temporal Ctx 


33.9 


C ontrol 1 Parietal 
Ctx 


8.6 


Control 1 
Temporal Ctx 


4.4 


Control 2 Parietal 
Ctx 


39.8 


Control 2 
Temporal Ctx 


56.6 


Control 3 Parietal 
Ctx 


21.5 


Control 3 
Temporal Ctx 


19.6 

- 


Control (Path) 1 
Parietal Ctx 


66.4 


Control 3 
Temporal Ctx 


14.2 


Control (Path) 2 
Parietal Clx 


26.8 
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Control (Path) 1 
1 unpoial Ctx 


62.0 


Control (Path) 3 
Parietal Ctx 


5.2 


Control (Path) 2 
Temporal Ctx 


36.1 


Control (Path) 4 
Parietal Clx 


54.3 



Table A AC . General_screening_panel_vl.4 



1 T- K 

1 Tissue Name 


Rel. Expl(%) 

Ag3229, Run Tissue Name 
214439727 


Rel. F.\p.(%> 
Ag3229, Run 
21443«)727 


Adipose 


6.0 Renal ca. TK- 10 


20.4 


Melanoma* 

I lM)oo(,'A 1. 1. 


8.1 Bladder 

I 


6.7 


Melanoma* 
Hs&88(3).T 


. , . |Gastric ca. (liver met.) 
NCI-N87 


- — - 

11.2 


jMelanoma* M14 
Pvk I moina* 
1 OXI.VVI 


2.1 Gastric ca. KATO 111 


59.5 


1 

24.8 Colon ca. SW-948 

1 


0.6 


(Melanoma* SK- 


20.7 (Colon ca. SW480 


31.6 


j Squamous cell 
(carcinoma SCC-4 


6 7 Colon ca.* (SW480 
jmet)SW620 


4.7 


Testis Fool 


3.0 Colon ca. HT29 


2.7 


(Prostate ca.* (bone 
met) PC-3 
Prostate Pool 


17.8 Colon ca. HCT-116 
8.5 Colon ca. CaCo-2 


35.1 

iT" 


i Placenta 


1 4.2 jColon cancer tissue 


6.2 


Uterus Pool 


5.8 Colon ca. SWI 1 16 


3.3 


(Ovarian ca. 
'■ )\ CAR-3 
lOvarian ca. SK- 
iOV-3 

lOvarian ca. 
()\ C MM 


40.9 
17.7 
11.9 


Colon ca. Colo-205 
Colon ca. SW-48 
Colon Pool 


5.4 
3.3 
25.0 


Ovarian ca. 
OV< \R-5 


84.1 


Small Intestine Pool 


14.9 


[Ovarian ca. 
1GROV-1 


2.0 


Stomach Pool 


6.4 


lOvarian ca. 
lOvary 


8.1 
8.7 


— — — — 
Bone Marrow Pool 

Fetal Heart 


9.7 I 

*~~ T/T' 1 


Br.. ist ca Mi F- 7 


0.9 


Heart Pool 


10.7 i 


Breast ca. MDA- 
jMB-231 


30.1 


Lymph Node Pool 


21.8 


Breast ca. BT 549 


8.1 


Fetal Skeletal Muscle 


4.2 
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Breast ca. T47D 


100.0 Ske letal Muscle Pool : 8.7 


Breast ca N4DA-N 


0.0 Spleen Pool 10.4 


Breast 1*001 


22.4 (Thymus Pool 


11.2 


Trachea. 


1 n . JCNS cancer 

l(glio/astro) U87-MG 


55.5 


Lung 


1.7 


CNS cancer 
(glio/astro)U-118-MG 


44.8 


Fetal Lung 


6.7 


CNS cancer 
(neuro;met) SK-N-AS 


5.8 


Lung ca. NCI-N417 


11.7 


l \ T S cancel I islro) SF- 
539 


0.4 


Lung ca. LX-1 


37.1 


CNS cancer (astro) 
SNB-75 


5.0 


Lung ca. NCI-H146 


6.2 


CNS cancer (glio) 
SNB-19 


2.9 


Lung ca. SHP-77 


61.1 


CNS cancer (glio) SF- 
295 


39.0 


Lung ca. A549 


6.3 f Brain (Amygdala) Pool 




Lung ca. NCI-I 1526 


8.7 {Brain (cerebellum) 


22 2 


Lung ca. NCI-H23 


6.3 [Brain (fetal) 




Lung ca. NCI-H460 


7 q Br.im (Hipp (campus) 

Pool 


8.5 


Lung ca. HOP-62 


8.1 jCerebral Cortex Pool 


20.7 


Lung ca. NCI-H522 


0.5 


Brain (Substantia 
nigra) Pool 


14.7 


Liver 


0.3 


Brain (Thalamus) Pool 




13.8 


Fetal Liver 


1.8 


Brain (whole) 


11.5 


Liver ca. HepG2 


0.2 


Spinal Cord Pool 


zzzin :::: 


Kidney Pool 


26.8 


Adrenal Gland 


29.9 


Fetal Kidney 


10.2 


Pituitary gland Pool 


10.7 


Renal ca. 786-0 


7.5 Salivary Gland 


4.1 


Renal ca. A498 


4.0 Thyroid (female) 


- - M - — ' 


Renal ca. ACHN 


. . Q 1 Pancreatic ca. 
|CAPAN2 


1.0 


Renal ca. UO-31 15.3 (Pancreas Pool 


28.7 I 



Table A AD . Panel 2.2 



Tissue Name 


Rel. Exp.(%) I 

Ag3229, Run Tissue Name 
174442765 


Rel. Exp.(%) 
Ag3229, Run 
174442765 


Normal Colon 


. . , (Kidney Margin 
(01)043 18) 


100.0 


Colon cancer 


31.9 Kidiv\ malignant 


10.7 
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( UUUt". 04 1 


cancel i ODuoZO-1 1> ) 




Colon Margin 
(OD06064) 


Kidney normal 
20.6 adjacent tissue 
((OD06204E) 


1 1.6 


Colon cancer 
(OD06159) 


, j-. Kidney Cancer 
(OD04450-01 i 


38.4 


Colon Margin 
(OD06159) 


1 ? _ (Kidney Margin 
j(OD04450-03) 


17.4 


Colon cancer 
(OD06297-04) 


, 7 Kidney Cancer 
[8120613 


0.0 


Colon Margin 
(OD06297-05) 


99 . Kidney Margin 
18120614 


6.0 


CC Gr.2 ascend colon 
(OD03921) 


y. <- (Kidney Cancer 
|9010320 


12.0 


CC Margin (OD03921) 


, n s iKidney Margin 
(9010321 


9.9 


Colon cancer metastasis 
(OD06104) 


8.6 


Kidney Cancer 
8120607 


47.3 


Lung Margin 
(OD06104) 


6.2 


Kidney Margin 
8120608 


5.6 


Colon mets to lung 
(OD04451-01) 


31.0 


Normal Uterus 


48.3 


Lung Margin 
(OD0445 1 -02) 


39.5 


Uterine Cancer 064011 


14.9 


Normal Prostate 


41 .2 Normal Thvroid 


2.6 


Prostate Cancer 
(OD04410) 


„ - jThyroid Cancer 
064010 


4.3 


Prostate Margin 
(OD04410) 


, A r Thyroid Cancer 
106 (A302152 


15.3 


Normal Ovary 


~~ 9 Thyroid Margin 
Z5 - Z |A302153 


2.7 


Ovarian cancer 
(OD06283-03) 


7.2 'Normal Breast 


46.0 


Ovarian Margin 
(OD06283-07) 


1 _ „ jBreast Cancer 
(OD04566) 


5.9 


Ovarian C'ancet i!f;4tMs8 


22 .2 ] Breast Cancer 1 024 


27.4 


Ovarian cancer 

(OIJUo 140 ) 


„ „ jBreast Cancer 
fOD04590-01) 


19.5 


Ovarian Margin 
(OD06145) 


1 ~ . Breast Cancer Mets 
((OD04590-03) 


13.5 | 


Ovarian cancer 
(OD06455-03) 


6.3 


Breast Cancer 
Metastasis (OD04655- 
05) 


12.2 


Ovarian Margin 
(OD06455-07) 


12.9 


Breast Cancer 064006 


8.1 
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Normal Lung 


14 5 Bi east Cancer 9100266 


3.0 


Invasive poor diff. lung 
adeno (ODO4945-01 


5.0 


Breast Margin 
9100265 


3.4 


Lung Margin 
(ODO4945-03) 


37.4 


Breast Cancer 
A209073 


11.2 


Lung Malignant Cancer 
(OD03126) 


9.6 


Breast Margin 
A2090734 


61.1 


Lung Margin 
(OD03126) 


14.2 


Breast cancer 
(OD06083) 


4.7 


Lung Cancer 
(OD05014A) 


4.9 


Breast cancer node 
metastasis (OD06083) 


12.7 


Lung Margin 
(OD05014B) 


39.0 


Normal Liver 


1 

2.8 


Lung cancer 
(OD06081) 


17.4 


Liver Cancer 1 026 


13.6 


Lung Margin 
(OD06081) 


32.3 


Liver Cancer 1 025 


12.9 


Lung Cancer 
(OD04237-01) 


4.2 


Liver Cancer 6004-T 


13.2 


Lung Margin 

l /Ynft4.9 37 (Y~>\ 


24.7 


Liver Tissue 6004-N 


1.3 


Ocular Melanoma 
[Metastasis 


12.9 


Liver Cancer 6005-T 


43.2 


|Ocular ivlelanoma 
Margin (Liver) 


10.7 


Liver Tissue 6005-N 


4.8 




52.9 


Liver Cancer 064003 


39.5 


Melanoma Margin 
CI \ma\ 


21.0 


Normal Bladder 


9.3 


Normal Kidney 


4.7 


Bladder Cancer 1023 


5.8 


Kidney Ca, Nuclear 
grade 2 (OD04338) 


40.3 


Bladder Cancer 
A302173 


4.2 


Kidney Margin 
(OD04338) 


7.5 


Normal Stomach 


31.4 


Kidney Ca Nuclear 
grade 1/2 (OD04339) 


82.4 


Gastric Cancer 
9060397 


1.2 


Kidney Msrgin 
(OD04339) ~ 


, - 9 jStomach Margin ! 
J9060396 


7.1 


Kidney Ca, Clear cell 
type (OD04340) 


8.3 


Gastric Cancer 
oor>ov>5 


7.4 


Kidney Margin 
(OD04340) 


24.7 


Stomach Margin 
9060394 


10.9 


Kidney Ca, Nuclear 
grade 3 (OD04348) 


13.1 


Gastric Cancer 064005 \ 


10.4 



Table AAE . Panel 4D 
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Tissue Name 


Rel. Exp.(%) 

Ag3229, Run I Tissue Name 
l & 64389704 


Rel. Exp.(%) 
164389704 


Secondary 1 h 1 act 


3.4 HUVEC IL-lbeta 


20.0 


Secondary Th2 act 


IS III VEC IFN gamma 




Secondary Trl act 


2 j HUVEC I NF alpha - 
IFN gain ma 


26.6 


Secondary Thl rest 


HUVEC TNF alpha + 
1Z |IL4 


35.1 


Secondary Th2 rest 


2.0 


HUVEC IL-11 


17.6 


Secondary Trl rest 


4.5 


Lung Microvascular EC 
none 


34.2 


Primary Thl act 


17.7 


Lung Microvascular EC 
TNFalpha + IL-lbeta 


49.0 


Primary Th2 act 


. _ Microvascular Dermal 1 , _ , 
5.3 | 30.6 
bC. none 1 


Primary Trl act 


7 - Q jMicrosvasular Dermal 1 _ 
EC TNFalpha + IL-lbeta « 


Primary Thl rest 


1 . ft jBronchial epithelium 
jTNFalpha + ILlbeta 


46.7 


Primary Th2 rest 


r> - Small airway epithelium 
[none 


22.1 


Primary Trl rest 


„ 1 ISmall airway epithelium 
TNFalpha + IL-lbeta 


97.9 


CD45RA CD4 


12.7 ! Coronery artery SMC rest 


31.2 


CD45RO CD4 
lymphocyte stct 


r ( | Coronery artery SMC 
1 NFalpha i IL-lbeta 


10.5 


CD8 lymphocyte act 


2.4 Astrocytes rest 


7.5 


Secondary CD8 
lymphocyte rest 


, 9 Wiocytes TNFalpha + 
|lL-lbeta 


8.6 


Secondary CD8 
lymphocyte act 


0.0 lKU-812 (Basophil) rest 

... — ~,. — I. „ - ...J 


0.8 


CD4 lymphocyte none 

- ^ - ■ 


56 JKU-8 12 (Basophil) 
PM \ ionoinycin 


2.9 | 


2ry Thl/Th2/Trl anti- 
CD95 CHI 1 


3y 1CCD1106 

(Keratinocyles) none 


6.2 


LAK cells rest 


CCD] 106 
5.9 l(Keratinocytes) 

TNFalpha + IL-lbeta 


6.0 1 


LAK cells IL-2 


3.0 iLiver cirrhosis 15.5 


LAK cells IL-2+IL-12 


6.2 


Lnp kidney 


12.2 


LAK cells IL-2+IFN 
gamma 


10.7 


NCI-H292 none 


30.8 


LAK cells IL-2+IL-18 


5.0 


NCI-H292 IL-4 


49.7 
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LAK cells 
PMA/ionomycin 


4.0 


JNCI-H292 IL-9 


43.5 


INK Cells IL-2 rest 


1.9 





31.6 


Two Way MLR 3 day 


9.0 


INCI-H292 I FN gamma 


17.7 


Two Way MLR 5 day 


3.3 


IHPAEC none 


18.0 


Two Way MLR 7 day 


1.2 


HPAEC TNF alpha + 1L- 
jl beta 


58.2 


PBMC rest 


0.8 


jLung fibroblast none 


40.6 


PBMC PWM 


10.7 


Lung fibroblast TNF 
[alpha + IL-1 beta 


11.0 


1PBMC PHA-L 


10.2 


^Lung fibroblast IL-4 




!' n - >>^Jl)none 


0.0 


Lung fibroblast IL-9 


55.1 


Ramos (B cell) 
iionomycin 

- 


0.0 




|Lung fibroblast IL-1 3 


78.5 


B lymphocytes PWM 


23.3 


Lung fibroblast IFN 
gamma 


82.4 


B lymphocytes CD40L 

anH TT -4 


18.6 


(Dermal fibroblast 
i\^\^Lj iu /v resi 


45.4 


EOL-1 dbcAMP 


1.8 


IDermal fibroblast 

IPPFil 07f> TMF nlt-iha 
|V^i^jl^iu/u 1 iNr aipna 


36.3 


EOL-I dbcAMP 
PMA/ionomycin 


2.0 


IDermal fibroblast 
CCD Id 70 IL-1 beta 


23.8 


Dendritic cells none 


5.9 


Dermal fibroblast IFN 
5gamma 


4.6 


Dendritic cells LPS 


8.0 


Derma br >bl >t IL-4 


16.6 


Dendritic cells anti- 
:CD40 


3.3 


IIBD Colitis 2 

: 


6.2 


Monocytes rest 


4.2 


IlBD Crohn's 


3.6 i 


Monocytes LPS 


0.9 


Colon 


30.1 


Macrophages rest 


3.1 


Lung 


19.9 


Macrophages I PS 


1 4" 


Thymus 


20.6 


1ILYEC none 


^ 1 


i Kidney 


20 «> 


HI VEC starved 


- _i5^__L_ ___ i 





CNS_neurodegeneration_vl.O Summary: Ag3229 - This panel confirms the expression 
of this gene at low levels in the brain in an independent group of individuals. However, no 
differential expression of this gene was detected between Alzheimer's diseased postmortem 
brains and those of non-demented controls in this experiment. Please see Panel 1.4 for a 
discussion of the potential utility of this gene in treatment of central nervous system 
disorders. 
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General_screening_panel_vl.4 Summary: Ag3229 - Highest levels of expression of this 
gene are seen in breast cancer cell line T47D (CT=28.5). Based on expression in this panel, 
this gene may be involved in gastric, brain, colon, renal, lung, breast, ovarian and prostate 
cancer as well as melanomas. Thus, expression of this gene could be used as a diagnostic 
marker for the presence of these cancers. Furthermore, therapeutic inhibition using 
antibodies or small molecule drugs might be of use in the treatment of these cancers. 

This gene product is also expressed in adipose, pancreas, adrenal, thyroid, pituitary, 
skeletal muscle, and heart. This widespread expression in tissues with metabolic function 
suggests that this gene product may be important for the pathogenesis, diagnosis, and/or 
treatment of metabolic and endocrine diseases, including obesity and Types 1 and 2 
diabetes. 

In addition, this gene is expressed at low to moderate levels in all regions of the 
CNS examined. Therefore, this gene may play a role in central nervous system disorders 
such as Alzheimer's disease, Parkinson's disease, epilepsy, multiple sclerosis, schizophrenia 
and depression. 

Panel 2.2 Summary: Ag3229 Highest expression of the CG5741 1-01 gene is seen in the 
kidney (CT=32.2). In addition, significant levels of expression are seen in samples derived 
from normal lung and breast. Expression in these normal tissues is also higher than in the 
corresponding malignant tissue. Thus, expression of this gene could be used to differentiate 
between these samples and other samples on this panel and as a marker to detect the 
presence of lung, breast and kidney cancer. Furthermore, therapeutic modulation of the 
expression or function of this gene may be effective in the treatment of lung, breast and 
kidney cancer. 

Panel 4D Summary: Ag3229 Highest expression of the CG5741 1-01 gene is seen in IL-4 
treated lung fibroblasts (CT=31.3). Significant levels of expression are seen in activated- 
NCI-H292 mucoepidermoid cells as well as untreated NCI-H292 cells. Moderate 
expression is also detected in IL-9, IL-13 and IFN gamma activated lung fibroblasts, 
human pulmonary aortic endothelial cells (treated and untreated), small airway epithelium 
(treated and untreated), treated bronchial epithelium and lung microvascular endothelial 
cells (treated and untreated). The expression of this gene in cells derived from or within the 
lung suggests that this gene may be involved in normal conditions as well as pathological 
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and inflammatory lung disorders that include chronic obstructive pulmonary disease, 
asthma, allergy and emphysema. Moderate/low expression of this gene is also detected in 
treated and untreated HUVECs (endothelial cells) and coronary artery smooth muscle cells 
(treated and untreated) and normal tissues that include lung, colon, thymus and kidney. 
5 Expression in the various immune cell types and tissue samples suggests that therapeutic 
modulation of this gene product may ameliorate symptoms associated with infectious 
conditions as well as inflammatory and autoimmune disorders that include psoriasis, 
allergy, asthma, inflammatory bowel disease, rheumatoid arthritis and osteoarthritis. 

AB. CGJ57399-01 and CG57399-03: PHOSPHOLIPASE ADRAB-B 
10 PRECURSOR 

Expression of gene CG57399-01 and variant CG57399-03 was assessed using the 
primer-probe sets Ag3952 and Ag3226, described in Tables ABA and ABB. Results of the 
RTQ-PCR runs are shown in Tables ABC and ABD. 

1 5 Table ABA . Probe Name Ag3952 



Primers 


Sequences 


(Length 


Start 
i Position 


SEQID 

NO: 


Forward 


5 ' -ctgtgtccctgtgtcctgaa-3 • 


20 


In? 5 


47^ 


Probe 


TET-5 1 -tcaacagaacttgctaccctcatcga-3 1 -TAME 


-a 26 


| 1666 


176 


Reverse 


5 ' -gtgggtcttctcctgaaacttc-3 1 




1701 


177 


Table ABB. Probe Name As3226 


Primers 


Sequences 


Length 


Start 
Position 


SEQ ID 

NO: 


Forward|5 1 -gatgatcctcaggtcactgtgt-3 1 


22 


1617 


478 


Probe 


TET-5 ' -ccctgtgtcctgaagtttgatgataactca-3 ' - 
TAMRA 


1 30 


1639 | 479 


Reverse 


5 1 -tcgatgagggtagcaagttct-3 1 




1671 


480 



Table ABC . General_screening_panel_vl.4 



1 

| Tissue Name 


Rel. Exp.(%) 
Ag3952, Run 
213856126 


Tissue Name 1 


Rel. Exp.(%) 
Ag3952, Run 
213856126 


Adipose 


9.0 


Renal ca lk-1'- 


15.0 


i Melanoma* 
;Hs688(A).T 


3.0 


— — — — — — |- 

Bladder 


22.7 


Melanoma* 
Hs688(B).T 


3.4 


Gastric ca. (liver met.) i 
NCI-N87 J 


13.0 



o 
m 



Melanoima* Ml 4 


0.9 


Gastric ca. KATO III 


75.3 


Melanoma* 
LOXIMVI 


11.7 


Colon ca. SW-948 


4.3 


Melanoma* SK- 
MEL-5 


1.5 


Colon ca. SW480 


97.3 


Squamous cell 
carcinoma SCC-4 


8.7 


Colon ca.* (SW480 
met) SW620 


4.4 


Testis Pool 


12.8 


Colon ca HT29 


0.4 


Prostate ca.* (bone 
met) PC-3 


10.5 


Colon ca. HCT-116 


1.2 


Prostate Pool 


12.9 


Colon ca. CaCo-2 


60.7 


Placenta 


5 j 


Colon cancer tissue 




Uterus Pool 


6.5 


Colon ca. SW1116 


0.0 


Ovarian ca. 
OVCAR-3 


7.3 


Colon ca. Colo-205 


0.9 


Ovarian ca. SK- 
OV-3 


26.4 


Colon ca. SW-48 


26T — 


Ovarian ca. 

KJ V l^AK-'f 


1.9 

________ 


Colon Pool 


18.8 


Ovarian ca. 
OVCAR-5 


6.7 


Small Intestine Pool 


5.3 


OvBrisn C3.. 
IGROV-l 


9.2 


Stomach Pool 


7.9 


Ovarian ca. 
OVCAR-8 


4.2 


Bone Marrow Pool 


8.4 


0\ ai) 


10.0 


Fetal Heart 


1.2 


Breast ca. MCF-7 


0.4 

- — - 


Heart Pool 


5 - 7 


Breast ca. MDA- 
MB-231 


92.0 




Lymph Node Pool 


32.1 


Breast ca. BT 549 


5.5 


Fetal Skeletal Muscle 


1.2 


Breast ca. T47D 


2.5 


Skeletal Muscle Pool 


4.7 


Rrpc»ct pa A/IT* A XT 




opieen Jrooi 




Breast Pool 


19.6 


Thymus Pool 


19.3 


Trachea 


10.3 

— — — 


CNS cancer 
(glio/astro) U87-MG 


38.2 


[— 

Lung 


1.2 


CNS cancer 
(glio/astro) U- 11 8-MG 


12.2 


Fetal Lung 


8.3 


CNS cancer 
(neuro;met) SK-N-AS 


0.9 


Lung ca. NCI-N417 

— — 


0.9 


CNS cancer (astro) SF- 
539 




Lung ca. LX-1 


27.2 


CNS cancer (astro) 
SNB-75 


17.1 


Lung ca. NCI-H146 


10.7 


CNS cancer (glio) 
SNB-19 


6.8 
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Lung ca. SHP-77 


47.3 




CNS cancer (glio) SF- 
295 


5.7 


Lung ca. A549 


5.1 




Brain (Amygdala) Pool 


7.0 


Lung ca. NCI-H526 


0.0 




Brain (cerebellum) 


3.2 


Lung ca. NCI-H23 


4.1 




Brain (fetal) 


19.3 


Lung ca. NCI-H460 


0.5 


— ' 


Brain (Hippocampus) 
Pool 


13.1 


Lung ca. HOP-62 


2.7 


jCerebral Cortex Pool 


14.8 


Lung ca. lNd-rijzz 


1 3 


Brain (Substantia 
jnigra) Pool 


6.3 


Liver 


o.o 




Brain { f balamus) Po, ,! 


15.2 


Fetal I iver 


1.7 


jBrain (whole) 


10.4 


Liver ca. HepG2 


0.5 


(Spinal Cord Pool 


5.3 


Kidney Pool 


21.2 


•\djenal Ghnd 


100.0 


Fetal K idney 


1.6 




Pituitary gland Pool 


4.3 


Renal ca. 786-0 


1.7 




Salivary Gland 


3.4 


Renal ca. A498 


1.3 


Th} n id (km Jo 


14.5 


Renal ca. ACHN 


4.3 


Pancreatic ca. 
CAPAN2 


1.7 


Renal ca. UO-3 1 


17.4 


l\ nereas Pool 


24.5 



Table ABD . Panel 1.3D 



Tissue Name 


Rel. Exp.(%) 1 
Ag3226, Run Tissue Name 
167994701 


Rel. Exp.(%) 
Ag3226, Run 
167994701 


I -iver adenocarcinoma 


2.5 .Kidney (fetal) 


16.3 


Pancreas 


0.0 iRenal ca. 786-0 


0.9 


Pancreatic ca. CAP AN 
2 

Adrenal gland 


0.0 Renal ca. A498 
I l >.6 Renal ca. RXF ?>93 


1.4 
3.4 


Th\roid 


16.3 Renal ca. ACHN 


1.4 


Salivary gland 


0.0 


■ I <>-_-! 


2.8 


Pituitary gland 


1.9 


Renal ca. I K- 10 


4.4 


Brain (fetal) 




Liver 


0.0 


Brain (whole) 


4.6 


Liver (fetal) 


1.1 


|Brain (amygdala) 


6.7 


Liver ca. 

(hepatoblast) HepG2 


0.0 


Brain ( cerebellum) 


1 .6 |Lung 


8.8 


Brain i hipp jampus) 


22.2 


Lung (fetal) 


1.7 


Brain (substantia nigra) 


3.1 


Lung ca. (small cell) 
LX-1 


18.6 


Brain (thalamus) 


3.2 


Lung ca. (small cell) 
NCI-H69 


4.2 
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Cerebral Cortex 


~ 'Lung ca. (s.cell var.) 
[SHP-77 


100.0 


Spinal cord 


■j i (Lung ca. (large 
|cell)NCI-H460 


0.0 


glio/astro U87-MG 


_ . iLung ca. (non-sm. 
[cell) A549 


6.7 


glio/astroU-118-MG 


4 j |Lung ca. (non-s.cell) 
JNCI-H23 


. 5J 


astrocytoma SW1783 


1.2 


Lung ca. (non-s.cell) 
HOP-62 


0.0 


neuro*; met SK-N-AS 


0.0 


Lung ca. (non-s.cl) 
NCI-H522 


0.0 


astrocytoma SF-539 


n ft Lung ca. (squam.) 
JSW 900 


0.9 


astrocy1:oma SNB-75 


. |Lung ca. (squam.) 
(NCI-H596 


3.7 


glioma SNB-19 


6.0 


Mammary gland 


— - — 

— — — ^— — — 


glioma U251 


14.1 


Breast ca.* (pl.ef) 
MCF-7 


0.0 


glioma SF-295 


0.0 


Breast ca.* (pl.ef) 
MDA-MB-231 


45.4 


Heart (fetal) 


1.4 


Breast ca.* (pl.ef) 
T47L) 


4.3 


Heart 


1.0 


Breast ca. BT-549 


7.1 


Skeletal muscle (fetal) 


0.7 


Breast ca. MDA-N 




Skeletal muscle 


3.2 


Ovary 


10.9 


Bone marrow 


3.1 


Ovarian ca. 
OVCAR-3 


0.0 


Thymus 


5.7 


Ovarian ca. 
OVCAR-4 


2.4 


Spleen 

1 


7.2 


Ovarian ca. 
OVCAR-5 


5.2 


Lymph node 


0.0 


Ovarian ca. 
OVCAR-8 


0.0 



0.0 


Colorectal 


4.8 


Ovarian ca. IGROV- 

1 


Stomach 


- 1 Ovarian ca.* 

5 (ascites) SK-< )V-3 


3.0 


Small intestine 


1.5 


Uterus 


5.8 ! 


Colon ca. SW480 


33.2 


Placenta 


0.0 [ 


Colon ca.* 
SW620(SW480 met) 


8.8 


Prostate 


1.6 


Colon ca. HT29 


0.0 


Prostate ca.* (bone 
met)PC-3 


2.6 


Colon ca. HCI -1 l(. 


0.0 


Testis 


7.4 
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Colon ca. CaCo-2 


35.4 


Melanoma 
Hs688(A).T 


0.0 


Colon ca. 
tissue(OD03866) 


24.5 


Melanoma* (met) 
Hs688(B).T 


0.0 


Colon ca. HCC-2998 


15.7 


Melanoma UACC- 
62 


0.0 


Gastric ca.* (liver met) 
NCI-N87 


6.4 


Melanoma Ml 4 


0.0 


Bladder 


14.6 


Melanoma LOX 
IMVI 


0.0 


Trachea 


4.4 


Melanoma* (met) 
SK-MEL-5 


0.0 


Kidney 


2.4 


Adipose 


17.3 



General_screening_panel vl.4 Summary: Ag3952 Highest expression of this gene is 
seen in the adrenal gland (CT=29). Thus, this gene product may be a treatment for 
Addison's disease and other adrenalopathies. This gene also has low levels of expression in 
5 adipose, heart, skeletal muscle, pituitary, thyroid, and pancreas. Therapeutic modulation of 
this gene product may be important for the diagnosis or treatment of endocrine or 
metabolic disease, including Types 1 and 2 diabetes, obesity and pancreatitis. 

Expression of this gene is also seen in sample derived from colon, gastric, lung and 
breast cancers. Thus, expression of this gene could be used to differentiate between these 

1 0 samples and other samples on this panel and as a marker to detect the presence of these 
cancers. Furthermore, therapeutic modulation of the expression or function of this gene 
may be effective in the treatment of colon, gastric, lung and breast cancers. 

Low but significant levels of expression are also seen for all regions of the CNS 
examined. Thus, this gene product may be useful for treatment of CNS disorders such as 

1 5 Alzheimer's disease, Parkinson's disease, stroke, epilepsy, schizophrenia and multiple 
sclerosis. 

Panel 1.3D Summary: Ag3952 Highest expression of the CG57399-01 gene is seen in a 
lung cancer cell line (CT=32.5). Low but significant expression is also seen in cell lines 
20 derived from breast and colon cancers. Overall, expression is consistent with expression 
seen in Panel 1.4. Thus, expression of this gene could be used to differentiate between 
these samples and other samples on this panel and as a marker to detect the presence of 
these cancers. Furthermore, therapeutic modulation of the expression or function of this 
gene may be effective in the treatment of colon, gastric, lung and breast cancers. 
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Among metabolic tissues, significant levels of expression are seen in adipose and 
the adrenal gland. Thus, this gene product may be useful for treatment of obesity, 
Addison's disease and other adrenalopathies. 

In addition, this gene is expressed in the hippocampus, and cerebral cortex. Both 
5 these regions of the brain undergo degeneration in Alzheimer's disease. Thus, therapeutic 
modulation of the expression or function of this gene may be effective in the treatment of 
this disease or any other neurodegenerative disorders. 

AC. CG57399-02: PHOSPHOLIPASE ADRAB-B PRECURSOR 

Expression of gene CG57399-02 was assessed using the primer-probe set Ag3952, 
1 0 described in Table AC A. Results of the RTQ-PCR runs are shown in Table ACB. Please 
note that this gene represents a variant of CG57399-01. This sequence however, only 
corresponds to probe and primer set Ag3952. 

Table ACA . Probe Name Ag3952 



r 

Primers 


Sequences 


Length 


Start Position 


SEQID 
NO: 


l-'orwarc 


5 1 -ctgtgtccctgtgtcctgaa-3 1 




578*""" 


181 


i Probe 


TET-5 ' -tcaacagaacttgctaccctcatcga-3 ' - 
TAMRA 


26 


611 


482 


[lie \ erse 


5 ' -gtgggtcttctcctgaaacttc-3 ' 


22 


646 


483 



Table ACB . General_screening_panelvl.4 



Tissue Name 


Rel. Exp.(%) 
Ag3952, Run 
213856126 


Tissue Name 


Rel. Exp.(%) 
Ag3952, Run 
213856126 


Adipose 


9.0 


Renal ca. TK-10 


15.0 


Melanoma* 
Hs688(A).T 


3.0 


Bladder 


22.7 


Melanoma* 
Hs688(B).T 


3.4 


Gastric ca. (liver met.) 
NCI-N87 


13.0 


Melanoma* Ml 4 


0.9 


Gastrii K \KUii 


75.3 


Melanoma* 
LOXIMVI 


11.7 


Colon ca. SW-948 


4.3 


Melanoma* SK- 
MEL-5 


1.5 


Colon ca. SW480 


97.3 


Squamous cell 
carcinoma SCC-4 


8.7 


Colon ca.* (SW480 
met) SW620 


4.4 
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Testis Pool 


12.8 


Colon ca. HT29 


0.4 


Prostate ca.* (bone 
met) PC-3 


10.5 


Colon ca.HCT-1 16 


1.2 


Prostate Pool 


12.9 


Colon ca. CaCo-2 


60.7 


PI 3. cent 3 


5.1 


Colon Ccincer tissue 


28.7 


Uterus Pool 


6.5 


Colon ca. SW1116 


0.0 


Ovarian ca. 
OVCAR-3 


7.3 


Colon ca. Colo-205 


0.9 


Ovarian ca. SK- 
OV-3 


26.4 


Colon ca. SW-48 


26.1 


Ovarian ca. 
ovr A V -A 


1.9 


Colon Pool 


18.8 


Ovarian ca. 
OVCAR-5 


6.7 


Small Intestine Pool 


5.3 


Ovarian ca. 
IGROV-1 


9.2 


Stomach Pool 


7.9 


Ovarian ca. 
OVCAR-8 


4.2 


Bone Marrow Pool 


8.4 


Ovary 


10.0 


Fetal Heart 


1.2 


Breast ca. MCF-7 


0.4 


Heart Pool 


5.7 


Breast ca. MDA- 
MB-231 _ 


92.0 


Lymph Node Pool 


32.1 


Breast ca. BT 54V 


5.5 " 


Fetal Skeletal Muscle 


1.2 


Breast ca. T47D 


~" 2.5 


Skeletal Muscle Pool 




oreasi ca. iviu/v-in 




Spleen Pool 


18.2 


Breast Pool 


19.6 


Thymus Pool 


19.3 


Trachea 


10.3 


CNS cancer 
(glio/astro) U87-MG 


38.2 


Lung 


— - 

1.2 


CNS cancer 
(glio/astro) i -118-MG 


12.2 


Fetal Lung 


8.3 


CNS cancer 
(neuro;met) SK-N-AS 


0.9 


Lung ca. NC1-N417 


0.9 


CNS cancer (astro) SF- 
539 


7.6 


Lungca. LX-1 


______ 

27.2 


SNB-75 ^ ^ 


_______ 

17.1 


Lung ca. NCI-H146 




10.7 


CNS cancer (glio) 
SNB-19 


6.8 


Lung ca. SHP-77 

— , ™ „™, 


47.3 


CNS cancel (glio) SI - 
295 


5.7 


(Lung ca. A549 


5.1 


P. rain { Amyeualaj Poo! 


7.0 


Lung ca.NCI-H526 


0.0 


Brain (cerebellum) 


3.2 


Lung caTNCI-1123 


4.l" iBrain (fetal) 


19.3 


1 in. . i V f-ll Mi 


0.5 


Brain (Hippocampus) 


13-. 
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Pool 




Lung ca. HOP-62 


2.7 


Cerebral Cortex Pool 


14.8 


Lung ca. JNCI-hOZZ 


1.3 


Brain (Substantia 
nigra) Pool 


6.3 


Liver 


0.0 


Brain (Thalamus) Pool 


15.2 


i ;tal Liver 


1.7 jBrain (whole) 


10.4 


I i\er ca HcpG2 


0.5 JSpinal Cord Po- 1 


5.3 


Kidney Pool 


21.2 \drenal Gland 


100.0 


Fetal Kidney 


1 .6 {pituitary gland Pool 


4.3 


Renal ca. 786-0 


1.7 [Salivary Gland 


3.4 


Renal ~c;i7a498~~" 


1 .3 iThyroid (female) 




Renal ca. ACHN 


. , Pancreatic ca. 
CAPAN2 


1.7 


Renal ca. UO-31 


1 7.4 jPancreas Pool 


24.5 



ru * ^ "~ 

tff General_screeningjpanel_vl.4 Summary: Ag3952 Highest expression of this gene is 

U 

p seen in the adrenal gland (CT=29). Thus, this gene product may be a treatment for 

=L Addison's disease and other adrenalopathies. This gene also has low levels of expression in 

yj adipose., heart, skeletal muscle, pituitary, thyroid, and pancreas. Therapeutic modulation of 

J" 5 ! 5 this gene product may be important for the diagnosis or treatment of endocrine or 

O metabolic disease, including Types 1 and 2 diabetes, obesity and pancreatitis. 

Expression of this gene is also seen in cell line samples derived from colon, gastric, 
lung and breast cancers. Thus, expression of this gene could be used to differentiate 
between these samples and other samples on this panel and as a marker to detect the 
10 presence of these cancers. Furthermore, therapeutic modulation of the expression or 

function of this gene may be effective in the treatment of colon, gastric, lung and breast 
cancers. 

Low but significant levels of expression are also seen for all regions of the CNS 
examined. Thus, this gene product may be useful for treatment of CNS disorders such as 
15 Alzheimer's disease, Parkinson's disease, stroke, epilepsy, schizophrenia and multiple 
sclerosis. 

AD. CG59311-01: ACYL-COENZYME A THIOESTER HYDROLASE bp. 

Expression of gene CG5931 1-01, splice variant CG5931 1-02, and full length clone 
CG5931 1-03, was assessed using the primer-probe set Ag3541, described in Table ADA. 
20 Results of the RTQ-PCR runs are shown in Tables ADB and ADC. 



Table ADA . Probe Name Ag3541 



Primers 


Sequences 


Length 


Start 
Position 


SEQID 
NO: 


Forward |5 1 -ctcactcaaaggcacaggtaga-3 ' 


22 


1199 


484 


Probe 


TET-5 1 -tggcagcaaattcaaactttcttcca-3 ' - 
TAMRA 


26 


1225 


485 


Reverse 


5 ' -tttgctgtgcttgacagatttt-3 ' 


22 


1269 


486 



Table ADB . General_screening_panel_vl.4 



Tissue Name 


Rel. Exp.(%) 
Ag3541, Run 
217049294 


Tissue Name 


Rel. Exp.(%) 
Ag3541, Run 
217049294 


Adipose 


0.0 


Renal ca. TK-10 


6.0 


Melanoma* 
Hs688(A).T 


0.7 


Bladder 


3.7 


Melanoma* 
Hs688(B).T 


1.6 


Gastric ca. (liver met.) 
NCI-N87 


0.0 


Melanoma* Ml 4 


0.0 


Gastric ca. KATO 111 


0.0 


Melanoma* 
LOXIMVI 


0.0 


Colon ca. SW-948 


0.0 


Melanoma* SK- 
MEL-5 


0.0 


Colon ca. SW480 


2.7 


Squamous cell 
carcinoma SCC-4 


0.0 


Colon ca.* (SW480 
met) SW620 


5.4 


Testis Pool 




Colon ca. HT29 


0.0 


Prostate ca.* (bone 
met) PC-3 


1.4 


Colon ca. HCT-1 16 


0.6 


Prostate Pool 


2.3 


Colon ca. CaCo-2 


0.6 


Placenta 


0.5 


Colon cancer tissue 


0.0 


Uterus Pool 


0.0 


Colon ca. SW11 16 


0.0 


Ovarian ca. 
OVCAR-3 


2.9 


Colon ca. Colo-205 


0.0 


Ovarian ca. SK- 
OV-3 


0.0 


Colon ca. SW-48 


0.0 


Ovarian ca. 
OVCAR-4 


0.9 


Colon Pool 


4.6 


Ovarian ca. 
OVCAR-5 


27.0 


Small Intestine Pool 


6.6 


Ovarian ca. 
IGROV-1 


0.0 


Stomach Pool 


3.1 


Ovarian ca. 
OVCAR-8 


1.8 


Bone Marrow Pool 


1.4 


Ovary 


2.5 


Fetal Heart 


9.2 
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Breast ca. M< F-7 


2.4 1 learl Pool 


3.4 


Breast ca. MDA- 
MB-231 


8.0 


Lymph Node Pool 


3.9 


Breast ca. BT 549 


4.9 


Fetal Skeletal Muscle 


4.9 


Breast ca. T47D 


52.9 


Skeletal Muscle Pool 


13.5 


Brc3.st C3.. 


0.0 


Spleen Pool 


0.0 


Breast Fool 


6.7 


1 hymus Pool 


4.7 


Trachea 


0.9 


CNS cancer 

(glio astro) L87-MG 


0.9 


Lung 


1.7 


CNS cancer 
(glio/astro) U-118-MG 


12.1 


Fetal Lung 


_ _ iCNS cancer 

l(neuro;met)SK-N-AS 


0.0 


Lung ca.NCI-N417 

— — 


n n CNS cancer (astro) SF- 


0.0 


[Lung ca. LX-1 


. ? ICNS cancer (astro) 
ISNB-75 


5.2 


Lung ca.NCI-H146 


2.1 


CNS cancer (glio) 
SNB-19 


0.0 




Lung ca. SHP-77 




CNS cancer (glio) SF- 
295 


0.7 


|Lung ca. A549 


0.0 !Brain (Amygdala) Pool 


4.2 


tung ca. NCI-H526 


0.0 


Brain (cerebellum) 


100.0 


Lung ca. NCI-H23~* 
Lung ca. NCI-H460 


10.2 


Brain (fetal) 


14.7 


3.4 


Brain (Hippocampus) 
Pool 


9.0 


Lung ca. HOP-62 


~67o 


Cerebral C orle\ Pool 


9.7 


It ~~ x i/" m u^oo 
ILung ca. JNCl-rlDzz 


8 5 jBrain (Substantia 
nigra) Pool 


3.5 


Liver 


0.5 "~|Brain (Thalamus) Pool 


10.5 


tela! Liver 


1.5 iBrain (whole) 


12.9 


jLiver ca. HepG2 


__22_ 


Npii«al ( . i J Pool 


7.6 


iKidney Pool 


9.8 


\drenal ulanJ 


10.2 


Fetal Kidney 


7.9 


Pituitar) gland Pool 


3.1 — 


Renalca" 786-0 


" 0.0 


Salivary Gland 




Renal ca. \4 l 'S 


0.0 Thyroid (female) 


0.9 


Renal ca. ACHN 


0.0 


Pancreatic ca. 
CAPAN2 


2.8 


Renal ca. UO-31 


3.3 


Pancreas Pool 


6.6 



Table ADC . Panel 4D 



Tissue Name 


Rel. Exp.(%) 
Ag3541, Run 


Tissue Name 


Rel. Exp.(%) 
Ag3541, Run 
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a 





166447041 




166447041 


Secondary ThI act 


2.7 


HUVEC II -lbeta 


0.0 


Secondary Tti2 act 




n u vj_/^ iriN gdinma 




Secondary Trl act 


0.0 


HUVEC INI alpha I 
IFN gamma 


0.0 


Secondary Thl rest 


0.0 


HUVEC 1 NF alpha l 
IL4 


0.0 


Secondary Th2 rest 


0.0 


HUVEC IL-1 1 




Secondary Trl rest 


0.0 


Lung Microvascular EC 
none 


0.0 


Primary Thl act 


2.7 


Lung Microvascular EC 
TNFalpha + IL-1 beta 


0.0 


Primary Th2 act 


0.0 


Microvascular Dermal 
EC none 


0.0 


Primary Trl act 


0.0 


Microsvasular Dermal 
EC TNFalpha + IL-1 beta 


0.0 


Primary Thl rest 


0.0 


Bronchial epithelium 
TNFalpha + IL1 beta 


0.0 


Primary Th2 rest 


0.0 


Small airway epithelium 
none 


0.0 


Primary Trl rest 


0, 


Small airway epithelium 
TNFalpha + IL-1 beta 


0.0 


CD45RA CD4 
l\ inph< >c\ te act 


0.0 


Coronery artery SMC rest 




CD45ROCD4 
lymphocyte act 


1.7 


Coronery artery SMC 
TNFalpha + IL- 1 beta 


0.0 


CDS lymphocyte act 


0.0 


AstroC) tes rest 


1.8 


Secondan CD8 
lymphocyte rest 


0.0 


Astrocytes TNFalpha + 
IL-1 beta 


0.0 


Secondary CD8 
lymphocyte act 


0.0 


KU-812 (Basophil) rest 


4.2 


CD4 lymphocyte none 


0.0 


KU-812 (Basophil) 
PMA/ionomycin 


1.4 


2ryThl/Th2/Tii anti- 

PT^O^ PHI 1 


0.0 




CCD1106 

(Keratitiocytes) none 


0.0 


LAK cells rest 


2.8 


CCLM lUo 
(Keratinocytes) 
TNFalpha + IL-1 beta 


9.8 


!LAKceiiriL-2 


b~o 


Liver cirrhosis 


22.2 


LAK cells IL-2+IL-12 


0.0 


Lupus kidney 


18.4 


LAK cell, 11 -2 11 N! 
gamma 


0.0 


NCI-H292 none 


10.4 


LAK cells IL-2+ IL-18 


0.0 


NCI- H2i)2 11-4 


7.2 


LAK cells 
PMA/ionomycin 


0.0 


NCI-H292 IL-9 


15.2 

™ ™ , ™ 
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NK Cells IL-2 rest 


1.7 


NCI-H202 IL-13 


3.1 


Two Way MLR 3 day ! 


0.0 


IFN gamma 


7.3 


Two Wav MLR 5 dav 


5.3 


HPAEC none 


0.0 


Two Way MLR 7 day 


0.0 


HPAEC TNF alpha + IL- 
1 beta 


0.0 


PBMC rest 


0.0 


Lung fibroblast none 


1.7 


PBMC PWM 


0.0 


T unfT fil-n-nhlaot TMP 
JUUIl^ llUlUUlaol J.rNJT 

alpha + IL-1 beta 


5.7 


rfjlvlL 1HA-1 




Lung fibroblast IL-4 




Ramos {B eel!) none 


0.0 


Lung fibroblast IL- 1 ^ 


0.0 


Ramos (B cell) 
ionomycin 


2.2 


Lung fibroblast IL-13 


3.2 


B lymphocytes PWM 


0.0 


Lung fibroblast IFN 
gamma 


0.0 


B lymphocytes CD40L 
(and IL-4 


0.0 


Dermal fibroblast 
CCD1 070 rest 


2.9 


|EOL-l dbcAMP 


0.0 


Dermal fibroblast 
CCD 1070 TNF alpha 


2.9 

— 


bUJL-l UDCAJVlr 


0.0 


Dermal fibroblast 


0.0 


|PMA/ionomycin 


CCD1070IL-1 beta 


Dendritic cells none 


0.0 


Dermal fibroblast IFN 
gamma 


0.0 


(Dendritic cells LPS 


3.5 Dermal fibroblast IL-4 


1.5 


(Dendritic cells anti- 
(l)4(i 


0.0 llBD Colitis 2 


5.4 


JMonocytes rest 


().() IBD Crohn's 


0.0 


pvlonocytes LPS 


0.0 


Colon 


14.1 


km 




Lung 


0.0 


M.tu op id jc I IN 


2.1 


Thymus 


100.0 


HUVEC none 


0 0 


Kidney 


2.3 


(HUVEC starved 









CNSneurodegenerationvl.O Summary: Ag3541 - Expression of this gene is 
low/und etectable (CTs > 34.5) across all of the samples on this panel (data not shown). 



General_screeningjpanel_vl.4 Summary: Ag3541 Significant expression of this gene is 
seen only in cerebellum, fetal brain, the breast cancer cell line T47D, and ovarian cancer 
cell line OVCAR-5 (CTs=32-35). Therefore, expression of this gene can be used to 
differentiate between these samples and others on this panel. 

Panel 4D Summary: Ag3541 - There is significant expression of this gene only in thymus 
(CT=33.8). Therefore, expression of this gene may be used to identify thymic tissue. 
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Furthermore, drugs that inhibit the function of this protein may regulate T cell development 
in the thymus and reduce or eliminate the symptoms of T cell mediated autoimmune or 
inflammatory diseases, including asthma, allergies, inflammatory bowel disease, lupus 
erythematosus, or rheumatoid arthritis. Additionally, therapeutics designed against this 
putative protein may disrupt T cell development in the thymus and function as an 
immunosuppresant for tissue transplant. 

AE. CG59309-01: ACYL-COENZYME A THIOESTER HYDROLASE 

Expression of gene CG59309-01 was assessed using the primer-probe set Ag3540, 
described in Table AEA. Results of the RTQ-PCR runs are shown in Tables AEB, AEC, 
AED and AEE. 

Table AEA . Probe Name Ag3540 



Primers 

L_ 


Sequences 


Length 


Start 
Position 


SEQ ID I 

NO: I 


Forward b 1 -ccacgttggctctagcttatta-3 ' 


22 


649 


487 | 


(Probe 


TET-5 ' -tgaagatctccccaataacatggaca-3 ' - 
TAMRA 


26 


677 


488 


Reverse 


5 ' -ttcgaagtactccagggatatg-3 ' 


22 


704 


489 



Table AEB . CNS neurodegeneration vl.O 



Tissue Name 


ReL Exp.(%) Ag3540, 
Run 210638385 


Tissue Name 


Rel. Exp.(%) Ag3540, 
Run 210638385 


AD 1 Hippo 


13.7 


Control (Path) 3 
Temporal Ctx 


8.2 


AD 2 Hippo 


26 2 jControl (Path) 4 
(Temporal Ctx 


34.2 


AD 3 Hippo 


AD 1 Occipital 
JCtx 


23.2 


AD 4 Hippo 


3.4 


AD 2 Occipital 
Ctx (Missing) 


0.0 


AD 5 hippo 


30.4 


AD 3 Occipital 
Ctx 


7.8 


AD 6 Hippo 


55.9 


A!) 4 Occipital 
Ctx 


15.0 


Control 2 Hippo 


[AD 5 Occipital 

[Ctx 


8.1 


Control 4 Hippo 


4 . |AD 6 Occipital 


763 
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Control (Path) 3 
Hippo 


, _ jControl 1 Occipital 

_ |Ctx__ 


3.6 


AD 1 Temporal Ctx 


1 1 _ Control 2 Occipital 
UU fCtx 


96.6 


AD 2 Temporal Ctx 


Q jControl 3 Occipital 
JCtx 


36.3 


AD 3 Temporal Ctx 


. _ (Control 4 Occipital 
48 Ctx 


3.9 


AD 4 Temporal Ctx 


[S 6 Control (Path) 1 
jOccipital Ctx 


100.0 


AD 5 Inf Temporal 
Ctx 


36.9 


Control (Path) 2 
Occipital Ctx 


7.6 


AD 5 SupTemporal 
Ctx 


27.4 


Control (Path) 3 
Occipital Ctx 


1.6 


AD 6 Inf Temporal 
Ctx 


47 jControl (Path) 4 
jOccipital Ctx 


16.6 


AD 6 Sup Temporal 
Ctx 


, . « {Control 1 Parietal 

642 jCtx 


8.7 


Control 1 Temporal 
Ctx 


7.0 


Control 2 Parietal 
Ctx 


20.7 


Control 2 Temporal 
Ctx 


53.2 


Control 3 Parietal 
Ctx 


27.2 


Control 3 Temporal 
Ctx 


19.9 


Control (Path) 1 
Parietal Ctx 


88.9 


Control 4 Temporal 
Ctx 


(Control (Path) 2 
Parietal Ctx 


10.8 


Control (Path) 1 
Temporal Ctx 


68.3 


Control (Path) 3 
Parietal Ctx 


10.1 


Control (Path) 2 
{Temporal Ctx 


25.3 


Control (Path) 4 
Parietal Ctx 


47.6 | 



Table AEC . General screening_panel_vl.4 



Tissue Name 


Rel. Exp.(%) 
Ag3540, Run 
217049291 


Tissue Name 


Rel. Exp.(%) 
Ag3540, Run 
217049291 


Adipose 




Renal ca. TK-10 


0.1 


Melanoma* 
Hs688(A).T 


0.7 


Bladder 


1.1 


Melanoma* 
Hs688(B).T 


0.5 


Gastric ca. (liver met.) 
NCI-N87 


5.6 


Melanoma* Ml 4 


0.2 


Gastric ca. KATO III 


0.0 


Melanoma* 
LOXIMVI 


0.0 


Colon ca. SW-948 


0.0 


Melanoma* SK- 


0.0 


Colon ca. SW480 


10.3 
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MEL-5 ! 


- ' ™ ~j 





Squamous cell 
carcinoma SCC-4 


0.3 


Colon ca.* (SW480 
met) SW620 


2.8 


Testis Pool 


0.3 


Colon ca. HT29 


0.8 


Prostate ca.* (bone 
met) PC-3 


0.8 


Colon ca.HCT-1 16 


0.0 


Prostate Pool 


0.3 


Colon ca. CaCo-2 


3.5 


Placenta 


1.4 


Colon cancer tissue 




Uterus Pool 


0.1 


Colon ca SW1116 


0.0 


Ovarian ca. 
OVCAR-3 


1.6 


Colon ca. Colo-205 


3.3 


Ovarian ca. SK- 
OV-3 


3.6 


Colon ca. SW-48 


1.7 


Ovarian ca. 
OVCAR-4 


0.4 


Colon Pool 


0.2 | 


[Ovarian ca. 
( >VCAR-5 


23.7 


Small Intestine Pool 


_____ 
0.3 


[Ovarian ca. 
llGROV- 1 


0.0 


Stomach Pool 


0.1 


Ovarian ca. 
( >\ CAR-8 


0.0 

,....,..,..„ 


Bone Marrow Pool 


0.2 


Ovary 


0.1 


Fetal Heart 


0.4 


jtjreasi ca. ivi^r-/ 


0 0 


Heart Pool 


0.2 


Breast ca. MDA- 


2.5 


Lymph Node Pool 


0.3 


Breast ca. B 1 549 


3.0 


Fetal Skeletal Muscle 




Breast ca. T47D 


100.0 


Skeletal Mn-.de Pool 


0.4 


Breast ca. MDA-N 


0.0 


Spleen Pool 




Brc ist Pool 


0.3 


Thymus Pool 


0.3 


Tr3.chc3 


0.4 


CNS cancer 
(glio/astro) U87-MG 


0.0 




0.0 


CNS cancer 
(glio/astrc )U-118-M( 


03 

- - — 


Fct&l Lung 


0.2 


CNS cancer 
(neuro;met) SK-N-AS 


L 


Lung ca. NCI-N417 


0.0 


CNS cancer (astro) SF- 


0.6 




Lung ca. LX-1 


3.5 


CNS cancer (astro) 
SNB-75 


3.1 


Lung ca. NCI-H146 


0.0 


CNS cancer (glio) 
SNB-19 


0.0 

: 


Lung ca. SHP-77 


0.1 


CNS cancer (glio) SF- 
295 


0.2 


Lung ca. A549 


1.4 


Brain (Amygdala) Pool 


i o- 7 . 
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I.ungoa M 1-11526 


0.7 iBrain (cerebellum) 


2.1 


Lung ca. NCI-H23 


1.3 Brain (fetal) 


0.5 


Lung ca. NCI-H460 


0.8 


Brain (Hippocampus) 
Pool 


1.0 


Lung ca. HOP-62 


1.2 


Cerebral Cortex Pool 


0.9 


Lung ca. Nd-hiDzz 


0.0 


Brain (Substantia 
nigra) Pool 


1.3 


Liver 


2.6 


Brain (Thalamus) Pool 


I.I 


Fetal Liver 


0.8 


P.i i i (whoki 


1.4 


Liver ca. HepG2 


0.1 


Spinal Cord Pool 


0.5 


Kidney Pool 


0.7 


Adrenal Gland 


0.8 


Fetal Kidney 


0.6 


Pituitary gland Pool 


0.1 


Renal ca. 786-0 


0.0 


Salivary Gland 


0.2 


Renal ca. A498 


0.0 


Thyroid (female) 


0.7 


Renal ca. ACHN 


0.0 


Pancreatic ca. 
CAPAN2 


9.4 


Renal ca. UO-3 1 


1.1 


Pancreas Pool 


0.9 



Table AED . Panel 4D 



Tissue Name 


Rel. Exp.(%) 
Ag3540, Run 
166447040 


Tissue Name 


Rel. Exp.(%) 
Ag3540, Run 
166447040 


Se-Londar} fhl act 


4.8 


HUVEC 1L- lbeta 


1.7 


Secondary Th2 act 


10.2 


HUVEC I FN gamma 


0.9 


Secondary Trl act 


12.9 


HUVEC TNF alpha + 
I FN gamma 


1.5 


Secondary Thl rest 


IHUVEC TNF alpha + 
111,1 


0.8 


Secondan t h2 rest 


1.4 HUVEC1L-11 


15 


Secondary Trl rest 


^ ^ Lung Microvascular EC 
mone 


0.6 


Primary Thl act 


. 7 iLung Microvascular EC 
iTNFalpha + IL-lbeta 


0, 


Primary Th2 act 


, „ IMicrovascular Dermal 

0.6 

I.C. none 


1.5 


Primary Trl act 


7 _ jMicrosvasular Dermal 

HC 1 \ T 1 alpha + IL-lbeta 


0.8 


Primary Thl rest 


Bronchial epithelium 
LNFalpha - ILlbeta 


,3 


Primary Th2 rest 


2 ^ Small airway epithelium 
mone 


0.6 


Primary Trl rest 


. ISmall airway epithelium 
|TNFalpha + 11 -lbeta 


0.0 
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CD45RA CD4 
lymphocyte act 


4.1 


Coronery artery SMC rest 


0.9 


CD45RO CI )4 
lymphocyte act 


10.9 


Coronery artery SMC 
TNFalpha + IL-lbeta 


0.0 


CD8 lymphocyte act 


6.6 


Astrocytes rest 


2.6 


Secondary CD 8 
lymphocyte rest 


17.0 


Astrocytes TNFalpha + 
IL-lbeta 


2.1 


Secondary CD8 
lymphocyte act 


6.0 


KU-8 12 (Basophil) rest 


2.2 


CD4 lymphocyte none 


2Q jKU-8 12 (Basophil) 
PMA/ionomycin 


10.2 

* "" ""1 


2ryThl/Th2/Trl_anti- I 
L-JJyj CHI 1 


24 CCD 1106 

( Keratinocytes) none 


6.8 


LAK cells rest 


CCD 1106 
2.0 j(Keratinocytes) 

TNFalpha + IL-lbeta 


25.7 


LAK cells 1L-2 


16.2 Liver cirrhosis 


12.0 


LAK cells IL-2+IL-12 


12.8 iLupus kidney 


5.1 


LAK cells IL-2+IFN 
gamma 


15.6 


NCI-H292 none 


_____ 
44.8 


LAK cells 1L-2+ 1L-18 


7.4 


NCI-H292 II -4 


37.6 


LAK cells 
PMA/ionomycin 


3.4 


NCI-H292 IL-9 


41.2 


NK Cells IL-2 rest 


9.0 


NCI-H292 1L-I3 


19.8 


Two Way MLR 3 day 


10.5 NCI-H292 I FN gamma 


30.1 


Two Wa\ MLR 5 da> 


7.2 jHPAEC none 


1.2 


.Two Way MLR 7 day 


HPAEC TNF alpha + I L- 
cV ' 1 bcia 


3.3 


PBMC rest 


0.5 Lung fibroblast none 


0.9 


PBMC PWM 


_ „ Lung fibroblast TNF 
alpha hlL-1 beta 


0.7 


PR Mr PI 1 4-T 


1.0 Lung fibroblast 11-4 


0.5 


Ramos ( B cell) none 


0.0 1 ung fibroblast IL-9 


0.0 


Ramos (B cell) 
ionomycin 


0.0 {Lung fibroblast IL- 13 


0.9 


B lymphocytes PWM 


j |Lung fibroblast IFN 
J gamma 


!f 


B lymphocytes CD40L 
and TL-4 


, „ iDermal fibroblast 
CCD1 070 rest 


l.l 


EOL-1 dbcAMP 


ft n IDermal fibroblast 

.CCD 1070 TNF alpha 


18.9 


hOL-1 dbcAMP 

PM \ ioIHMP\. ill 


0 „ IDermal fibroblast 
CCD1070 II -1 beta 


1.9 


Dendritic cells none 


|4q Dermal fibroblast II N 
J gamma 


0.0 
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Dendritic cells LPS 




8.9 


Dermal fibroblast 11 -4 


1 5 


Dendritic cells anti- 
CD40 




7.9 


IBD Colitis 2 


„ 

2.9 


Monocytes rest 




0.0 


I BD Crohn's 


1.9 


Monocytes LPS 




0.6 


(Colon 


82.9 


Macrophages ivst 




40.3 


jLung 


9.7 


Maciophages LPS 




6.1 


jThymns 


100.0 


HUVEC none 


1.1 Kidney 


1.8 


HUVEC starved 


v» .'. . . 





Table AEE . Panel 5 Islet 



Tissue Name 


Rel. Kx|>.(%) 
Ag3540, Run 
242386396 


Tissue Name 


Rel. Exp.(%) | 
Ag3540,Run 
242386396 


97457_Patient- 
02go_adipose 


3.3 


94709_Donor 2 AM - A_adipose 


9.1 


97476_Patient- 
07sk_skeletal muscle 


0.8 


94710_Donor 2 AM - B_adipose 


1.6 i 

™ - ~ I 


97477_Patient- 
07ut_uterus 


0.0 


9471 l_Donor 2 AM - C_adipose 


1-4 I 


97478_Patient- 
07pljplacenta 


12.9 


94712_Donor 2 AD - A_adipose 


2.8 1 


99 1 67_Bayer Patient 
1 


15.5 


94713_Donor 2 AD - B adipose 


5.8 1 


97482 Pulient- 
08ut_uterus 


3.4 


94714JDonor 2 AD - C_adipose 


— — ^ 

4.2 | 


97483 Pdtienl- 
OSpl placenta 


3.4 
100.0 


94742_Donor 3 U - 

A Mesenchymal Stem Cells 

94743_Donor 3 U - 

B Me_eiidi}m il -.ten Cells 


3.0 
1.1 


97486_Patient- 
09sk_skeletal must lc 


"-IS" Pinuii- 
09ut_uterus 


,.« 


94730_Donor 3 AM - A adipose 


4.3 


97488_Patient- 
f *^>p ! placenta 


2.6 9473 1 Donor 3 AM - B_adipose 


2.0 


97492_Patient- 
I0ut_uterus 


3.1 94732_Donor 3 AM - C_adipose 


2.0 


97493 Patient- 
10pl_placenta 


23.2 j94733_Donor 3 AD - Aadipose 


10.7 

— — — H 


97495_Patient- 
llgoadipose 


0.8 94734_Donor 3 AD - Badipose 


3.0 I 


97496_Pali.ni 

1 lsk skeletal muscle 


0.0 j94735_Donor 3 AD - C_adipose 




97497_Patient- 
1 1 ut_ uterus 


,5 


77 1 38_Liver_HepG2untreated 


__ 

01 
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97498 Patient- 
llpl_placenta 


g ^ 73556_Heart_Cardiac stromal 
(cells (primary) 


0.0 


97500_Patient- 
12go_aclipose 


6.5 8173 5_Small Intestine 


4.8 


97501_Patienl- 
12sk ske Liul muscle 


4 . |72409_Kidney_Proximal 
Convoluted 1 iibule 


0.7 


97502 Patient- 
12ut_uterus 


6 ? 82685_Small 

! * i!ItC,tl*Tj_Dll'jdcri!!!i! 


3.6 


97503_Patient- 
12pl_placenta 


2 ^ |90650_Adrenal_Adrenocortical 
sad en orn a 


0.6 


94721_Donor2U- 
AJVIesenchymal 
Stem Cells 


2.2 72410JCidney_HRCE 


8.0 


94722_Donor2U- 
B_Mesenchymal 
Stem Cells 


0.6 |72411_Kidney_HRE 


8.5 


94723_Donor2U- 
C_Mesenchymal 
Stem Cells 


3 j 1 73 1 3 9_Uterus_Uterine smooth 
i muscle cells 


0.0 



CNS_neurodegeneration_vl.O Summary: Ag3540 - This panel confirms the expression 
of this gene at low levels in the brain in an independent group of individuals. However, no 
differential expression of this gene was detected between Alzheimer's diseased postmortem 
brains and those of non-demented controls in this experiment. 



General_screening_panel_vl.4 Summary: Ag3540 This gene is most highly expressed 
in a breast cancer cell line (CT=27.1). Thus, expression of this gene could be used to 
differentiate between this sample and other samples on this panel and as a marker to detect 
the presence of breast cancer. Furthermore, therapeutic modulation of the expression or 
function of this gene may be effective in the treatment of breast cancer. 

Among metabolic tissues, this gene, an acyl coA thioesterase homolog, has a low 
level of expression in adipose, adult and fetal liver, adrenal, thyroid and pancreas. Acyl 
CoA thioesterases have multiple roles in lipid homeostasis. Therefore, therapeutic 
modulation of this gene product may be a treatment for endocrine and metabolic disease, 
including Types 1 and 2 diabetes and obesity. 

In addition, this gene is expressed in all CNS regions examined. Thus, therapeutic 
modulation of the expression or function of this gene may be effective in the treatment of 
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neurologic disorders such as Alzheimer's disease, Parkinson's disease, epilepsy, stroke, 
schizophrenia and multiple sclerosis. 

References: 

1 . Hunt MC, Alexson SE. The role Acyl-CoA thioesterases play in mediating 
5 intracellular lipid metabolism. Prog Lipid Res. 2002 Mar;41(2):99-130. 



2. Hunt MC, Nousiainen SE, Huttunen MK, Orii KE, Svensson LT, Alexson SE. 
Peroxisome proliferator-induced long chain acyl-CoA thioesterases comprise a highly 
conserved novel multi-gene family involved in lipid metabolism. J Biol Chem. 1999 Nov 
10 26;274(48):343 17-26. 

Panel 4D Summary: Ag3540 Highest expression of the CG59309-01 gene is seen in the 
thymus and colon (CTs=31.5). Significant levels of expression are also seen in a cluter of 
treated zmd untreated samples derived from the NCI-H292 mucoepidermoid cell line. Thus, 
expression of this gene could be used as a marker for thymus and colon. Furthermore, 

15 therapeutic modulation of the expression or function of this gene may regulate T cell 
development in the thymus and reduce or eliminate the symptoms of T cell mediated 
autoimmune or inflammatory diseases, including asthma, allergies, inflammatory bowel 
disease, lupus erythematosus, or rheumatoid arthritis. Additionally, small molecule or 
antibody therapeutics designed against this putative protein may disrupt T cell development 

20 in the thymus and function as an immunosuppresant for tissue transplant. 

Panel 5 Islet Summary: Ag3540 This gene has moderate expression in skeletal muscle, 
(highest expression CT=30.5). Acyl CoA thioesterases function in peroxisomal fatty acid 
oxidation. Therefore, therapeutic modulation of this homolog may increase fatty acid 
oxidation in muscle and be a treatment for Type 2 diabetes and obesity. 

25 References: 

1 . Hunt MC, Solaas K, Kase BF, Alexson SE. Characterization of an acyl-coA 
thioesteirase that functions as a major regulator of peroxisomal lipid metabolism. J Biol 
Chem. 2002 Jan 1 1;277(2):1 128-38. 
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AF. CG57364-01: CG6896 



Expression of gene CG573 64-01 was assessed using the primer-probe sets Ag3218 
and Ag3378, described in Tables AFA and AFB. Results of the RTQ-PCR runs are shown 
in Tables AFC, AFD, AFE and AFF. 



Table AFA . Probe Name Ag3218 



Primers 


Sequences 


Length 


Start 
Position 


| SEQ ID 

! NO: 


FOFW at'CI 


5 1 -ctcctgaagcaggtcctctt-3 1 


20 


249 


490 


Probe 


TET-5 ' -cctcccagtgttgtccttctggagg-3 ' - 
TAMB \ 


25 


270 


j 491 


Reverse 


5 1 -gacttcttccaggtcatttcg-3 1 




303 


' 492 


Table AFB. Probe Name Ag3378 


Primers 


Sequences 


Length 


Start 
Position 


1 SEQ ID 

I NO: 


For\< arci 


5 1 -CtCCtgaaqcaqgtcctCtt-3 ' 




249 


493 


Probe 


TET-5 ' -cctcccagtgttgtccttctggagg-3 1 - 
TAMEA 


25 


270 


494 


Reverse 


5 ' -gacttcttccaggtcatttcg-3 ' 


21 ' 


_____ 





Table AFC . CNS_neurodegeneration_vl.0 



Tissue Name 


Rel. Exp.(%) 
Ag3218, Run 
209861784 


Rel. Exp.(%) 
Ag3378, Run 
21(11545^3 


Tissue 
Name 


Rel. Exp.(%) 
Ag3218, Run 
209861784 


Rel. Exp.(%) 
Ag3378, Run 
2 101 54573 


AD 1 Hippo 


37.6 


30.4 


Control 
(Path) 3 
Temporal 
Ctx 

Control 
(Path) 4 
Temporal 
Ctx 


17.6 


16.7 


AD 2 Hippo 


31.0 


37.6 


37.6 


31.2 

— 1 


AD 3 Hippo 


34.2 


21.5 


AD 1 

Occipital 

Ctx 


56.3 


40.3 


AD 4 Hippo 


40.6 


25.3 


AD 2 

Occipital 

Ctx 

(Missing) 


0.0 


0.0 


AD 5 hippo 


100.0 




\D3 


43.2 


24.1 
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Occipital 
jCtx 






AD 6 Hippo 


62.9 


55.9 


AD 4 
Occipital 
( tx 


80.1 


24.3 


Control 2 
Hippo 


55.1 


52.9 


AD 5 

Occipital 

Ctx 


17.9 


25.2 


Control 4 
Hippo 


35.4 


39.5 


Occipital 
Ctx 


66.9 


55.5 


Control (Path) 
3 Hippo 


22.8 


26.8 


Control 1 
Occipital 
Ctx 


27.9 


17.4 


AD 1 Temporal 
Ctx 


40.3 


28.3 


Control 2 
Occipital 
( :tx 


94.0 


64.6 


AD 2 Temporal 
Ctx 


83.5 


94.6 


Control 3 
Occipital 
Ctx 


«, 


40.6 


AD 3 Temporal! _„ „ 
Ctx J 30 - 8 


24.5 


Control 4 
Occipital 
Ctx 


20.3 


22.5 


AD 4 Temporal 
Ctx 


61.1 


Control 

26.8 < Path > 1 , 
Occipital 

JCtx 


79.6 


51.4 




AD5Inf 
Temporal Ctx 


84.7 


(Control 

ioo.o L Path > 2 , 

jOccipital 

(Ctx 


34.4 


24, 


AD 5 

SupTemporal 
Ctx 


55.9 


(Control 

39.8 < Pa,h >, 3 , 
(Occipital 

Ctx 


1 " 

25.2 


16.2 


AD 6 Inf 
Temporal Ctx 


47.0 


(Control 

4 ^ jOccipital 

JCtx 


76.3 


45.1 


AD 6 Sup 
1 emporal Ctx 


63.7 


41 „ (Control 1 
41,2 Parietal C't\ 


31.0 


21.9 


Control 1 
Temporal Ctx 


32.8 


(Control 2 
|ParietalCtx 


67 4 


45.1 

L. , 


Control 2 
j m] oi >! ( 


52.1 


39.2 


Control 3 
Parietal Ctx 
Control 
(Path) 1 


31.4 


29.3 


Control 3 
Temporal Ctx 


34.9 


28.1 


48.6 




58.6 
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Parietal Ctx 




— 
27.0 


Control 4 
Temporal Ctx 


62.9 


36.3 


Control 
(Path) 2 
Parietal Ctx 


46.3 


Control (Path) 
1 Temporal Ctx 


75.8 


_ 
50.0 


Control 
(Path) 3 
Parietal Ctx 


26.1 


23.8 

_J 


Control (Path) 
2 Temporal Ctx 


56.6 


(Control 
41.8 (Path) 4 

(Parietal Ctx 


48.0 


54.3 





H Table A FP. Panel 1 .3D 

O 

O - _ 



ru 



Tissue Name 


Rel. 

Exp.(%) 
Ag3218, 
Run 
168013878 


Rel. 

Exp.(%) 
Ag3378, 
Run 
165674263 


Tissue Name 


Rel. 

Exp.(%) 
Ag3218, 
Run 
168013878 


Rel. 

Exp.(%) 
Ag3378, 
Run 
165674263 


Liver 

adenocarcinoma 


10.7 


20.2 


Kidney (fetal) 


48.3 


13.9 


Pancreas 


10.8 


13.1 


iRenal ca. 786- 

|o 




10.4 


Pancreatic ca. 
CAPAN 2 


9.6 


5.4 


Renal ca. 

IA498 


19.2 


14.9 


Adrenal gland 


5.1 


18.4 




Renal ca. RXF 
393 


39.0 


33.2 


Thyroid 


12.3 


33.2 


;Renal ca. 
[ACHN 


12.1 


n.3 


Salivary gland 




5.1 




5.5 

— 


IRenal ca. UO- 

.! 31 


18.9 


■ 7.8 


Pituitary gland 


21.5 


74.7 


Renal ca. TK- 
10 


20.0 


10.1 


Brain (fetal) 


19.5 


36.1 






18.0 


8.7 


Brain (whole) 


22 1 


29.9 


(Liver (fetal) 


5.5 


25.3 


Brain (amygdala) 


,7 4 


46.7 


Liver ca. 
j(hepatoblast) 
HepG2 


14.2 


14.1 


Brain (cerebellum I 


25.2 


23.5 


jLung 


14.1 


18.7 


Brain 

1 hippocampus) 


28.1 


85.9 


Lung (fetal) 


17.2 


4.0 


Brain (substantia 
nigra) 


11.5 


16.7 


Lung ca. 
|(small cell) 

Ilx-i 


6.5 


14.8 


Brain (thalamus) 


57.0 


67.4 


jLung ca. 
(small cell) 
N< [-H69 


20.6 


4.8 
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Cerebral Cortex 


75.8 


Lung ca. 
36.9 |(s.cell var.) 
SHP-77 


100.0 


39.8 


Spinal cord 


9.7 


jLung ca. 

13 2 l (large 

|cell)NCI- 

JH460 


5.0 


37.1 


glio/astro U87- 
\H. 


22.8 


. , 1 iniL- c.i i lor- 
ism. cell) A549 


27.7 


13.6 


gllO/abliO LJ-llo- 

MG 


37.4 


79.6 


Lung ca. (non- 
s.cell) NCI- 
1 [23 


61., 


44.1 


astrocytoma 
SW1783 


29.9 


14.9 


Lung ca. (non- 
s.cell) HOP- 
62 


29.9 


13.7 


neuro*; met SK- 
N-AS 


17.1 


jLung ca. (non- 
52.5 Is.cl) NCI- 
1 1522 


11.3 


3.1 


astrocytoma SF- 
539 


15 , 


jLung ca. 
lo.u j(squam.) sw 
1900 


23.2 


13.5 


astrocytoma SNB- 
75 


43.8 


Lung ca. 
50.0 j(squam.) NCI- 
IH596 


41.5 


10.2 


glioma SNB-19 


17.9 


2^ 2 (Mammary 
i i tnd 


14.8 


35.1 


glioma U251 


47.6 


Breast ca/ 
■ (pl.el) VK'F-7 


48.6 


39.0 


glioma SF-295 


12.3 


Breast ca.' 
10.7 |(pl.ef) MDA- 
MB-231 


25.9 


r,~r- 

60.7 


Heart (fetal) 


38.4 


„ „ jBreast ca.* 
|(pl.er) T47D 


77.4 


21.2 


Heart. 


3.5 


c n Breast ca. BT- 
5 -° [549 


47.0 


95.9 


Skeletal muscle 
(fetal) 


17.0 


10 0 I Breastca - 
,UU JMDA-N 


16.6 


7.3 


Skeletal muscle 


4.4 


7.2 Ovary 


10.1 


4.7 


Bone marrow 


1.3 


. . jOvarian ca. 
OVCAR-3 


36.3 


31.2 


Thymus 


13.9 


n „ iOvarianca. 
OVCAR-4 


33.0 


20 : 7 4 


Spleen 


2.6 


. 0 « Ovarian ca. 
OVCAR-5 


42.6 


15.7 | 


Lymph node 


1.7 


0 'Ovarian ca. 
V JOVCAR-8 


8.7 
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Colorectal 


18.2 


. „ Ovarian ca. 
U * |lGROV-l 


11.3 


15.1 


Stomach 


14.8 


j Ovarian ca.* 
33.7 (ascites) SK- 
iOV-3 


43.5 


17.0 


Small intestine 


18.3 


66.!! (Items 


10.5 


21.8 


Colon ca. SW480 


12.9 


14.2 Placenta 


2.6 


15.0 


Colon ca.* 

SW620(SW480 

met) 


17.0 


14.2 (Prostate 


11.7 


30.6 


Colon ca. HT29 


17.2 


Prostate ca.* 
18.8 (bone met)PC- 

I 3 _ 


35.4 


40.3 


Colon ca. HCT- 
116 




16.5 


18.2 (Testis 


23.3 


100.0 


Colon ca. CaCo-2 


20.9 


_ , Melanoma 
Hs688(A).T 


5.0 


1.4 


Colon ca. 
tissue(OD03866) 


14.7 


Vic Ian- n a 
21.9 j(met) 

|Hs688(B).T 


6.0 


3.5 


Colon ca. HCC- 
2998 


22.1 


. . iMelanoma 
iUACC-62 


14.3 


12.2 


Gastric ca.* (liver 
met)NC I-N87 


48.6 


„_ . iMelanoma 


3.1 


8.2 


Bladder 


6.2 


Melanoma 
I. OX 1MVI 


30.1 


84 


Trachea 


12.8 


iMelanoma* 
49.3 [(met) SK- 
MIX-5 


21.8 


13.1 


Kidney 


43.5 


35.4 (Adipose 


9.2 


3.0 



Table AFE . Panel 2.2 



Tissue Name 


Rel. Exp.(%) 

Ag3218,Run j Tissue Name 
174416494 f 


Rel. Exp.(%) 
Ag3218, Run 
174416494 


Normal Colon 


]KidneyMarg^ 
](OpO«48)_ 


70.2 


Colon cancer 
(OD06064) 


- f. Kidney malignant 

jcancer (OD06204B) 


3.9 


Colon Margin 
(OD06064) 


jKidney normal 
3.6 adjacent tissue 
](OD06204E) 


6.7 


Colon cancer 
(OD06159) 




Kidney Cancer 1 _ 1 
<<>l)04450-01) 13,1 


Colon Margin 


. _ 7 -" . 


Kidney Margin 3.1 
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3 



(OD06159) 


;(OD04450-03) 




Colon cancer 
(OD062 97-04) 


2.6 


Kidney Cancer 
8120613 


2.5 


Colon Margin 
(OD06297-05) 


5.6 


Kidney Margin 
8120614 


18.2 


CC Gr.2 ascend colon 
(01)03921) 


20.0 


Kidney Cancer 
9010320 


2.4 


CC Margin (OD03921) 


13.7 


Kidney Margin 
0010321 


4.4 


Colon cancer metastasis 
\yjLj\jo iut-j 


0.0 


Kidney Cancer 
8120607 


23.0 


Lung Margin 


11.0 


Kidney Margin 
8120608 


15.1 


Colon mets to lung 


29.9 


Normal Uterus 


2.3 


Lung Margin 

{(JLJLHhj i -UZ ) 


0.3 


Uterine Cancer 064011 


6.1 


Normal Prostate 


5.6 


Normal Thyroid 


6.6 


Prostate Cancer 
(OD04410) 


3.9 


Thyroid Cancer 
064010 


6.8 


prostate Margin 

\//~\r\f\A a i 


6.1 


Thyroid Cancer 
A302152 


11.9 


]— 

JNormal Ovary 


7.0 


Thyroid Margin 
A302153 


7.7 


lOvarian cancer 

i{\JL)UOZOD-\Jj ) 


1.3 


Normal Breast 


3.4 


(Ovarian Margin 
(OD06283-07) 


0.0 


Breast Cancer 
(OD04566) 


9.9 


Ovarian Cancer 064008 


31.2 


Breast Cancer 1024 


16.8 


Ovarian cancer 
(OD06145) 


3.5 


Breast Cancer 
(OD04590-01 i 


100.0 


Ovarian Margin 
(OD06145) 


8.4 

i - 


Breast Cancer Mets 
,ODiM-- >0 03) 


26.2 


Ovarian cancer 
|(OD06455-03) 


13.7 


Breast Cancer 
Metastasis (OD04655- 
05) 


36.3 


lOvarian Margin 
j(OD06455-07) 


1 . 1 iBreast Cancer 064006 


5.4 


jNormal Lung 


5.4 


Breast Cancer 9 100266 


12.8 


(invasive poor diff. lung 
adeno(OD(_)4945-01 


14.5 


Breast Margin 
9100265 


1.0 


jLung Margin 
(ODO4945-03) 


2.7 


Breast Cancer 
A209073 


3.3 


jLung Malignant Cancer 
|(OD03126) 


,.s 


Breast Margin 
A2090734 


11.7 
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Lung Margin 
(OD03126) 


5.1 


_ — _ 

Breast cancer 
(OD06083) 


6.9 


Lung Cancer 
(OD05014A) 


12.8 


Breast cancer node 
metastasis (OD06083) 


10.7 


Lung Margin 
(OD05014B) 


3.3 


Normal Liver 


9.4 




Lung cancer 
(OD06081) 


6.3 


Liver Cancer 1026 


2.6 


Lung Margin 
(OD06081) 


2.7 


Liver Cancer 1025 


9.7 


Lung Cancer 

l/i'Yr^ft/IOI'7 ft'W 
'AKJUU^Zj /-(II) 


12.9 


Liver Cancer 6004-T 


10.4 


Lung Margin 

l\\JLJ\JtZj /-\JZ ) 


6.4 


Liver Tissue 6004-N 


5.3 


jOcular Melanoma 
[Metastasis 


4.6 


Liver Cancer 6005-T 


4.2 


Ocular Melanoma 
Margin (Liver) 


0.1 

. — 


Liver Tissue 6005-N 


11.5 


Melanoma Metastasis 


1 .o 

i ^, 


Liver Cancer 064003 


22.5 


Melanoma Margin 
(Lung) 


4.6 


Normal Bladder 


6.1 


jNormal Kidney 


10.4 


Bladder Cancer 1023 


10.8 


(Kidney Ca, Nuclear 
grade 2 (OD04338) 


14.6 


Bladder Cancer 
A302173 


15.1 


Kidney Margin 
(OD04338) 


10.5 


Normal Stomach 


15.0 


Kidney Ca Nuclear 
grade 1/2 (OD04339) 




44.8 


Gastric Cancer 
9060397 


7, 


Kidney Margin 
(OD04339) 


17.7 


Stomach Margin 
9060396 


10.4 


jKidney Ca, Clear cell 
(type (OD04340) 


5.3 


Gastric Cancer 
9060395 


8.4 


i Kidney Margin 
(OD04340) 


25.3 


Stomach Margin 
9060394 


10.4 


Kidney Ca, Nuclear 
grade 3 (OD04348) 


7.5 




Gastric Cancer 064005 


7.7 



Table AFF . Panel 4D 





Rel. 


Rel. 




Rel. 


Rel. 




Exp.(%) 


Exp.(%) I 




Exp.(%) 


Exp.(%) 


Tissue Name 


Ag3218, 


Ag3378, 


Tissue Name 


Ag3218, 


Ag3378, 




Run 


Run 




Run 


Run 




164682519 


165296553 




164682519 


165296553 


Secondary Thl act 


18.2 


25.7 


HUVEC IL-lbeta 


14.5 


12.9 
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Secondary Th2 act 


39.0 


2g g JHUVEC 1FN 
~ > U gamma 


47.0 


25.3 


Secondstry Trl act 


33.2 


III \ I C rNF 

19.1 [alpha + IFN 
gamma 


43.5 


45.1 


Secondary Thl rest 9.5 


III VEC rNF 

(alpha + IL4 


37.1 


48.0 


Secondary Th2 rest i 1 1 .2 


5.1 IUJVECIL-11 


43.5 


18.0 


Secondary Trl rest 


22.7 


Lung 

8.0 iMicrovascular EC 
Inone 


16.8 


61.6 


Primary Thl act 


43.2 


'Lung 

?7 g Microvascular EC 
ITNFalpha + IL- 
jlbeta 


18.6 


14.7 


Primary Th2 act 


30.1 


^2 q iMicrovascular 
IDermal EC none 


31.2 23.8 


Primary Trl act 


24.7 


14.4 


Microsvasular 
Dermal EC 
TNFalpha + IL- 
lbeta 


66.4 


22.1 


Primary Thl rest 


25.2 


17.4 


Bronchial 
epithelium 
TNFalpha + 
ILlbeta 


30.6 


29.3 


Primary Th2 rest 


15.5 


7.5 


Small airway 
epithelium none 


36.1 


24.3 


Primary Trl rest 

[ 


21.3 


Small airway 
f 7 lepithelium 

ITNFalpha + IL- 

[j[ b fta 


76.3 


62.9 


lymphocyte act 


35.4 


1 , , jCoronery artery 
[SMC rest 


49.7 28.1 


ICD45RO CD4 
lymphocyte act 


27.9 


25.9 


Coronery artery 
SMC TNFalpha + 
IL-lbeta 


25.3 


23.7 


CD 8 lymphocyte 
act 


21.0 


14.8 


Astrocytes rest 


22.2 


31.2 


Secondary CD8 
lymphocyte rest 


39.2 


17.8 


Astrocytes 
TNFalpha + IL- 
lbeta 


26.1 


25.0 


Secondary CD8 
lymphocyte act 


20.9 


7.4 


KU-812 
(Basophil) rest 


90.8 


85.3 


CD4 lymphocyte 
none 


4.5 


11.8 


KU-812 

(Basophil) 

PMA/ionomycin 


87.1 


72.2 


!_". 


2.6 


100 


CCD 11 06 


36.6 


36.9 
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Thl/Th2/Trl_anti- 
CD95 CHI 1 






(Keratinocytes) 
none 






LAK cells rest 


11.7 


CCD! 106 
1 _ » l(Keratinocytes) 
TNI- alpha • IL- 

— — — 4^ a 


33.4 


20.4 


LAK cells IL-2 


6.8 


27.5 Liver cirrhosis 


25.5 


19.9 


LAK cells IL- 

2+IL-12 


37.1 


1 1 .6 (Lupus kidney 


44.4 


15.7 


LAK cells IL- 


20.7 


19.1 |NCI-H292 none 


79.6 


64.6 


LAK cells IL-2+ 
IL-18 


21.9 


14.7 NCI-H292 IL-4 


85.3 


96.6 


LAK cells 


4.7 


3.3 INCI-H292 IL-9 


100.0 


98.6 

: 


NK Cells IL-2 rest 


11.9 


11.3 |NCI-H292 IL-13 


68.8 | 50.7 


Two Way MLR 3 
day 


23.7 


no INCI-H292IFN 
[gamma 


80.1 56.6 

J J 


Two Way MLR 5 
day 


12.5 


6.1 HPAEC none 


38.4 


27.2 


Two Way MLR 7 
day 


12.3 


8? MI'AK IM 

[alpha + IL-l beta 


42.6 


43.2 


PBMC rest 


6.0 


^ ^ jLung fibroblast 
|none 


31.2 


21.3 


PBMC PWM 


40.3 


27.7 


Lung fibroblast 
TNF alpha + IL-l 
beta 


14.7 


24.5 


PBMC PHA-L 


37.9 


17.7 


Lung fibroblast 
IL-4 


47.0 


42.6 


Ramos (B cell) 
none 


11.7 


14.9 


Lung fibroblast 
IL-9 


49.3 


30.6 


iRamos I B cell) 
lonomycin 


33.9 


„, , „ Lung fibroblast 
268 ilL-13 


36.6 


42.6 


B lymphocytes 
PWM 


33.7 


4 q 9 jLung fibroblast 
U N gamma 


44.8 


22.5 


B lymphocytes 
CD40L and IL-4 


34.4 


, Dermal fibroblast 
CCD 1070 rest 


33.7 


47.3 


EOL-1 dbcAMP 


50.0 


SDermal fibroblast 
28.1 [CCD1070TNF 
jalpha 


47.3 


33.2 

— 


|EOL-l dbcAMP 
PMA/ionomycin 


44.1 


32.1 


Dermal fibroblast 
CCD 1070 IL-l 
beta 


50.0 


34.6 


Dendritic cells 
none 


33.9 


19.6 


Dermal fibroblast 
IFN gamma 


24.0 


34.4 


(Dendritic cells LPS 


21.4 


10.2 


Dermal fibroblast 


24.3 


32.8 
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I JlL-4 






Dendritic cells 
anti-CD40 


49.7 


33.9 jlBD Colitis 2 


6.0 


1 1.7 


Monocytes rest 


10.7 


10.3 jlBD Crohn's 


25.3 


26.1 


Monocytes LPS 


30.6 


9.3 (Colon 


70.7 


100.0 


Macrophages rest 


41.2 


33.7 jLung 


64.6 


17.7 


Macrophages LPS 


20.0 


7.5 


1 hymus 


80.7 


56.3 


HUVEC none 


2ft.X 


29.5 


Kidney 


26.4 


41.5 


HUVEC starved 


26.2 


37.6 









CNS_neurodegeneration_vl.O Summary: Ag3218/Ag3378 - Two different experiments 
using probe/primer sets with the same sequence are in very good agreement. This panel 
confirms the expression of this gene at low levels to moderate levels in the brain in an 
independent group of individuals. However, no differential expression of this gene was 
detected between Alzheimer's diseased postmortem brains and those of non-demented 
controls in this experiment. Please see Panel 1 .3D for a discussion of the potential utility of 
this gene in treatment of central nervous system disorders. 

Panel 1.3D Summary: Ag3218/Ag3378 - Two different experiments using probe/primer 
sets with the same sequence are in good agreement. Highest expression is seen in testis and 
a lung cancer cell line (CTs=30-31). This panel confirms the expression of this gene at low 
levels in the brain. Therefore, this gene may play a role in central nervous system disorders 
such as Alzheimer's disease, Parkinson's disease, epilepsy, multiple sclerosis, schizophrenia 
and depression. 

This gene product is also expressed in adipose, pancreas, thyroid, pituitary, heart, 
1 5 and liver. This widespread expression in tissues with metabolic function suggests that this 
gene product may be important for the pathogenesis, diagnosis, and/or treatment of 
metabolic and endocrine diseases, including obesity and Types 1 and 2 diabetes. 

Based on expression in this panel, this gene may be involved in gastric, pancreatic, 
brain, colon, renal, lung, breast, ovarian and prostate cancer as well as melanomas. Thus, 
20 expression of this gene could be used as a diagnostic marker for the presence of these 
cancers. Furthermore, therapeutic inhibition using antibodies or small molecule drugs 
might be of use in the treatment of these cancers. 
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Panel 2.2 Summary: Ag3218 - This gene is expressed at low to moderate levels in many 
samples on this panel, with the highest levels of expression in breast cancer sample 
OD04590-01 (CT=30.3). This gene is expressed in a cluster of breast cancer samples with 
no expression in normal breast (CT>35). Similarly, this gene is expressed in ovarian cancer 
5 samples at higher levels than the matched margin samples. 

Interestingly, this gene is expressed at higher levels in kidney cancer margin 
samples than in the matched cancer samples. 

This gene is homologous to a mouse myosin phosphatase targeting subunit (MYPT) 
which have been found to play a role in cell division. MYPT undergoes mitosis-specific 
10 phosphorylation which is reversed during cytokinesis. 

References: 

1 . Totsukawa G, Yamakita Y, Yamashiro S, Hosoya H, Hartshorne DJ, Matsumura 
F. Activation of myosin phosphatase targeting subunit by mitosis-specific phosphorylation. 
J Cell Biol 1999 Feb 22; 144(4): 73 5 -44. 

15 Panel 4D Summary: Ag3218/Ag3378 - Two different experiments using probe/primer 
sets with the same sequence are in very good agreement. Highest expression is seen in the 
colon and a mucoepidermoid cell line (CTs=30-32). This gene is expressed at low to 
moderate levels in a wide range of cell types of significance in the immune response in 
health and disease. These cells include members of the T-cell, B-cell, endothelial cell, 

20 macrophage/monocyte, and peripheral blood mononuclear cell family, as well as epithelial 
and fibroblast cell types from lung and skin, and normal tissues represented by colon, lung, 
thymus and kidney. This ubiquitous pattern of expression suggests that this gene product 
may be involved in homeostatic processes for these and other cell types and tissues. This 
pattern suggests a role for the gene product in cell survival and proliferation. Therefore, 

25 modulation of the gene product with a functional therapeutic may lead to the alteration of 
functions associated with these cell types and lead to improvement of the symptoms of 
patients suffering from autoimmune and inflammatory diseases such as asthma, allergies, 
inflammatory bowel disease, lupus erythematosus, psoriasis, rheumatoid arthritis, and 
osteoarthritis. 
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AG. CG59241-01: Amiloride-sensitive sodium channel 
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Expression of gene CG59241-01 was assessed using the primer-probe set Ag3407, 
described in Table AGA. Results of the RTQ-PCR runs are shown in Tables AGB, AGC 
and AGD. 



Table AG A. Probe Name Ag3407 



Primers 


Sequences 


Length 


Start 
Position 


SEQID | 

NO: 1 


Forward 


5 ' -gtcaccctctgcaacactaatg-3 ' 


22 


268 


496 


„ , ItET-5 ' -ctgtcccagctcagctaccctgactt-3 ' - 

n " 0De [tamra 


26 


298 


497 


Reverse is ' -tttcatccagtcccagcat-3 ' 


19 


340 


498 



Table AGB . CNS_neurodegeneration_vl.O 



"*~ 

Tissue Name 


ReI.Exp.(%)Ag3407,l 

Run 210349883 | AlssueJName 


Rel. Exp.(%) Ag3407, 
Run 210349883 


AD 1 Hippo 


18.4 


Control (Path) 3 
Temporal Ctx 


4.1 


ATI 0 T4ir\TM-\ 
r\LJ Z flippo 


29.7 


Control (Path) 4 
I . rnp -ral Ctx 


40.3 


AD 3 Hippo 


18.3 


KD\( )ccipital 
Ctx 


36.9 


AD 4 Hippo 


5.4 


AD 2 Occipital 
Ctx (Missing) 


0.0 


AD 5 hippo 


91.4 


AD 3 Occipital 
Ctx 


19.1 


AD 6 Hippo 


on „ (AD 4 Occipital 
807 jCtx 


18.8 


Control 2 Hippo 


„ . lAD 5 Occipital 
jCtx 


18.3 




Control 4 Hippo 


\D 6 Occipital 

Ictx 


28.9 


Control (Path) 3 
Hippo 


0 „ Control 1 Occipital 
88 |Ctx 


4.3 


AD 1 Temporal Ctx 


_ 0 c IControl 2 Occipital 
28 - 5 |Ctx 


80.1 


AD 2 Temporal Ctx 


. 1 „ Control 3 Occipital 
41 JCtx 


20.2 


AD 3 Temporal Ctx 


jControl 4 Occipital 

JCtx 


6.0 


AD 4 Temporal Ctx 


, iControl (Path) 1 
jOccipital Ctx 


92.7 


AD 5 Inf Temporal 


100.0 'Control (Path) 2 


".3 " 



680 



o 



Ctx 




Occipital Ctx 


| 


AD 5 SupTemporal 
Ctx. 


56.6 


Control (Path) 3 
Occipital ( t\ 


3.0 

- 


AD 6 Inf Temporal 
Ctx 


82.4 


Control (Path) 4 
Occipital Ctx I 


41.2 


AD 6 Sup Temporal 
Ctx 


44.1 


C ontrol 1 Parietal 
Ctx 


21.9 


Control 1 Temporal 
Ctx 


15.3 


Control 2 Parietal 
Ctx 


79.0 


Control 2 Temporal 
Ctx 


24.1 


Control 3 Parietal 
Ctx 


22.2 


Control 3 Temporal 
Ctx 


[Control (Path) 1 
jParietal Ctx 


77.9 


Control 4 Temporal 
Ctx 


Control (Path) 2 
jParietal Ctx 


47.6 


Control (Path) 1 
remporal Ctx 


53.6 


Control (Path) 3 
Parietal Ctx 


6.2 


Control (P il ii 1 
Temporal Ctx 


56.6 


Control (Path) 4 
Parietal Ctx 


67.4 


Table AGC. General screening panel vl.4 


Tissiae Name 


Rel. Exp.(%) i 
Ag3407, Run Tissue Name 
216821458 


Rel-Exp.(%) j 
Ag3407, Run I 
216821458 


Adipose j 


0.2 Renal ca FK-IO* 


16.6 


Melanoma* 
Hs688(A).T 


2.3 Bladder 


0.3 


Melanoma* 
Hs688(B).T j 


0.4 


Gastric ca. (liver met.) 
NCI-N87 


8.8 


Melanoma* Ml 4 


2.0 


Gastric ca. KATO 111 


0.7 


Melanoma* 
LOXIMVI 


2.5 


Colon ca. SW-948 


3.7 


Melanoma* SK- 1 
MEL-5 


8.7 C 


Dion ca. S W480 

iloii ca.* (SW480 
et) SW620 




14.1 


Squamous cell 
carcinoma St C-4 


1.2 | C 




21.2 


Testis Pool 


0.4 jColon ca. HT29 


10.7 


Prostate ca.* (bone | 
met) PC-3 j 


4.4 


Colon ca. HCT-116 


64.2 


Prostate Pool 


2.3 


Colon ca. CaCo-2 


32.3 


Placenta 


0.5 


Colon cancel- tissue 


13.2 


Uterus Pool 


0.0 


Colon ca. SW 1 1 16 


12.5 


Ovarian ca. \ 
OVCAR-3 


8.4 


Colon ca. Colo-205 


0.3 
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Ovarian ca. SK- 
OV-3 


9.7 


Colon ca. SW-48 


0.6 


Ovarian ca. 
OVCAR-4 


1.6 


Colon Pool 


2.8 


Ovarian ca. 
OVCAR-5 


18.9 


Small Intestine Pool 


4.5 


zz : — 
Ovarian ca. 

lUIvU V 1 


4.9 


Stomach Pool 


1.4 


Ovarian ca. 
OVCAR-8 


5.9 


Bone Marrow Pool 


1.8 


( )\;u 


2.0 


Fetal Heart 


2.4 


Breast ca. MCF-7 


16.7 


Heart Pool 


0.3 


Breast ca. MDA- 
MB-231 


12.1 


Lymph Node Pool 


3.5 


Breast ca. BT 549 


22.7 


Fetal Skeletal Muscle 


1 .9 


Breast ca. T47D 


27.4 


Skeletal Muscle Pool 


0.0 


Breast c t. Ml J \-N 


4.5 


Spleen Pool 


0.0 


Breast Pool 


2.9 


Thymus Pool 


2.1 


Trachea 


9.0 


CNS cancer 
(glio/astro) U87-MG 


0.9 


Lung 


0.0 


CNS cancer 
(glio/astro) U-I18-MG 


11.7 


Fetal Lung 


10.8 


CNS cancer 
(neuro;met) SK-N-AS 


58.6 


Lung ca. NCI-N417 


1.3 


CNS cancer (astro) SF- 
539 


28.1 


Lung ca. LX-1 


21.8 


CNS cancer (astro) 


24.7 


Lung ca. NCI-H146 

— 


5.4 

- - — — 


CNS cancer (glio) 


7.3 


Lung ca. SHP-77 


11.7 


lino cancer ^gnoj or- 
295 


4.8 


Lung ca. A549 




Brain (Amygdala) Pool 




Lung cd. lMV—i-rijZD 


0 0 


Brain (cerebellum) 


36 1 


T una m Tvf 1 1 f - 


7 4 




100 0 


Lung ca. NCI-H460 


5.4 


Brain (Hippocampus) 
Pool 


5.6 


Lung ca. HOP-62 


2.9 


( srebral ( ortex Pool 


5.6 


Lungca. NCI-H522 


8.5 


Brain (Substantia 
..i - i) Pool 


7.1 


Liver 


0.0 


Brain (Thalamus) Pool 


11.3 


Fetal Li ver 


0.0 


Butin (\\ hole) 


13.4 


Liver ca. HepG2 


0.8 


Spinal Cord Pool 


12.7 


Kidney Pool 


i, 


Adrenal Gland 


°:° 
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Fetal kidney 


3.7 


Pituitar> gland Pool 


0.0 


|Renafca. 786-0 


1.7 


Salivary Gland 


0.9 


Renal ca. A498 


| H. 


Thyroid (female) 


0.0 


iRenal ca. ACHN 




Pancreatic ca. 
CAPAN2 


2.3 


Renal ca. UO-31 


0.2 


Pancreas Pool 


2.6 



Table AGP . Panel 4D 



Tissue Name 


1} ,.T 17, . „ (0/\ 

tsQY. H/Xp.^ /o) 

Ag3407, Run 
165296462 


Tissue Name 


T? Y rk (0/\ 

Ag3407, Run 
165296462 


Second; i\ I hi act 


7.9 


HUVEC IL-lbeta 


0.0 


Secondary Th2 act 


Ill 


xlu VJbU irJN gamma 




Secondary Trl act 


40.1 


HUVEC TNF alpha + 
I FN gamma 


0.0 


Secondary Th 1 rest 


4.4 


HUVEC TNF alpha + 


0.0 


Second;- r\ Th2 re^i 


7.0 


III A EC 11 -II 


0.0 


Secondary Trl rest 


11.7 


Lung Microvascular EC 


0.0 


Primary Th 1 act 


61.1 


Lung Microvascular EC 

nr\TF;ilr*Via 4- TT 
1 IN r aipila ~ llj" 1 UCLcl 


0.0 


Primary Th2 act 


69.3 


Microvascular Dermal 
EC none 


0.0 


Primary Trl act 


90.8 


Microsvasular Dermal 


0.0 


Primary Thl rest 


20.0 


TNFalpha +* ILlbeta 


0.0 


Primary Th2 rest 


42.6 


Small airway epithelium 
none 


3.0 


Primary Trl rest 


52.5 


Small airway epithelium 
TNFalpha + 1L-1 beta 


0.0 


CD45RA CD4 
Ivmplmcvte act 


2.8 


Coronery artery SMC rest 


3.6 


CD45ROC 1)4 
lymphocyte act 


14.0 


Coronery artery SMC 
1 Ml alpha + IL-lbeta 


0.0 


CD8 lymphocyte Jt t 


5.8 


Astrocytes rest 


11.6 


Secondary ( 1 )S 
lymphocyte rest 


18.9 


Astrocytes TNFalpha + 
IL-lbeta 


0.0 


Secondary CD8 
lymphocyte act 


22.2 


KU-812 (Basophil) rest 


0.0 


CD4 lymphocyte none 


0.0 


LI -812 (Basophil) 

PMA/ionomycin 


0.0 


2ry Nil fh2 Trl anti- 


4.5 


CCD 11 06 


2.7 



683 



y^Dyj v_-Hi l 


( Keratinocytes) none 




LAK cells rest 


3.3 1 (Keratin ocytes) 

TNFalpha + IL-lbeta 


0.0 


LAK cells IL-2 


4.0 jLiver cirrhosis 


13.5 


LAK cells IL-2+IL-12 


5 .7 jLupus kidney 


A I 


LAK cells IL-2+IFN 
gamma 


21.3 |NCI-H292 none 


_ 

9.0 


LAK cells 1L-2+ IL-18 


6.7 " ""NCI-H292 II -4 


_ I4.S 


LAK cells 
PMA/ionomycin 


0.0 1NCI-H292 IL-9 


3.5 


NK Cells IL-2 rest 


6.0 lNci-H292IL-13 


0.0 


Two Way MLR 3 day 


5.0 NC1-H2Q2 I FN gamma 


5.5 


Two Way MLR 5 day 


2.3 HPAEC none 0.0 


Two Way MLR 7 day 


s , HPAEC 1 NF alpha - II - 
|1 beta 1 


PBMC rest 


0.0 jLung fibroblast none 


2.8 


PBMC PWM 


2j ^ JLung fibroblast TNF 
alpha i IL-1 beta 


0.0 


PBMC PHA-L 


20.4 Lung fibroblast IL^4 | 0.0 


Ramos (B ceil i none 


i) ') 1 un_ llbiobkist 11 


0.0 


Ramos (B cell) 
ionomycin 


2.8 JLung fibroblast IL-1 3 


0.0 


B lymphocytes PWM 


ioo o ' 1111 J 1,1 " : "'" 11 11 ' 

gamma 


1.4 


B lymphocytes CD40L 
and IL-4 


1 Q „ (Dermal fibroblast I , . . 

CCD 1070 rest i4 " 4 


EOL-1 dbcAMP 


j r iDcrmal fibroblast 

CCD ln-<! TNFalpha 


68.8 


fcUL-l abcAMF 
PMA/ionomycin 


f- 9 IDermal fibroblast 
< ( 01 070 IL-1 beta 


0.0 


Dendritic cells none 


0Q Dermal fibroblast IFN 
jgamma 


0.0 


Dendritic cells LPS 


0.0 Dermal fibroblast IL-4 


14.1 


Dendritic cells anti- 
CD40 _ 


6.0 


IBD Colitis 2 


0.0 


Monocytes rest 


0.0 


Mil) Crohn's 


0.0 


Monocytes LPS 


6.5 


Colon 


42.3 


Vlacix 'phage-- rest 


0.0 


Lung 


35.8 


Macrophages LI'S 


0.0 


Thymus 


45.4 


I II) VIC none 


0.0 


Kidney 


55.1 


HUVEC starved 


0.0 
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CNSneurodegenerationvl.O Summary: Ag3407 This panel confirms the expression of 
this gene at low levels in the brains of an independent group of individuals. However, no 
differential expression of this gene was detected between Alzheimer's diseased postmortem 
brains and those of non-demented controls in this experiment. Please see Panel 1 .4 for a 
5 discussion of the potential utility of this gene in treatment of central nervous system 
disorders. 

Generall_screening_panel_vl.4 Summary: Ag3407 Highest expression of the CG59241- 
01 gene is seen in fetal brain (CT=31.3). Furthermore, low to moderate levels of expression 
is also observed in CNS cancer cell lines (CTs=32-34). The CG59241-01 gene codes for a 

10 putative amiloride-sensitive sodium channel. A similar amiloride-sensitive sodium channel 
was shown to be highly expressed in malignant glioblastoma multiforme tumors and to be a 
characteristic feature of malignant brain tumor cells (Ref.l). Therefore, therapeutic 
modulation of the activity of the protein encoded by this gene may be beneficial in the 
treatment of CNS cancer. Significant expression is also seen in a cluster of cell lines 

15 derived from brain, colon, breast, and ovarian cancers. Therefore, therapeutic modulation 
of the activity of this gene or its protein product, through the use of small molecule drugs, 
protein therapeutics or antibodies, might be beneficial in the treatment of these cancers. 

In addition, this gene is expressed at low levels in all regions of the central nervous 
system examined, including amygdala, hippocampus, substantia nigra, thalamus, 
20 cerebellum, cerebral cortex, and spinal cord. Therefore, this gene may play a role in central 
nervous system disorders such as Alzheimer's disease, Parkinson's disease, epilepsy, 
multiple sclerosis, schizophrenia and depression. 

References: 

I. Bubien JK, Keeton DA, Fuller CM, Gillespie GY, Reddy AT, Mapstone TB, 
25 Benos DJ. (1999) Malignant human gliomas express an amiloride-sensitive Na+ 
conductance. Am J Physiol 276(6 Pt 1):C1405-10 

Panel 4D Summary: Ag3407 Highest expression Of the CG59241-01 gene is detected in 
PWM treated B lymphocytes (CT=32). Similar expression is also detected in primary 
activated Thl, Th2 and Trl cells, as well as TNF alpha treated dermal fibroblast CCD1070 
30 cells (CTs=32). Therefore, expression of this gene can be used to distinguish these samples 
from other samples in the panel. Furthermore, this gene is expressed in activated 
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lymphocytes. Likewise, no expression of this gene is seen in PBMC that contain normal B 
cells (CT=40), but it is induced when PBMC are treated with the pokeweed mitogen or 
PHA-L (CTs=34). In addition, the transcript is not seen in the B cell lymphoma Ramos 
regardless of stimulation. Therefore, the gene product could potentially be used 
5 therapeutically in the treatment of Crohn's disease, ulcerative colitis, multiple sclerosis, 
chronic obstructive pulmonary disease, asthma, emphysema, rheumatoid arthritis, lupus 
erythematosus, psoriasis and in other diseases in which T cells and B cells are activated. 

In addition, low expression of this gene is also observed in normal colon, lung, 
thymus and kidney tissues. The CG59241-01 gene encodes an amiloride-sensitive sodium 

10 channel . A similar channel, the amiloride-sensitive epithelial sodium channel (ENaC) 
constitutes the limiting step for sodium reabsorption in epithelial cells that line the distal 
nephron, distal colon, ducts of several exocrine glands and lung airways and plays an 
important role in pathophysiological and clinical conditions such as hypertension or lung 
edema. ENaC has been implicated in two genetic diseases, Liddle's syndrome and 

15 pseudohypoaldosteronism (PHA-1) (Ref.l). Therefore, antibody or small molecule 
therapies designed with the protein encoded for by CG59241-01 gene could modulate 
kidney/co lon/lung function and be important in the treatment of inflammatory or 
autoimmune diseases of these tissues in addition to hypertension, lung edema, Liddle's 
syndrom and PHA-1 . 

20 Reference. 

1. Hummler E. (1998) Reversal of convention: from man to experimental animal in 
elucidating the function of the renal amiloride-sensitive sodium channel. Exp Nephrol 1998 
Jul-Aug;6(4):265-71 

AH. CG58602-01: FAD binding domain containing protein 

25 Expression of gene CG58602-01 was assessed using the primer-probe set Ag3385, 

described in Table AHA. Results of the RTQ-PCR runs are shown in Tables AHB, AHC 
and AHD. 



Table AHA . Probe Name Ag3385 



Primers 


Sequences 


Length 


Start 
Position 


SEQID 
NO: 
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Forward Is ' -tcatgaatccaggcaaagtg 


3 ' 


20 


1 127 


.on 


p , |TET-5 1 -ttagcccacaagttccc 
rrODe TAMRA 


:tgactacgg-3 ' - 


* 


1468 


500 


Reverse j5 ' -tggcatgaagaaaagttcca 


-3 ' 


1 20 


1503 


501 



Table AHB . CNS_neurodegeneration_v 1 .0 



Tissue Name 


Rel. Exp.(%) Ag3385, 
Run 210154892 


Tissue Name 


Rel. Exp.(%) Ag3385, 
Run 210154892 


AD 1 Hiippo 


34.6 


Control (Path) 3 
Temporal Ctx 


21.2 


AD 2 Hippo 


47.6 


Control (Path) 4 
Temporal Ctx 


36.1 


AD 3 Hippo 


11.9 


AD 1 Occipital Ctx 


28.1 


AD 4 Hiippo 


.,., r ,.,„.„ T .^. m ._^ Tm ^ 

24.3 


AD 2 Occipital Ctx 
(Missing) 


0.0 


AD 5 H ippo 


56.3 


AD 3 Occipital Ctx 


15.0 


AD 6 Hippo 


" " 63.3 " 


AD 4 Occipital Ctx 


i4 V 


Control 2 Hippo 


42.6 


AD 5 ( kxipital ( t.\ 


■O 1 


Control 4 I lippo 


24.7 


AD 6 Occipit al Ctx 


25.3 


Control (Path) 3 
Hippo 


23.3 


Control 1 Occipital 
( i 


14.3 


M) 1 1 1 mporal 
Ctx 


23.8 


Control 2 Occipital 
Ctx 


69.3 


AD 2 Temporal 
Ctx 


73.7 


Control 3 Occipital 
Ctx 


29.5 


AD 3 Temporal 
Ctx 


7.3 


Control 4 Occipital 
Ctx 


14.9 


AD 4 Temporal 

I In 


39.0 


Control (Path) 1 
Occipital Ctx 


68.3 


AD 5 Inf Temporal 
Ctx 


100.0 


Control (Path) 2 
Occipital Ctx 


11.0 


AD 5 Sup 
Temporal Ctx 


55.5 


Control (Path) 3 
Occipital Ctx 


8.9 


AD 6 Inf Temporal 

Ux 


64.2 


Control (Path) 4 
Occipital Ctx 


17.3 


AD 6 Sup 
Temporal Ctx 


54.0 


Control 1 Parietal 
Ctx 


32.8 


Control 1 
Temporal Ctx 


23.8 


Control 2 Parietal 
Ctx 


62.0 


Control 2 
Temporal Ctx 


50.3 


Control 3 Parietal 
Ctx 


33.4 


Control 3 
Temporal Ctx 


38.4 


Control (Path) 1 
Parietal Ctx 


70.7 


Control 3 


19.2 


Control (Path) 2 


31.4 
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remporal < rx 
tcontroUPath) 1 
[Temporal Ctx 


56.6 


Parietal Ctx 

Control (Path) 3 ?n Q 
Parietal Ctx 


(Control (Path) 2 
Temporal Ctx 


47.6 


Control (Path) 4 . 
Parietal Ctx I 



Table AHC . General_screening_panel_vl.4 





Tissue Name 

\ 


Rel. Exp.(%) 
Ag3385, Run 
217043538 


Tissue Name 


Rel. Exp.(%) 
Ag3385, Run 
217(143538 




Adipose 


2.4 


Renal ca. TK-10 


... „ 




Melanoma* 
Hs688(A).T 


0.7 


Bladder 


6.6 




Melanoma* 


1.1 


Gastric ca. (liver met.) 


2.1 




Hs688(B). 1 


NCI-N87 


y 


Melanoma" M 14 


0.9 


Gastric ca. KATO III 


0.9 




Melanoma* 
LO\l\I\ 1 


1.3 


Colon ca. S W-948 


4.5 




Melanoma* SK- 
MEL-5 


2.2 


Colon ca. S W480 


0.8 




— 

Squamous cell 
carcinoma SCC-4 


0.1 


Colon ca (SW4M) 
me! s SW620 


1.3 




Testis Pool 


1 * 


Colon ca. HT29 


0.6 


(Prostate ca.* (bone 
|met) PC -3 


5.8 


L-Olon ca. hlL, l - 1 1 o 


1.9 




Prostate Pool 


4.0 


Colon ca CaCo 2 


28.5 




Placenta 


2.5 


( -i. -a cancer tissue 


2.0 




Uterus Pool 


0.5 


Colon ca. SW1116 


0.9 




Ovarian ca. 
OVCAR-3 


1.1 


Colon ca. Colo-205 


3.5 




Ovarian ca. SK- 
OV-3 


3.7 


Colon ca. S W-48 


4.2 




Ovarian ca. 
OVCAR-4 


0.2 


Colon Pool 


3.0 




Ovarian ca. 
OVCAR-5 


42.0 


Small Intestine Pool 






Ovarian ca. 
IGROV-1 


8.0 


Stomach Pool 


1.8 




Ovarian ca. 
OVCAR-8 


2.7 


Bone Marrow Pool 


0.9 




Ovary 


3.3 


Foul Heatt 


12.9 




Breast ,a. MCT-7 


10.3 


Heart Pool 


8 ; 




Breast ca. MDA- 
MB-231 


3.0 


Lymph Node Pool 


I 
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Breast ca. BT 549 




1 .3 IFetal Skeletal Muscle 




2.6 


Breast ca. T47D 




100.0 


Skeletal Muscle Pool 




25.5 


Breast ca. MDA-N 




0.4 


Spleen Pool 




' 0.2~ 


Breast Pool 


3.1 


! hymns Pool 


2.7 


Trachea 


3.2 


CNS cancer 
(glio/astro) U87-MG 


4.0 


Lung 


2.9 


CNS cancer 
(glio/astro) U-118-MG 


1.3 


Fetal Lung 


3.0 


CNS cancer 
(neuro;met) SK-N-AS 


1.8 


Lung ca. NCI-N417 


0.2 


CNS cancer (astro) SF- 
539 


1.3 


Lung ca. LX-1 


1.1 


CNS cancer (astro) 
SNB-75 


0.9 


Lung ca.NCI-H146 


0.4 


CNS cancer (glio) 
SNB-19 


5.0 


Lung ca. SHP-77 


3.1 


( N'< cancer (glio) SF- 
295 


5.5 


Lung ca. A549 


4.3 


Brain (Amygdala) Pool 




Lung ca. NL-l-hDzo 




0.4 


Brain (cerebellum) 




13 5 


JLung ca. JNd-riZ.5 


6.8 IBrain (fetal) 


5.6 


Lung ca. NCI-H460 




1.5 


Brain (Hippocampus) 

£22! , 


5.2 


Lung ca. HOP-62 




0.1 


Cerebral Cortex Pool 


?| 


Lung ca. NCI-H522 




3.6 


Brain (Substantia 
nigra) Pool 


11.5 


Liver 


_ 


13.6 


Bi tin < 1 hil liUti^s Pool 


7.2 


Fetal Liver 




12.0 


Brain (whole) 


7.2 


Liver cai. Hep( i \ 




2.7 


Spinal Core! Pool 


4.8 


Kidney Pool 


6.2 


Vdrenal ( ihrid 


6.0 


tFetal Kidney 


4.0 


Pituitary gland Pool 


1.7 


{Renal ca7786-0 


02 


Salivary Gland 


6.6 


Renal z i \ n>x 


1.4 


Thyroid (female) 


5.2 


jRenal ca. ACHN 




0.8 


Pancreatic ca. 
CAPAN2 


3.5 


Renal ca. UO-3 I 


— 


0.9 


Pancreas Pool 


4.4 


Table AHD. Panel 4D 


Tissue Name 


Rel. Exp.(%) 
Ag3385, Run 
165296471 


Tissue Name 


Rel. Exp.(%) 
Ag3385, Run 
165296471 


Secondar> Till aa 


1.2 


HUVEC IL-lbeta 


0.0 


Secondary Th2 act 


3.6 


HUVEC I FN gamma 


3.7 
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Secondary Trl act 


2.6 


HUVEC TNF alpha - 
IFN gamma 


0.7 


Secondary Th 1 rest 


0.4 


HUVEC TNF alpha -. 
IL4 


2.2 


Secondary 1 Ii2 rest 


0.9 


HUVEC IL- 11 


1.3 


Secondary Trl rest 


0.4 


Lung Microvascular EC 
none 


3.2 


Primary Thl act 


1.1 


Lung Microvascular EC 
I NI alpha IL-lbeta 


1.5 


Primary Th2 act 


0.7 


Microvascular Dermal 
EC none 


3.0 


Primary Trl act 


0.0 


Microsvasular Dermal 
EC TNFalpha + IL-lbeta 


0.0 


Primary Thl rest 


1.1 


Bronchial epithelium 
TNFalpha + IL1 beta 


0.6 


Primary Th2 rest 


05 


Small airway epithelium 
none 


0.7 


Primary Trl rest 


0 6 

' - - 


Small airway epithelium 
TNFalpha + 1L-1 beta 


0.8 


CD45RA CD4 
lymphocyte act 


2.0 


Coronery artery SMC rest 


0.5 


CD45RO CD4 
lymphocyte act 


3.7 


Coronery artery SMC 
TNFalpha + IL-lbeta 


2.0 


CDS lymphocyte act 


0.9 


Astrocytes rest 


1.5 


Secondary CD8 
l\ mphocyte rest 


2.7 


Astrocytes TNFalpha + 
IL-lbeta 


2.6 


Secondary CD8 
lymphocyte act 


0.0 


JvU-olz (rJasopnilj rest 


3.0 


CD4 lymphocyte none 


0.0 


KU-8 12 (Basophil) 
PMA/ionomycin 


8.2 


2ry Thl/Th2/Trl_anti- 
CD95 CHI 1 


0.0 


CCD1106 

(keratinocytes) none 


3.3 


LAK cells rest 


9.4 


CCD 1106 

(Keratinocytes) 

1 Nr alpha + IL-lbeta 


0.3 


JLAJs. cells IL-Z 


0.8 


Liver cirrhosis 


7.9 


JLAJs. cells lL-z+l.L-12 


1.5 


Lupus kidney 


1.5 


LAls. cells lL,-Z+lr IN 

gamma 


3.7 


NCI-H292 none 




LAK cells IL-2+ IL-18 




NCI-H292 IL-4 


8.4 


LAK cells 
PMA/ionomycin 


2.0 


NCI-H292 IL-9 


2.6 


NK Cel ls IL-2 rest 


0.7 


NCI-H292 IL-13 


2.9 


Two Way MLR 3 day- 


4.6 


NCI-H292 IFN gamma 


1.8 


Two Way MLR 5 day 


2.8 


HPAEC none 


L.. 23 
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Two Way MLR 7 day 


i ll'AEC TNF alpha + IL- 

18 libera 


1.9 




PBMC rest 


0.6 


Lunsj tibivhla^t none 


Ts 


jPBMC PWM 


11.0 


|Lung fibroblast TNF 
alpha + IL-1 beta 




0.7 




PBMC PHA-1 


2.3 


(Lung fibroblast IL-4 


L6 




Ramos (B cell) none 


0.0 


JjLung fibroblast IL-9 


2.0 




Ramos (B cell) 
ionomycin 


0.9 


JLung fibroblast IL-1 3 


0.9 




B lymphocytes PWM 


3.5 


ILung fibroblast IFN 
gamma 


0.7 


\± 


B lymphocytes CD40L 
and IL-4 


5.4 


Dermal fibroblast 
|CCD1070 rest 


1.6 






EOL-1 dbcAMP 


5.0 


Dermal fibroblast 
CCD 1070 TNF alpha 


0.0 




T7*~»T 1 AUr- A \AT> 
tZKJLy-l UDCAJVLr 

FM \ ionomycin 


1.2 


Dermal fibroblast 
1CCD1070 IL-1 beta 


2.3 




Dendritic cells none 


15.5 


jUermai tiDroDiast lrJN 
i gamma 


0.5 




Dendrit c cells 1 PS 


4.5 


Dermal fibroblast IL-4 


0.4 




Dendritic cells anti- 
CD40 


11.7 


jlBD Colitis 2 


0.3 




Monoe\ tes rest 


8.7 


IBD Crohn's 


°:° 




Monocytes LPS 


0.6 


Colon 


5.1 




Macrophages rest 


13.5 


Lung 


6.7 






Macrophages LPS 


__" 1 (. 


Thymus 


1 00.0 


z 




HUVECnone 


0.6 


|Kidney 


,1 ; 






HUVEC starved 


1.7 


f . ■ 


[1 





CNS_neurodegeneration_vl.0 Summary: Ag3385 This panel confirms the expression of 
CG58602-01 gene at low levels in the brains of an independent group of individuals. 
However, no differential expression of this gene was detected between Alzheimer's 
diseased postmortem brains and those of non-demented controls in this experiment. Please 
5 see Panel 1 .4 for a discussion of the potential utility of this gene in treatment of central 
nervous system disorders. 

General_screening_panel_vl.4 Summary: Ag3385 Highest expression of the CG58602- 
01 gene is seen in a breast cancer cell line (CT=26.3). Significant expression is also seen in 
an ovari an cancer cell line. Thus, expression of this gene could be used to differentiate 
1 0 between these samples and other samples on this panel and as a marker to detect the 
presence of breast and ovarian cancers. Furthermore, therapeutic modulation of the 
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expression or function of this gene may be effective in the treatment of breast and ovarian 
cancers. 



Among tissues with metabolic function, this gene is expressed at moderate to low 
levels in pituitary, adipose, adrenal gland, pancreas, thyroid, and adult and fetal skeletal 
5 muscle, heart, and liver. This widespread expression among these tissues suggests that this 
gene product may play a role in normal neuroendocrine and metabolic and that disregulated 
expression of this gene may contribute to neuroendocrine disorders or metabolic diseases, 
such as obesity and diabetes. 

Expression of this gene is higher in fetal skeletal muscle (CT=28.3) when compared 
10 to expression in adult skeletal muscle (CT=31.5). Thus, expression of this gene could be 
used to distinguish fetal from adult skeletal muscle. 

In addition, this gene is expressed at high levels (CTs=29-30.4) in all regions of the 
central nervous system examined, including amygdala, hippocampus, substantia nigra, 
thalamus, cerebellum, cerebral cortex, and spinal cord. Therefore, this gene may play a role 
1 5 in central nervous system disorders such as Alzheimer's disease, Parkinson's disease, 
epilepsy, multiple sclerosis, schizophrenia and depression. 

Panel 4D Summary: Ag3385 Highest expression of the CG58602-01 gene is seen in the 
thymus (CT=28). Thus, the putative protein encoded for by this gene could therefore play 
an important role in T cell development. Therefore, small molecule therapeutics designed 
20 against the proetin encoded by this gene could be utilized to modulate immune function (T 
cell development) and be important for organ transplant, AIDS treatment or post 
chemotherapy immune reconstitiution. 

AI. CG58468-01: Serum Amyloid P Component 

Expression of gene CG58468-01 was assessed using the primer-probe set Ag3356, 
25 described in Table AIA. Results of the RTQ-PCR runs are shown in Table AIB. 



Table A IA. Probe Name Ag3356 



Primers 


Sequences 


Length 


Start 
Position 


] SEQ ID 
1 NO: 


forwarc 


5 


-aggcatttattttccctcaaga-3 ' 


22 | 


106 


I 502 1 
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'Probe 


TET-5 ' -agtctatgtgtccctgatccccaagg-3 1 - 
TAMRA 


26 


137 


503 1 


Reverse 


5 ' -gttttcaggcaaagcttgaagt-3 ' 


22 


181 


504 | 



Table AIB . General_screening_panel_vl.4 





Tissue Name 


Rel. Exp.(%) 
Ag3356, Run 
21()5234"6 


Tissue Name 


Rel. Exp.(%) 
Ag3356, Run 
216523476 


Adipose 


2.2 


Renal ca. TK-10 


0.0 


Melanoma* 
Hs688(A).T 


0.0 


Bladder 


0.0 


Melanoma* 

XlaDOO^D J. 1 


0.0 


— , 

Gastric ca. (liver met.) 

MPT >J87 


0.0 


Melanoma* Ml 4 


0.0 


Gastric ca. KATO III 


0.0 


Melanoma* 
LOXIMVI 


0.0 


Colon ca. SW-948 


0.0 


Melanoma* SK- 
MEL-5 


0.0 


Colon ca. SW480 


0.0 


Squamous cell 
carcinoma SCC-4 


0.0 


( "olon ca. :| (SW4S0 
met) SW620 


■ 

0.0 


Testis Pool 


1 7 


Colon ca. HT29 


0.0 


Prostate ca.* (bone 
met) PC-3 


0.0 


Colon ca. HCT-116 


0.0 


Prostate Pool 


0.0 


Colon ca. CaCo-2 


0.0 


Placenta 


0.0 


Colon cancer tissue 


0.0 


Uterus Pool 


0.0 


Colon ca. SWIM 6 


0.0 


Ovarian ca. 
OVCAR-3 


0.0 


Colon ca. Colo-205 


0.0 


Ovarian ca. SK- 
OV-3 


0.0 


Colon ca. SW-48 


0.0 


Ovarian ca. 
OVCAR-4 


0.0 


Colon Pool 


100.0 


Ovarian ca. 
OVCAR-5 


0.0 


Small Intestine Pool 


5.6 


Ovarian ca. 
IGROV-1 


0.0 


Stomach Pool 


0.0 


Ovarian ca. 
OVCAR-8 


0.0 


Bone Marrow Pool 


10.7 


Ovary 


------- 


Fetal 1 leart 


67o 


Breast ca. M( F-7 


0 u 


1 leart Pool 


2.6 


Breast ca. MDA- 
MB-231 


0.0 


Lymph Node Pool 


25.9 


Breast ca.Bl' 549 


0.0 


Fetal skeletal Muscle 


2.1 


Breast ca. T47D 


0.0 


Skeletal Muscle Pool 


0.0 
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tireast ca. MDA-N 


0.0 jSpleenPool 


0.0 


Breast Pool 


l ( '.<» rhymus Pool 


0.0 


Trachea 


, , 1CNS cancer 

(glio/astro) U87-MG 


0.0 


Lung 


n 0 CNS cancer 

|(glio/astro) U-l 1 8-MG 


0.0 

_ 


Fetal Lung 


, n 1CNS cancer 

|(neuro;met) SK-N-AS 


0.0 


Lung ca. NCI-N417 


Q 0 f CNS cancer (astro) SF- 

_ - ™ ^ J* 39 


0.0 


Lung ca. LX-1 


n „ jCNS cancer (astro) 
SNB-75 


0.0 


Lung ca.NCI-H146 


q q jCNS cancer (glio) 


0.0 


Lung ca. SHP-77 

j ■ 


0 0 1CNS cancer (glio) SF- 

|295 


0.0 


Lung ca. A549 


0.0 |Brain (Amygdala) Pool 


0.0 


i una m NPI US "V-> 


0.0 jBrain (cerebellum) 




l^Uilg La, IN v^l-n.*-J> 


38.7 [Brain (fetal) 


— 

2.6 


Lung ca. NCI-H460 


„ „ (Brain (Hippocampus) 
Pool 


0.0 


Lung ca. 1 K )P-62 
Lung ca. NCI-H522 


0.0 jCerebral Cortex Pool 
q q jBrain (Substantia 
(nigra) Pool 


0.0 


0.0 


Liver 


2 . 3 (Brain (Thalamus) Pool 


0.0 


Fetal Li ver 


0.0 


Brain (whole) 


(' o 


[Liver ca. HepG2 


0.0 


Spinal Cord Pool 


2.1 


[Kidney Pool 


19.1 


\drenal Gland 


0.0 


Fetal Kidney 


0.0 


Piuij<ar\ gland Pool 


2-1 


Renal ca. 786-0 


0.0 


^.i r n Gland 


0.0 


RlmuI l.i \4 1 >S 


0.0 


Thyroid (female) 


0.0 


Renal ca. ACHN 


0.0 


Pancreatic ca. 
CAPAN2 


0.0 


Renal ca. UO-3 1 


0.0 jPancreas Pool 


12 „ . 



CNSneurodegenerationvl.O Summaty: Ag3356 Expression of the CG58468-01 gene 
is low/undetectable in all the samples on this panel. (CTs>35). (Data not shown.) 



General_screening_panel_vl.4 Summary: Ag3356 Expression of the CG58468-01 gene 
is restricted to the colon (CT=34). Thus, expression of this gene could be used to 
differentiate between this sample and other samples on this panel. 
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Panel 4B Summary: Ag3356 Results from one experiment with the CG56003-01 gene 
are not included. The amp plot indicates that there were experimental difficulties with this 
run. 

AJ. CG58183-01: N-METHYL-D-ASPARTATE RECEPTOR 

Expression of gene CG581 83-01 was assessed using the primer-probe set Ag3355, 
described in Table AJA. Results of the RTQ-PCR runs are shown in Tables AJB, AJC and 
AJD. 



Table AJA . Probe Name Ag3355 



Primers 


Sequences 


Length 


Start 
Position 


SEQID 
NO: 


Forward! 


5 ' -gctggccaactctgtctagac-3 ' 


21 


1617 


505 


Probe 


TET-5 ' -tgactcttccacattggacagccttt-3 ' - 
TAMRA 


26 


1 640 


506 


Reverse 


5 ' -ttactgctatggaggctgctaa-3 ' 


22 


1675 


507 



Table AJB . CNS_neurodegeneration_vl.0 



Tissue Name 


Rel. Exp.(%) Ag3355, 
Run 210142850 


Tissue Name 


Rel. Exp.(%) Ag3355, 
Run 210142850 


AD 1 Hippo 


17.7 


Control (Path) 3 
Temporal Ctx 


7.3 


AD 2 Hippo 


27.4 


Control (Path) 4 
Temporal Ctx 


47.6 


AI) 3 I fippo 


8.8 


AD 1 Occipital Ctx 


18.8 


AD 4 Hippo 


16.2 


AD 2 Occipital Ctx 
(Missing) 


0.0 


\D 5 Hippo 


53.6 


AD 3 Occipital Ctx 


3.0 


\D 6 Hippo 


51.4 


AD 4 Occipital Ctx 


27.2 


Control 2 Hippo 


41.8 


AD 5 Occipital Ctx 


, 55J 1 


Control 4 Hippo 


10.4 


AD 6 Occipital Ctx 


6.3 


Control (Path) 3 
Hippo 


4.5 


Control 1 Occipital 
Ctx 


2.8 


AD 1 Temporal 
Ctx 


18.3 


Control 2 Occipital 
Ctx 


39.0 


AD 2 Temporal 
Ctx 


48.0 


Control 3 Occipital 
Ctx 


18.2 


AD 3 Temporal 
Ctx 


5.7 


Control 4 Occipital 
Ctx 


3.4 


AD 4 Temporal 


15.2 


Control (Path) 1 


81.8 
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Ctx 






Occipital Ctx 




AD 5 Inf Temporal 

Ctx 1 


61.6 




Control (Path) 2 
Occipital Ctx 


9.0 


AD 5 Sup 
Temporal Ctx 


69.3 


Control (Path) 3 
Occipital Ctx 


0.0 


AD 6 Inf Temporal 
Ctx 


66.9 


Control (Path) 4 
Occipital Ctx 




13.3 


AD 6 Sup 
Temporal Ctx 


62.9 


Control 1 Parietal 
Ctx 




6.6 


Control 1 
Tempor.il Ctx 


8.5 


Control 2 Parietal 
( t\ 


74.7 


Control 2 
Temporal Ctx 


66.9 


Control 3 Parietal 
Ctx 


21.0 


Control 3 
Temporal Ctx 


34.9 


Control (Path) 1 
Parietal Ctx 


100.0 


Control 3 
Temporal Ctx 


7.0 


Control (Path) 2 
Parietal Ctx 


21.9 


Control (Path) 1 
Temporal Ctx 


90.1 


Control (Path) ? 
Parietal C tx 


6.0 


Control (Path) 2 
Temporal Ctx 


74.7 


Control (Path) 4 
Parietal Ctx 


50.7 




Table A JC. General screening_panel vl.4 


Tissue Name 


Rel. Exp.(%) 
Ag3355, Run 
216523475 


Tissue Name 


Rel. Exp.(%) 
Ag3355, Run 
216523475 


Adipose 


0.0 


Renal ca. TK-10 


0.0 


Melanoma* 
Hs688(A).T 


0.9 


Bladder 


0.0 


Melanoma* 
Hs688(B).T 


0.0 


Gastric ca. (liver met.) 
NCI-N87 


0.0 


Melanoma* Ml 4 


0.0 


Gastric ca. KATO III 


0.0 


Melanoma* 
LOXIMVI 


0.0 


Colon ca. SW-948 




0.0 


Melanoma* SK- 
MEL-5 


0.0 

1 - 


Colon ca. SW480 




0.2 


Squamous cell 
carcinoma S( C-4 


0.0 


Colon ca.* (SW480 
met) SW620 


0.0 


i e -,.;s Paol 


- }± 


( !olon ca. HT29 


0.0 


Prostate ca.* (bone 
met) PC -3 


0.0 


Colon ca. HCT-116 


0.0 


Prostate Pool 


2.1 


Colon ca. CaCo-2 


0.0 


Placenta 


"oVo 


< il m cancer tissue 


0.0 


1 'terns F'oi >1 


0.0 


Colon ca. SW 1116 


0.0 
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Ovarian ca. 
OVCAR-3 


0.4 


Colon ca. Colo-205 


0.0 


Ovarian ca. SK- 
OV-3 


0.3 


Colon ca. SW-48 


0.0 


Ovarian ca. 
OVCAR-4 


0.0 


Colon Pool 


2.2 


Ovarian ca. 
OVCAR-5 


0.0 


Small Intestine Pool 


3.4 


Ovarian ca. 

TOR O V.. 1 
lvJIxW V - 1 


0.0 

... .. 


Stomach Pool 


2.4 


Ovarian ca. 
OVCAR-8 


0.0 




Bone Marrow Pool 


1.5 


Ovary 


4.4 


Fetal Heart 


1.9 


Breast ca. MCF-7 


0.0 


Heart Pool 


3.6 


Breast ca. MDA- 
MB-231 


0.0 


Lymph Node Pool 


2.9 


Breast ca. BT 549 


0.0 


Fetal Skeletal Muscle 


1.8 


Breast ca. T47D 


0.0 


Skeletal Muscle Pod 


0.0 


Breast ca. MDA-N 


0.0 


Spleen Pool 


4.3 


Breast F'ool 


"""5.7 


Thymus Pool 


4 7 


Trachea 


1.7 


CNS cancer 
(glio/astro) U87-MG 


0.0 


Lung 


1.6 


CNS cancer 
(glio/astro) U-118-MG 


0.1 


Fetal Lung 


0.0 


CNS cancer 
(neurojmet) SK-N-AS 


3, 


Lung ca. NCI-N417 


17.8 


CNS cancer (astro) SF- 
539 


14.8 


Lung ca. LX-1 


0.0 


CNS cancer (astro) 
SNB-75 


17.6 


Lung ca. NCI-H146 


4.0 


CNS cancer (glio) 
SNB-1 9 


0.0 


Lung ca. SHP-77 


11.7 


CNS cancer (glio) SF- 
295 


0.0 


Lung ca. A549 


——————— 


Brain (Amygdala) Pool 




T nno not MPT 




Brain (cerebellum) 




Lung ca. NCI-H23 


1.9 


Brain (fetal) 


100.0 


Lung ca. NCI-H460 


0.3 


Brain (Hippocampus) 
Pool 


33.0 


Lung ca. HOP-62 


0.3 


Cerebral Cortex Pool 


42.3 


Lung ca. NCI-H522 


0.0 


Brain (Substantia 
nigra) Pool 


43.8 


Liver 


0.2 


Brain (Thalamus) Pool 


50.7 


Fetal Liver 


0.4 


Brain (whole) 


71.2 
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(Liver ca. HepG2 


0.0 


Spinal Cord Pool 


15.0 J 


(Kidney Pool 


1.4 


Adrenal Gland 


0.0 


Fetal Kidney 


7.2 


Pituitarj i-lond Pool 


0.0 


(Renal ca. 786-0 


0.0 


Salivary Gland 


0.1 


Renal ca. \4 l )8 


0.0 


1 1 1 - n •id female) 


0.1 


jRenal ca. ACHN 


0.0 


Pancreatic ca. 
CAPAN2 


0.0 


Renal ca. [<>-*! 


0.0 


Pancreas Pool 


2.7 



Table AJD . Panel 4D 



Tissue Name 


Rel. Exp.(%) 
Ag3355, Run 
165241988 


Tissue Name 


Rel. Exp.(%) 
Ag3355, Run 


Secondary 1 h 1 act 


0.0 


HUVEC IL-lbeta 


0.0 


Secondary 1 h2 act 


0.0 


HH VI t IFN gamma 


0.0 


Secondary Trl act 


0.0 


HUVEC TNF alpha t- 
IFN gamma 


0.0 


Secondary Thl rest 


0.0 


III IV1 C TNF alpha + 
IL4 


0.0 


Secondary Th2 rest 


11.8 III VI C II -1 1 


0 (i 


Secondary Trl rest 


0.0 


Lung Microvascular EC 
none 


0.0 


Primary Thl act 


0.0 


Lung Microvascular EC 
TNFalpha + IL-lbeta 


0.0 


Primary Th2 act 


q q Microvascular Dermal 
EC none 


0.0 


Primary Trl act 


n „ JMicrosvasular Dermal 

EC r NF alpha + IL-lbeta 


0.0 


Primary Thl rest 


„ „ Bronchial epithelium 
1'NPalpha + 11 I beta 


0.0 


Primary Th2 rest 


0.0 


Small airway epithelium 
none 


0.0 


Primary Trl rest 


0 0 (Small airway epithelium 
TNFaIpha + IL-lbeta 


0.0 


CD45RA CD4 
lymphocyte act 


0.0 


Coronery artery SMC rest 


0.0 


CD45RO CD4 
lymphocyte act 


0.0 


Coronery artery SMC 
TNFalpha + IL-lbeta 


0.0 


CD8 lymphocyte act 


0.0 


Astrocytes rest 


0.0 


Secondary CD8 
lymphocyte rest 


„ „ .Astrocytes TNFalpha + 
IL-lbeta 


57.8 


Secondary CD8 
lymphocyte act 


0 0 


KU-812 (Basophil) rest 


0.0 



CD4 lymphocyte none 


0.0 


1KU-8 12 (Basophil) 
I'M \ ionomycin 


67.4 


2n ! hi Th2 Trl_anti- 
CD95 CHI 1 


0.0 


CCD1106 

((Keratinocytes) none 


0.0 


LAK cells rest 


0.0 


[CCD1106 

1 (Keratinocytes) 

TNFalpha + IL-lbeta 


0.0 


i ■. il 

1V\1V Cells llu-Z 


0.0 


(Liver cirrhosis 


39 8 


L . / V rv Cells 1 £>l 1 j- l A 


0.0 


jLupus kidney 


- - °'° - 


i^/\js. cens lij-ZT^iriN 
gamma 


0.0 




NCI-H292 none 


0.0 




i . / \ is. cello [ [ . i 1 1 ^ i o 


0.0 




NCI-H292 IL-4 


- — — 2£ — _ 


1j/\jv cens 
P\IA 'ionoimein 


0.0 


! 

!NCI-H292 IL-9 

I 


0.0 


iniv L-euis rest 


0.0 


NCI-H292 IL-13 


0 


i wo w <iy ivms. j uay 


0.0 


NCI-H292 IFN gamma 




l wo way mjlk j day 


0.0 


HPAEC none 


— 


Two Way MLR 7 day 


0.0 




HPAEC TNF alpha + IL- 
1 beta 


0.0 


PBMC rest 


0.0 


Lung fibroblast none 


0.0 


PBMC PWM 


0.0 


Lung fibroblast TNF 
alpha + Il.-I beta 


0.0 


[PBMC PHA-L 


0.0 


Lung fibrobl ist II - I 


0.0 


Ramos (B cell) none 


0.0 




Lung fibroblast (L-9 


0 0 


Ramos (B cell) 
ionomycin 


0.0 


|Lung fibroblast IL-13 


0.0 


B lymphocytes PWM 


0.0 




Lung fibroblast IFN 
gamma 


0.0 


B lymphocytes CD40L 
and IL-4 


0.0 


f Dermal fibroblast 
CCD 1070 rest 


0.0 


EOL-1 dbcAMP 


0.0 




Dermal fibroblast 
CCD 1070 TNF alpha 


0.0 


EOL-1 dbcAMP 
PMA/ionomycin 


0.0 




Dermal fibroblast 
CCD1070IL-1 beta 


0.0 


Dendritic cells none 


0.0 




Dermal fibroblast IFN 
gamma 


0.0 


Dendritic cells LPS 


0.0 




Dermal fibroblast IL-4 


0.0 


Dendritic cells anti- 
CD40 


0.0 


JIBD Colitis 2 


0.0 


Monoc> les rest 


0.0 




IBD Crohn's 


0.0 


Monocytes LPS 


0.0 




Colon 


12.7 


Macrophages rest 


0.0 




Lung 


15 2 


[Macrophages LPS 


0.0 




Thymus 


100.0 


§[UVEC none 


0.0 




Kidney 


73.2 
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CNSneurodegenerationvl.O Summary: Ag3355 This panel confirms the expression of 
CG58 183-01 gene at low levels in the brains of an independent group of individuals. 
However, no differential expression of this gene was detected between Alzheimer's 
diseased postmortem brains and those of non-demented controls in this experiment. Please 
5 see Panel 1 .4 for a discussion of the potential utility of this gene in treatment of central 
nervous system disorders. 

General_screening_panel_vl.4 Summary: Ag3355 Highest expression of CG58183-01 
gene is detected in fetal brain (Ct=29.2). In addition, this gene is expressed at high levels in 
all regions of the central nervous system examined, including amygdala, hippocampus, 
10 substantia nigra, thalamus, cerebellum, cerebral cortex, and spinal cord (CTs= 29-32). 
Therefore, this gene may play a role in central nervous system disorders such as 
Alzheimer's disease, Parkinson's disease, epilepsy, multiple sclerosis, schizophrenia and 
depression. 

This gene codes for N-methyl-D-aspartate (NMDA) receptor 3A protein. In cats 
1 5 and rhodent models competitive NMDA receptor antagonists, such as D-(E)-4-(3- 

phosphonoprop-2-enyl)piperazine-2-carboxylic acid, which act at the neurotransmitter 
recognition site were shown to be effective in reducing ischaemic brain damage when 
administered prior to the onset of an ischaemic episode (Ref. 1). Therefore, therapeutic 
modulation of the activity of the protein encoded by this gene may be beneficial in the 
20 treatment of ischaemic brain. 

Among tissues with metabolic or endocrine function, this gene is expressed at low 
levels in pancreas, heart, and the gastrointestinal tract. Therefore, therapeutic modulation of 
the activity of this gene may prove useful in the treatment of endocrine/metabolically 
related diseases, such as obesity and diabetes. 

25 Furthermore, low to moderate expression of this gene is detected in lung cancer, 

and CNS cancer cell lines. Therefore, therapeutic modulation of the activity of this gene or 
its protein product, through the use of small molecule drugs, protein therapeutics or 
antibodies, might be beneficial in the treatment of lung cancer or CNS cancer. 



References: 
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I. McCulloch J. (1991) Ischaemic brain damage— prevention with competitive and 
non-competitive antagonists of N-methyl-D-aspartate receptors. Arzneimittelforschung 
41(3A):319-24. 

Panel 4D Summary: Ag3355 Expression of the CG581 83-01 gene is limited to a few 
5 samples, with highest expression in the thymus (CT=33.5). Thus, expression of this gene 
may be useful as a marker of thymic tissue. Low, but significant levels of expression are 
also seen in the kidney, in TNF -alpha and IL-1 beta treated astrocytes and in the 
PMA/ionomycin treated basophil cell line KU-812. Thus, this gene product may be 
involved in the normal homeostasis of this tissue. Therefore, agonistic antibodies or protein 
1 0 therapeutics may be important in the treatment of inflammatory or autoimmune diseases 
that affect the kidney, including lupus and glomerulonephritis. In addition, the expression 
of this transcript in astrocytes treated with TNF-a and IL-1 indicates that therapeutics 
designed against the protein encoded by this gene may be useful for the treatment of 
inflammatory CNS diseases such as multiple sclerosis. 

15 AK. CG59315-01: connexin 

Expression of gene CG593 15-01 was assessed using the primer-probe set Ag3542, 
described in Table AKA. Results of the RTQ-PCR runs are shown in Tables AKB and 
AKC. 

Table AKA . Probe Name Ag3542 



jPrimers 


Sequences 


Length 


Start 
Position 


SEQID 
NO: 


j Forward 


5 ' -ggacacctcccaacctagatc-3 ' 




1024 


508 


Probe 


TET-5 • -tacctgtcttccttccttgaggctgg-3 ' - 
TAMRA 


: " 


1046 


509 


(Reverse 


5 ' -ttgcattcttgtgtccatgag-3 ' 




1081 


510 



20 Table AKB . General_screening_panel_v 1 .4 



I — - 

j Tissue Name 


Rel. Exp.(%) 
Ag3542, Run 
217049297 


Tissue Name 


Rel. Exp.(%) 
Ag3542, Run 
217049297 


Adipose 


17.3 


Renal ca. TK-10 


6.8 


jMelanoma* 
Hs688(A).T 


0.4 


Bladder 


2.5 


JMelanoma* 


1.0 


Gastric c i ( [ivei mt t ) 


13.1 



701 



Hs6K8(J3s. 1 
Melanoma* Ml 4 


~ 

12.2 


M 1 NS" 

Gastric ca. KATO 111 


12.2 


Melanoma* 
LOXIMVI 


0.0 


Colon ca. SW-948 


3.8 


Melanoma* SK- 

_ _ „ 


0.7 


Colon ca. SW480 


39.0 


Squamous cell 
carcinoma SCC-4 


2.0 


colon ca. (5>W4oU 
met) SW620 


6.7 


Testis Pool 


0.3 


C olon ca H 1 29 


2.3 


Prostate ca.* (bone 
met) PC-3 


0.0 


Colon ca. HCT-116 


17.7 


Prostate Pool 


0.0 


Colon ca. CaCo-2 


2.8 


Placenta 


1.2 


Colon cancer tissue 


1.6 


t tcrus Pool 


0.0 


Colon ca SWT 116 


0.3 


Ovarian ca. 
OVCAR-3 


6.3 


Colon ca. Colo-205 


0.3 


Ovarian ca. SK- 
OV-3 


2.5 


Colon ca. SW-48 


0.0 


Ovarian ca. 
OVCAR-4 


0.0 


Colon Pool 


1.7 


Ovarian ca. 
OVCAR-5 


25.0 


Small Intestine Pool 


6.3 


Ovarian ca. 
HjKUV-1 


6.4 


Stomach Pool 


5.4 


Ovarian ca. 
OVCAR-8 


0.0 


Bone Marrow Pool 


3.1 




0.6 


Fetal Heart 


■ -1 

1.7 


Breast ca. MCF-7 


12.9 


Heart Pool 




Breast ca. MDA- 
MB-231 


5.0 


Lymph Node Pool 


3.6 


Breast ca. B r 549 


8.7 fetal Skeletal Muscle 


0.0 


Breast ca. T47D 


100.0 


•si eK tal Muscle F'ool 


6.1 


Breast ca. MDA-N 


2.7 


Spleen Pool 


5.8 


Breast Pool 


4.9 


I lr inn , p 0i 1 


3.0 


Trachea 


9.3 


CNS cancer 
(glio/astro) 1 f87-MG 


1.0 




Lung 


„ „ jCNS cancer 

l(glio/astro) U-l 1 8-MG 


13.7 


Fetal Lung 

Lung ca. NCI-N417 


32 CNS cancer 

|(neuro;met) SK-N-AS 


35.4 


1.2 


CNS cancer (astro) SF- 
539 


4.9 


Lung ca. LX-1 


11.7 


CNS cancer (astro) 
SNB-75 


2.7 
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Lung ca. NCI-H146 


2.9 


CNS cancer (glio) 
SNB-19 


1.4 


Lung ca. SHP-77 


8.1 


CNS cancer (glio) SF- 
295 


_ „ „ r 

12.5 


Lung ca. A549 


10.8 


Brain (Amygdala) Pool 


0.4 


Lung ca. NCI-H526 




2.1 


Brain (cerebellum ) 


13.6 


Lung ca. NCI-H23 


8.1 


Brain (fetal) 


6.9 


Lung ca. NCI-H460 


0.8 


Brain (Hippocampus) 
Pool 


1.5 


Lung ca. HOP-62 


10.2 


Cerebral Cortex Pool 


0.0 


Lung ca. NCI-H522 


4.9 


Brain (Substantia 
nigra) Pool 


, — __ ~ - 

0.2 


Liver 


0.0 


TMain ( I habnii.-o Pix>] 


1.2 1 


Fetal Liver 


1.2 


Brain (whole) 


~" * " To" 


Liver ca.. HepG2 


0.0 


Spinal Cord Pool 


0 1 


Kldnej Pool 


U 


Adrenal Gland 


2.0 


Fetal Kidney 




0.0 


Pituitary gland Pool 


1.8 


Renal ca. 786-0 




6.3 


Salivary Gland 


^ 2 


Renal ca. A498 




0.0 [Thyroid (female) 


1 3.8 


Renal ca. ACHN 


- 


12.1 


Pancreatic ca. 
CAPAN2 


0.0 


Renal ta. UO-31 




_____ 


Pancreas Pool 


4.2 


Table AKC. Panel 4D 


Tiiisue Name 


Rel. Exp.(%) 
Ag3542, Run 
166453844 


Tissue Name 




Rel. Exp.(%) 
Ag3542, Run 
166453844 


Secondary Thl act 




3.9 


HUVEC lL-lbeta 




0.0 


Secondary Th2 act 




5.4 


HUVI C IFN gamma 


2.4 


Secondary Trl act 


3.8 


HUVEC TNF alpha + 
IFN gamma 


0.4 


Secondary Thl rest 


33.0 


HUVEC TNF alpha + 
IL4 




— 

0.0 


Secondary Th2 rest 


5.3 


HUVEC IL-11 


1.4 


Secondary Trl rest 


14.8 


Lung Microvascular EC 
none 


3.1 


Primary Thl act 


6.1 


Lung Microvascular EC 
TNFalpha + IL-lbeta 


1.9 


Primary Th2 act 


0.6 


Microvascular Dermal 
EC none 


,.9 


Primary Trl act 


5.5 


Microsvasular Dermal 
EC 1 NFalpha - II -lbeta 


0.0 


Primary Thl rest 


84.7 


Bronchial epithelium 


2.7 
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TOFalpha i ILlbeta 




Primary Th2 rest 


2^ q Small airway epithelium 

jnone 


1.6 


Primary Trl rest 


1 , „ 'Small airway epithelium 
iTN Pal pha + IL-1 beta 


3.8 


CD45R \ (D4 
lymphocyte act 


0.3 


Coronery artery SMC rest 


2.1 


CD45RO CD4 
lymphocyte act 


3.6 


Coronery artery SMC 
TNFalpha + IL-1 beta 


0.5 


< D8 lymphocyte act 


0.9 


Astrocytes rest 


5.8 


Secondary CD8 
lymphocyte rest 


? _ lAstrocytes TNFalpha + 
JIL-lbeta 


12.0 


Secondary CD8 
lymphocyte act 


12.6 jKU-812 (Basophil) rest 


0.0 


CD4 lymphocyte none 


136 ! KU-812 (Basophil) 
>J ° PMA/ionomycin 


5.4 


2ryThl/Th2/Trl_anti- 
CD95 CHI 1 


40.6 


CCD 11 06 

(Keratinocytes) none 


1.7 


LAK cells rest 


3.7 


CCD1106 
(Keratinocytes) 
TNFalpha + IL-lbeta 


4.9 


LAK cells IL-2 


7.6 Liver cirrhosis 


81.2 


LAK cells IL-2+IL-12 


7.9 Lupus kidney 


5.0 


LAK cells IL-2+IFN 
gamma 


11.7 INCI-H292 none 


7.8 


LAK cells IL-2+ IL-18 


7.2 N< I 1 [_'>)_ II - 


3.5 


LAK cells 
PMA i( nomycin 


3.8 -NCI-1 1292 IL-9 


6.0 

— - 


NK Cells IL-2 rest 


8.0 INCI-H292 IL-13 


6.4 


Two Way MLR 3 day 


2.3 NC1-H292 TFN gamma 


2.4 


Two Way MLR 5 day 


0.9 HPAEC none 


0.0 


Two Way MLR 7 day 


HPAI C LNF alpha i II - 
U -° jl beta 


0.9 


PBMC rest 


12.8 L u n<i fibroblast none 


0.0 


PBMC PWM 


. 4 Lung fibroblast TNF 
alpha i IL-1 beta 


1.1 


PBMC PHA-L 


3.7 


Lung fibroblast U -4 


0.0 


Ramos (B cell) none 


0.0 


Lung fibroblast IL-9 




Ramos (B cell) 
ionomycin 


0.0 


Lung fibroblast IL-13 


0.4 


B lymphocytes PWM 


3.7 


Lung fibroblast IFN 
gamma 


0.0 


B lymphocytes CD40I 
and IL-4 


. . Dermal fibroblast 
[CCD 1070 rest 


0.4 


EOL-1 dbcAMP 


14.1 (Dermal fibroblast 


18.0 
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CCD 1070 TNF alpha 




EOL-1 dbcAMP 


11.7 


Dermal fibroblast 


0.0 




PlVlA/ionomycin 


if~*r'T~\'\ (\1f\ TT 1 koto 

iccui v /u 1.L- i beta 




Dendritic cells none 


6.3 


jjjermai iioro blast irfN 
Jgajrirna 


0.3 


Dendritic cells LPS 


,4 


[Dermal fibroblast IL-4 


1.4 


Dendritic cells anti- 
CD40 


2.3 


[iBD Colitis 2 


4.0 


VlonocWcs rest 


53.2 


1BD Crohn's 


3.2 


Monocytes LPS 


19.2 


jColon 


100.0 


VI aero phages rest 


0.6 


iLung 


11.1 


Macrophages LPS 


0.0 


^Thymus 


2.7 


HUVEC none 


5.4 


Kidney 


7.7 


HUVEC starved 


4.3 





CNS_neurodegeneration_vl.O Summary: Ag3542 Expression of the CG593 15-01 gene 
is low/undetectable in all the samples on this panel. (CTs>35). (Data not shown.) 



General_screening_panel_vl.4 Summary: Ag3542 Expression of the CG593 15-01 gene 
is highest in a breast cancer cell line (CT=31.3). Furthermore, there is significant 
5 expression in a cluster of cell lines derived from brain cancer, colon cancer and ovarian 
cancer. Therefore, expression of this gene could be used to differentiate between these 
samples and other samples on this panel and as a marker to detect the presence of colon, 
brain, ovarian, and breast cancer. Furthermore, therapeutic modulation of the expression or 
function of this gene may be effective in the treatment of colon, brain, ovarian, and breast 
10 cancers. 

Low but significant levels of expression are also seen in the cerebellum. Therefore, 
therapeutic modulation of the expression or function of this gene may be useful in the 
treatment of neurologic disorders, such as Alzheimer's disease, Parkinson's disease, 
schizophrenia, multiple sclerosis, stroke and epilepsy. 

15 Among metabolic tissues, this gene is expressed at low levels in adipose. Therefore, 

this gene product may be useful in the treatment of obesity. 

Panel 4D Summary: Ag3542 Expression of the CG593 15-01 gene is highest in the 
normal colon (CT=30). Furthermore, expression is undetectable in colon samples from 
Crohn's and colitis patients. Thus, expression of this gene could be used to differentiate 
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between normal and inflammed colon. This gene encodes a connexin homolog, a gap 
junction protein involved in intercellular communication. 

The expression of this connexin-like protein in several of the resting and activated 
T lymphocyte preparations and in resting monocytes suggests that small molecule 
5 antagonists or therapeutic antibodies that block its function may also be useful in the 
treatment of a number of inflammatory and autoimmune diseases in which T cells and 
monocytes play a pivotal role. These include, but are not limited to, Crohn's disease, 
ulcerative colitis, multiple sclerosis, chronic obstructive pulmonary disease, asthma, 
emphysema, rheumatoid arthritis, lupus erythematosus, or psoriasis. 

10 References: 

1. Kwak BR, Mulhaupt F, Veillard N, Gros DB, Mach F. Altered pattern of 
vascular connexin expression in atherosclerotic plaques. Arterioscler Thromb Vase Biol 
2002 Feb 1 ;22(2):225-30 



1 5 AL. CG59203-01 : Lysozyme C-like protein 

Expression of gene CG59203-01 was assessed using the primer-probe set Ag3392, 
described in Table ALA. Results of the RTQ-PCR runs are shown in Tables ALB and 
ALC. 



Table ALA . Probe Name Ag3392 



IprimersJ Sequences 


Length p M :' ,t S£ Q ,D 
* Position NO: 


Forward 


5 ' - tgtgaggtttcctaaactggaa-3 1 


22 540 : 511 


jProbe 


TET-5 ' -ctttgcagcaacgccctagggttt-3 ' - 
TAMRA 


24 1 576 


512 


'Reverse 


5 1 -tgacacaggcatttggacat-3 1 


20 607 


513 



20 Table ALB . General_screening_panel_vl.4 



Tissue Name 


Rel. Exp.(%) 
Ag3392, Run 
216821373 


Tissue Name 


Rel. Exp.(%) 
Ag3392, Run 
216821373 


Adipose 


0.0 


Renal ax. TK-IO 


2.7 
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Melanoma* 


0.0 


Bladder 


0.0 


Melanoma* 
nsr>Nr\( n ). I 


0.0 


Gastric ca. (liver met.) 

JNd-JNo / 


0.7 


Melanoma* Ml 4 




( iasit ie ca K ATO III 


0.4 


Melanoma* 

LA JXllVI V I 


0.0 


Colon ca. SW-948 


0.6 


Melanoma* SK- 
MEL-5 


0.0 


Colon ca. SW480 


2.0 


Squamous cell 
carcinoma SCC-4 


0.0 


Colon ca. ; (SW480 
met) SW620 


4.7 


Testis Pool 


100.0 


Colon ca. HT29 


0.0 


Prostate ca.* (bone 
met) PC-3 


0.0 


Colon ca. HCT-116 


0.0 


Prost it. j Pool 


O.d 


Colon ca. CaCo-2 


1.2 


Placenta 


0.7 


Colon cancer tisane 


0.0 


Uterus Pool 


0.0 


Colon ca. SW1116 


0.0 


Ovarian ca. 
OVCAR-3 


0.4 


Colon ca. Colo-205 


11.6 


Ovarian ca. SK- 
OV-3 


1.0 


Colon ca. SW-48 


2.2 


Ovarian ca. 
iUVLAK-4 


0.0 


Colon Pool 


1.1 


Ovarian ca. 
OVCAR-5 


0.0 


Small Intestine Pool 


0.4 


Ovarian ca. 
IGROV-1 


0.0 


Stomach Pool 


0.0 


lOvflri^n C3.. 
OVCAR-8 


_ _ — 

0.0 


i 

Bone Marrow Pool 


- 

1.3 


lOvary 


b.o 


Fetal Heart 


0.0 


Breast ca. MCF-7 


15.2 


Heart Pool 


1.1 


Breast ca. MDA- 
MB-231 


0.0 


Lymph Node Pool 


0.0 


Breast ca. BT 549 


1.7 


i c i! ^UkUl Muscle 


0 0 


Breast ca. T47D 


0.0 


Skeletal Muscle Pool 


0.5 


Breast ca. MDA-N 


0.0 


Spleen Pool 


0.0 


Breast Pool 


(i 0 


Thymus Pool 


0.9 


Trachea 


1.1 


CNS cancer 
(glio/astro) U87-MG 


1.4 


Lung 


ICNS cancer 
!(glio/astro)U-118-MG 


0.5 


Fetal Lung 


0.8 


CNS cancer 
(neuroimet) SK-N-AS 


0.0 


Lung ca. NCI-N417 


0.0 


CNS cancer (astro) SF- 
539 


0.0 
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Lungca . LX-1 




5.4 


jCNS cancer (astro) 
lSNB-75 


6.1 


Lung ca. NCI-Ill 46 




0.0 


ICNS cancer (glio) 
jSNB-19 


. ........ 

0.0 


Lung ca. SHP-77 




0.0 


jCNS cancer (glio) SF- 

J295 


0.0 


Lung ca. A549 


— 


3.1 


^Brain (Amygdala) Pool 


0 0 


Lung ca. NCI-H526 






fBrain (cerebellum) 


0 0 


Lung ca. NC1-1 123 


— 


0.9 


jBrain (fetal) 


0.0 


Lung ca. NCI-H460 




1.1 


jBrain (Hippocampus) 
[Pool 


0.0 


Lung ca. H< >P-62 




0.0 


ICerebral Cortex Pool 

-4 — - 


0.5 


Lung ca. NCI-H522 




0.0 


Brain (Substantia 
nigra) Pool 


0.0 


Liver 




1.5 


Brain ("1 halamus) Pool 


().<> 


Fetal Liver 




1.2 


jBrain (whole) 


0.0 


Liver ca. HepG2 




18.3 


Spinal CorJ Pool 


0.0 " 


Kidney Pool 




1.6 


\drcnai (Hand 


0.0 


Fetal Kidney 




0.0 


PumUn gland Pool 


_ ..... 


Renal ca. 786-0 






'Salivary Gland 




Renal ca. A498 




0.0 


Thyroid (female) 


00 


Renal ca. ACHN 


_ 


0.0 


iPancreatic ca. 
:CAPAN2 


0.0 


Renal c.i.UO-31 


_ 


0.0 


Pancreas Pool 


0.0 


Table ALC. Panel 4D 


Tissue Name 


Rel. Exp.(%) 
Ag3392, Run 
lto2«>M70 


1 Tissue Name 




Rel. Exp.(%) 
Ag3392, Run 
165296470 


Secondary Thl act 


0.0 


HUVEC IL-lbeta 




0.0 


Ncv.oH.Jai> 1 hi act 


36.9 


jHUVF.C IFN gamma 




0.0 


Secondary Trl act 


0.0 


(HUVEC TNF alpha + 
[iFN gamma 




0.0 


Secondary Thl rest 


0.0 


HUVFCTNF alpha + 
IL4 




0.0 


Secondary Th2 rest 


0.0 


HUVEC 11,-1 1 




0.0 


Secondary Trl rest 


0.0 


jLung Microvascular EC 
•none 


0.0 


Primary Thl act 


0.0 


jLung Microvascular EC 
|TNFalpha + IL-lbeta 


0.0 


Primary Th2 act 


0.0 


Microvascular Dermal 
EC none 




0.0 


Primary Trl act 


0.0 


MicrosN asular Dermal 




0.0 
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1 




(EC TNFalpha + IL-lbeta 




Primary Thl rest 


0.0 




Bronchial epithelium 
TNFalpha + ILlbeta 


0.0 


Primary Th2 rest 


0.0 


Small airway epithelium 
none 


0.0 


Primary Trl rest 


0.0 




Small airway epithelium 
TNFalpha + IL-lbeta 


0.0 


CD45RA CD4 
lymphocyte act 


0.0 


Coronery artery SMC rest 


0.0 


CD45ROCD4 
\\ mphocyte act 


0.0 




Coronery artery SMC 
TNFalpha + IL-lbeta 


0.0 


CD8 lymphocyte act 


0.0 


(Astrocytes rest 


0.0 


Secondary CD8 
Kniphi m;>Ic re^t 


0.0 


[Astrocytes TNFalpha + 
11.- 1 beta 


0.0 


Secondarj CD8 
lymphocyte act 


0.0 




KU-8 12 (Basophil) rest 


0.0 


CD4 lymphocyte none 


0.0 


KU-8 12 (Basophil) 
I'MA/ionomycin 


0.0 


2ry Thl/Th2/Trl_anti- 
CD95 CH11 


0.0 


C( Dl 106 

l(Keratinocytes) none 


0.0 








CCD1106 




LAK cells rest 


0.0 


j(Keratinocytes) 
I \l alpha - IL-lbcla 


0.0 


JU/\JV C/ClIS llj-Z 


0.0 


1 iver cirrhosis 


100.0 1 


T AK' r-pllc TT 9-4-TT 10 
Lj/\iV LcIlS 1 Ij-ZtI L,- 1 Z 


0.0 


iLupus kidney 


0.0 


f ••, i; relic TT ?+T"F"M 

I it \ IV C/Cil> 1 I . i 1 i 1 IN 

gamma 


0.0 


|NCI-H292 none 


0.0 


I Ak" < f*lk II .1+ 11-18 


0.0 




NCI-H292 IL-4 




LAK cells 
PMA/ionomycin 


16.8 


|NCI-H292 IL-9 


0.0 


Mk" fVI c IT 0 rfct 


0.0 


N< 1-1 1292 IL-13 


0.0 


Twn Wav 1VTT R ^ Hav 


0.0 


(NCI-H292 IFN gamma 


0.0 


Two Way MLR 5 day 


0 0 


jHPAEC none 




Two Way MLR 7 day 




0.0 




HPAEC TNF alpha + IL- 
1 beta 


0.0 


PBMC test 


0.0 




I ung llbroblastnone 


0.0 


PBMC PWM 


11.1 




Lung fibroblast TM : 
alpha + IL-1 beta 


0.0 


PBMC PHA-L 


0.0 




Lung fibroblast IL-4 


0.0 


Ramos (B cell) none 


0.0 




Lung fibroblast IL-9 


0.0 


Ramos (B cell) 
ionomycin 


0.0 


I ,ung fibroblast IL-13 


0.0 


B lymphocytes PWM 


26.8 




Lung fibroblast IFN 
gamma 


0.0 


B lymphocytes CD40L 


0.0 




Dermal fibroblast 


0.0 
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and IL-4 




LL IJlv/v rest 





EOL-1 dbcAMP 


0.0 


|Dermal fibroblast 
JCCD1070 TNF alpha 


0.0 


uoc/\ivi r 


0.0 


Dermal fibroblast 


0.0 


PMA/ionomycin 


CCD 1070 IL-1 beta 


Dendritic cells none 


0.0 


(Dermal fibroblast IFN 
gamma 


0.0 


Dendrit c cells 1 PS 


0.0 


jDermal fibroblast IL-4 


0.0 


Dendritic cells anti- 
CD40 


0.0 


IBD Colitis 2 


0.0 


Monocj tcs rest 


0.0 


jlBD Crohn's 


0.0 


Monocytes LPS 


0.0 


Colon 


0.0 


Macrophages rest 


0.0 


fLung 


0.0 


Ylacrop lages ! PS 


0.0 


1 hvmus 


0.0 


HUVEC none 


00 


Kidney 


0.0 


HUVEC starved 







CNS_neurodegeneration vl.O Summary: Ag3392 Expression of the CG59203-01 gene 
is low/undetectable in all samples on this panel (CTs>35). (Data not shown.) 



General screening panel_vl.4 Summary: Ag3392 Highest expression of the CG59203- 
01 gene is seen in the testis. Thus, expression of this gene could be used as a marker of 
5 testicula r tissue. Furthermore, therapeutic modulation of the expression or function of this 
gene may be effective in treating infertility or hypogonadism. 

Panel 4D Summary: Ag3392 Significant expression of this gene is detected in a liver 
cirrhosis sample (CT = 33.8). Furthermore, expression of this gene is not detected in 
normal liver in Panel 1.3D, suggesting that its expression is unique to liver cirrhosis. 
1 0 Therefore, therapeutic modulation of the expression or function of this gene may reduce or 
inhibit fibrosis that occurs in liver cirrhosis. In addition, expression of this gene could also 
be used for the diagnosis of liver cirrhosis. 

AM. CG58662-01: cytoplasmic protein 

Expression of gene CG58662-01 was assessed using the primer-probe set Ag3387, 
1 5 described in Table AMA. Results of the RTQ-PCR runs are shown in Tables AMB, AMC 
and AMD. 

Table AMA . Probe Name Ag3387 
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Primers 


Sequences 


Length 


Start 
Position 


SEQID 
NO: 


Forward 


5 1 -aacctgcactcctccatga-3 ' 


19 


504 


514 


Probe 


TET-5 ' -agaccccagcagggtatcctctgag-3 1 - 
IAMF \ 


25 


532 


515 


Reverse 


5 1 -ctctgtcagtgcccacatct-3 1 


20 


564 


516 



Table AMB . CNS neurodegeneration vl.O 



Tissue Name 


1 Rel. Exp.(%) Ag3387, 
Run 210155038 


Tissue Name 


! Rel. Exp.(%) Ag3387, 
Run 210155038 


AD 1 Hippo 


157 [Control (Path) 3 
JTemporal Ctx 


7.3 


AD 2 Hippo 


346 Control (Path) 4 
(Temporal Ctx 


— . „ - ,.j 

42.0 


AD 3 Hippo 


5.5 (AD 1 Occipital Ctx 


17.9 


IAD 4 Hippo 


9 y AD 2 Occipital Ctx 
((Missing) 


0.0 


[AD 5 Hippo 


95.3 


AD 3 Occipital Ctx 


5.4 


IAD 6 Hippo 
Control 2 1 [ippo 
Control 4 Hipp.- 


33.9 


AD 4 Occipital Ctx 


19.6 


41.5 

«M. 


AD 5 Occipital Ctx 
AD 6 Occipital Ctx 


_ 611 
19.5" 

5.8 


Control (Path) 3 
Hippo 


6.7 


Control 1 Occipital 
Ctx 


AD 1 Temporal 
Ctx 


11.2 


Control 2 Occipital 
Ctx 




83.5 


jAD 2 Temporal 

jCtx 


37.6 


Control 3 Occipital 
Ctx 


19.3 


jAD 3 Temporal 1 

Ctx ! 40 


Control 4 Occipital 


— — — — — . 
5.0 


AD 4 Temporal 
Ctx 


21.5 


Control (Path) 1 
Occipital Ctx 


88.9 


AD 5 h i Temporal 
Ctx 


100.0 


Control (Path > 2 
Occipital Ctx 


13.4 


AD 5 Sup 
Temporal Ctx 


37.9 


Control (Path) 3 
< >ccipital Ctx 


5.8 


AD 6 Inf Temporal) 
Ctx 


35.6 


Control (Piili i \ 
Occipital Ctx 


24.3 


AD 6 Sup 
Temporal Ctx 


39.2 


Control 1 Parietal 
Ctx 


9.4 


Control 1 
Temporal Ctx 


6.7 


Control 2 Parietal j 
Ctx 


44.4 


Control 2 
Tempor al Ctx 


65.5 


Control 3 Parietal I 
Ctx 


28.5 


Control 3 


19.3 


Control (Path) 1 


90.8 1 
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Temporal Civ 


(Parietal Ctx 




Control 3 

Temporal Ctx j 


11.4 


( ontrol (Path) 2 T 
JParietal Ctx 


25.7 


Control (Path) 1 j 
Temporal Ctx j 


83.5 


[Control (Path) 3 j 
JParietal Ctx 


5.6 


Control (Path) 2 | 
Temporal Ctx J 


56.6 


(Control (Path) 4 
Parietal Ctx 


56.6 


Table AMC. General screening_panel vl .4 


Tissue Name 


Rel. £xp.(%) 
Ag3387, Run 
217043912 


j 

Tissue Name 

j 


j Rel. Exp.(%) 
Ag3387, Run 
| 217043912 


Adipose 


8.2 


Renal ca. I K- 10 


66.4 


Melanoma* 
Hs688(A).T 


30.6 


RlarlHpr 
LDldUUCI 


1— _ u ' 2 _ 


(Melanoma* 
Hs688(B).T 


34.6 


Gastric ca. (liver met.) 
[NCI-N87 




[Melanoma* M14 


27.0 


Gastric ca. KATO III 


20.6 


Melanoma* 
LOXIMV1 




Colon ca. SW-948 

L „ 


... 


Melanoma* Sk- 
[MEL-5 


25.3 


Colon ca. SW480 


33.0 


(Squamous cell 
carcinoma SCC-4 


5.7 


Colon ca (SW4X0 
met) SW620 


29.9 


Testis Pool 


17.8 


f olon ca. H 120 


8.8 


(Prostate ca.* (bone 
met) PC- 3 


27.7 


Colon ca. HCT-116 


15.4 


[Prostate Pool 


16.6" 


C olon ca. CaCo-2 


1 " " 13.V ■ ' 


(Placenta 


11.0 


C >ion cancer tissue 


------- - 


Uteris Pool 


1.7 


Colon ca SW1116 




0\ ai ian ca. 

OVCAR-3 


17.0 


Colon ca. Colo-205 


4.4 


(Ovarian ca. SK- 

iOV-3 

----- .——.^ 


11.3 


Colon ca. SW-48 


9.3 


Ovarian ca. 
OVCAR-4 


7.1 


Colon Pool 


20.4 


Ovarian ca. 
OVCAR-5 


37.4 


Small Intestine Pool 


12.5 ] 


Ovarian ca. 
IGROV-1 


23.7 


Stomach Pool 


12.1 


Ovarian ca. 
OVCAR-8 


16.8 


Bone Marrow Pool 


4.3 


Ovarv 


20.0 


Fetal Heart 


18.7 


Brtast ca MCF 7 


5.6 


Heart Pool 


12.3 
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Breast ca. MDA- 
MB-231 


41.5 


Lymph Node Pool 


18.9 


[ire i-i ca. BT 54") 


55. 1 


Fetal Skeletal Muscle 


8.9 


Breast ca T47D 


63.3 


Skeletal Muscle Pool 


23.2 


breast ca. mua-in 


23.5 


spleen Pool 


10.8 


Breast Pool 


25.5 


Thymus Pool 


18.0 


Trachea 


13.6 


CNS cancer 
(glio/astro) U87-MG 


62.4 


Lung 


5.3 


CNS cancer 
(glio/astro) U-118-MG 


12.5 


Fetal Lung 


32, 


CNS cancer 
(neuro;met) SK-N-AS 


8.1 


Lung ca. NCI-N417 


6.2 


CNS cancer (astro) SF- 
539 


17.3 


Lung ca. LX-1 


33.7 


CNS cancer (astro) 
SNB-75 


27.2 


Lung ca. NCI-H146 


10.0 


CNS cancer (glio) 
SNB-19 


25.9 


Lung ca. SHP-77 


39.2 


CNS cancer (glio) SF- 
295 


54.0 


Lung ca. A549 


43.8 Brain (Amygdala) Pool 




1 ungca. NCI-H526 
|Lung"ca. NC1-H23 


6.5 fBrain (cerebellum) 
44. 8 Brain (fetal) 


90.1 

56^6 


|Lung ca. NCI-H460 
1 mi" v i IIOP-62 


23.8 
53.6 


Brain (Hippocampus) 
IWI 

Cerebral Cortex Pool 


45.7 
60.3 


(Lung ca . NCI-H522 


100.0 


Brain (Substantia 
nigra) Pool 


61.6 1 


jLiver 4.4 
' ctal I iver 17.0 
Liver c; . 1 IepG2 33.9 
Kidney Pool 37.1 


brain (Thalamus) Pool 
Brain (whole) 
Spina! Cord Pool 
Adrenal Gland 


75.8 

63.3 l 
23.5 ~ ~ \ 


.Fetal Kidney 16.4 Pituitan i land Pool 


3.9 


Renal ca. 786-0 | 14. 8 |Salivary Gland 


9.9 


Renal ca. A498 7.5 ^Thyroid (female) 


29.1 


Renal ca.ACIIN ' 26.2 fe^o ^ 
j CAPAN2 


39.8 


Renal ca. UO-31 81.8 Pancreas Pool 


49.3 



Table AMD . Panel 4D 





! Rel. Exp.(%) 1 




Rel. Exp.(%) 


Tissue Name 


1 Ag3387, Run | 


Tissue Name 


Ag3387, Run 




j 165296475 [ 




165296475 
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Secondary Thl act 


24.1 


jHUVEC IL-lbeta 


12.1 
27.0 


29.9 


HUVEC IFN gamma 


Secondary Trl act 


26.2 


HUVEC TNF alpha + 
IFN gamma 


19.1 


Secondary Thl rest 


18.9 


HUVEC TNF alpha + 
TL4 


16.0 


Secondary Th2 rest 


22 8 HUVEC IL-1 1 


16.4 


Secondary Trl rest 


2g j Lung Microvascular EC 

fnone 


36.6 


Primary Thl act 


1 2 4 jLung Microvascular EC 
[TNFalpha + IL-1 beta 


27.2 


Primary Th2 act 


20 9 jMicrovascular Dermal 
EC none 


43.5 


Primary Trl act 


? . 7 jMicrosvasular Dermal 

!EC TNFalpha + IL-1 beta 


23.2 


Primary Thl rest 


46.0 


Bronchial epithelium 
TNFalpha I ILlbeta 


18.0 


Primary Th2 rest 


36.1 


Small airway epithelium 
none 


3.9 


Primary Trl rest 


37.6 


Small airway epithelium 
TNFalpha + IL-lbeta 


33.2 


CD45RA CD4 
lymphocyte act 


5.7 


Coronery artery SMC rest j 19.6 


CD45ROCD4 
lymphocyte act 


19.9 


Coronery artery SMC 
TNFalpha + IL-lbeta 


12.7 


CD8 lymphocyte act 


16.2 


Astrocytes rest 


27.9 


Secondary CD8 
lymphocyte rest 


11.3 


Astrocytes TNFalpha + 
IL-lbeta 


21.8 


Secondary CD8 
lymphocyte act 


12.3 


KU-812 (Basophil) rest 


7.6 


CD4 lymphocyte none ] 


5.7 


KU-812 (Basophil) 
PMA 'ionomycin 


22.1 


2ry Thl/Th2/Trl_anti- 
mos phi i 


12.2 


CCD1106 

(Keratinocytes) none 


12.0 


LAK cells rest 


19.2 


CCD 1106 
(Keratinocytes) 
TNFalpha + IL-1 beta 


19.3 


LAK cells IL-2 


25.9 [Liver cirrhosis 


9.4 


LAK cells 1L-2+IL-12 


10.2 Lupus kidney 


3.2 


LAK cells 1L-2+1FN 
gamma 


26.4 InCI-H292 none 

_ — . — _ JL „ — 


62.0 

— — — J 


LAK cells IL-2+IL-18 1 


21.2 NCI-H292IL-4 


71.2 


LAK cells 
PMA/ionomycin 


4.2 JNCI-H292 IL-9 52.5 j 


NK Cel ls IL-2 rest j 


21.2 {NCI-H292 IL-1 3 24.8 j 
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rwo Way MLR 3 day 


30.6 


NCI-II292 1FN gamma 


11 7 


1 wo Way Ml R 5 day 


~ "26.2 


IIP \LC none 


27.7 


Two Way MLR 7 day 


16.8 


HPAECTNF alpha I II - 
1 beta 




PBMC rest 


10.7 


Lung fibroblast none 




PBMC PWM 


21.3 


Lung fibroblast TNF 
alpha + IL-1 beta 


12.1 


PR MP PHA.I 


25 .5 


Lung fibroblast IL-4 


48.0 


Ramos (B cell) none 


18.4 


Lung fibroblast IL-*> 


34.6 


Ramos (B cell) 
ionomyein 


68.3 


Lung fibroblast IL-1 3 

L. _ 


51.1 


B lymphocytes PWM 


54.0 


Lung fibroblast IFN 
gamma 


63.3 


B lymphocytes CD40L 

L.«*4 TT A 

:and 1L-4 


0.0 


Dermal fibroblast 
CCD 1070 rest 




20.4 


EOL-1 dbcAMP 


18.4 

_______ 


Dermal fibroblast 
CCD 1070 TNF alpha 


74.2 


EOL-1 dbcAMP 


3.1 


Dermal fibroblast 


4.6 


PMA/ionomycin 


rprji f)7n TT 1 T-if»to 

v^luiu/u iL-i oeia 


Dendritic cells none 


28.1 


normal fi Kt-/\K la tr-t TT7M 

gamma 


10.1 


Dendritic cells LPS 


29.7 


Dermal fibroblast IL-4 


23.8 


Dendritic cells anti- 
CD40 




Irs Li colitis z 


1 A I 
1.4 


Monocytes rest 


17.2 


IBD Crohn's 


2-4 i 


Monocytes LPS 


1.6 


Colon 


21.8 


Macrophages test 


30.8 


Lung 


16.6 


VI n rophc g LPS 


17.4 


Thymus 


100.0 


Uh^ LC none 


22 S 


Kidney 


48.0 


HUVEC starved 


:k 5 







CNS_neurodegeneration_vl.O Summary: Ag3387 This panel does not show differential 
expression of the CG58662-01 gene in Alzheimer's disease. However, this expression 
profile confirms the presence of this gene in the brain. Please see Panel 1.3D for discussion 
of utility of this gene in the central nervous system. 



General screening panel vl.4 Summary: Ag3387 Expression of the CG58662-01 gene 
is ubiquitous in this panel, with highest expression in a lung cancer cell line (CT=29.5). In 
addition, this gene is expressed at higher levels in kidney cancer cell lines when compared 
to normal kidney expression. Thus, expression of this gene could be used to differentiate 
these samples from other samples and as a marker for these cancers. Furthermore, 



715 



therapeutic modulation of the expression of function of this gene may be effective in the 
treatment of lung and kidney cancer. 

Among metabolic tissues this gene is expressed at moderate to low levels in 
adipose, adrenal gland, pancreas, pituitary, and adult and fetal skeletal muscle, heart and 
liver. This widespread expression among these tissues suggests that this gene plays a role 
in norm al metabolic and neuroendocrine function and that disregulated expression of this 
gene may contribute to neuroendocrine diseases or metabolic disorders, such as obesity and 
diabetes. 

In addition, this gene is expressed at moderate to low levels in all CNS regions 
examinded and may be a small molecule target for the treatment of neurologic diseases, 
such as Alzheimer's disease, Parkinson's disease, schizophrenia, multiple sclerosis, stroke 
and epilepsy. 

Panel 4D Summary: Ag3387 The CG58662-01 gene is expressed at high to moderate 
levels in a wide range of cell types of significance in the immune response in health and 
disease, with highest expression in the thymus (CT=31). In addition, expression is seen in 
members of the T-cell, B-cell, endothelial cell, macrophage/monocyte, and peripheral 
blood mononuclear cell family, as well as epithelial and fibroblast cell types from lung and 
skin, and normal tissues represented by colon, lung, thymus and kidney. This ubiquitous 
pattern of expression suggests that this gene product may be involved in homeostatic 
processes for these and other cell types and tissues. This pattern is in agreement with the 
expression profile in General_screening_panel_vL5 and also suggests a role for the gene 
product in cell survival and proliferation. Therefore, modulation of the gene product with a 
functional therapeutic may lead to the alteration of functions associated with these cell 
types arid lead to improvement of the symptoms of patients suffering from autoimmune and 
inflammatory diseases such as asthma, allergies, inflammatory bowel disease, lupus 
erythematosus, psoriasis, rheumatoid arthritis, and osteoarthritis. 

AN. CGJ59371-01: Novel cytoplasmic protein 

Expression of gene CG59371-01 was assessed using the primer-probe set Ag3558, 
described in Table ANA. Results of the RTQ-PCR runs are shown in Tables ANB, ANC, 
AND and ANE. 
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Table ANA . Probe Name Ag3558 



Primers 


Sequences 


Length 


r 

Start 
Position 


SEQ 1 
ID 

NO: 1 


For a on! 


5 ' -cttgaggr h i a ; ragaag-3 1 


21 


208 


5lfj 


Probe 


TET-5 ' -tgcttatcaactcacagagaaggaca-3 1 - 
TAMRA 


26 


231 


518 J 


Reverse 


5 ' -gttggtctctcagtcgctgta-3 ' 


21 


263 


519 



Table ANB . General_screening_panel_vl.4 



Tissue Name 


Rel. Exp.(%) 
Ag3558, Run 
213391281 


Tissue Name 


Rel. Exp.(%) 
Ag3558, Run 
213391281 


Adipose 


0.4 


Renal ca. TK-10 


14.3 


Melanoma* 
Hs688(A).T 


1.9 


Bladder 


5.7 


Melanoma* 
Hs688(B).T 


2.2 


Gastric ca. (liver met.) 
NCI-N87 


6.4 

1 


Melanoma* Ml 4 


41.5 


Gastric ca. KATO III 


82.9 


Melanoma* 
LOXIMVI 


30.1 


Colon ca. SW-948 


14.8 


Melanoma* SK- 
MEL-5 


23.8 


Colon ca. SW480 


81.2 


Squamous cell 
carcinoma SCC-4 


34.6 


Colon ca ' (SW480 
met) SW620 


24.0 


restis Pool 


7.6 


Colon ca. HT29 


20.6 


Prostate ca.* (bone 
met) PC-3 


9.7 


Colon ca. HCT-116 


61.6 


Prostate Pool 


0.1 


( 'olon ca. Ca( 'o-2 


17.7 


Placenta 


0.3 


( olon cancel tissue 


7.7 


lUterus Pool 


0 f 


Colon ca. SW1116 


6 - 9 1 


Ovarian ca. 
OVCAR-3 


15.3 


Colon ca. Colo-205 


32 

- - 1 


Ovarian ca. SK- 
OV-3 


62.0 


Colon ca. SW-48 


9.3 


Ovarian ca. 
OVCAR-4 


27.5 


Colon Pool 


0.3 1 


Ovarian ca. 
OVCAR-5 


14.4 


Small Intestine Pool 


0.1 


Ovarian ca. 
IGROV-1 


5.8 


Stomach Pool 


2.0 


Ovarian ca. 
OVCAR-8 


2.5 


Bone Marrow Pool 


0.3 
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jOvary 


0.2 


Fetal 1 [earl 3.3 


iBreast ca. MCF-7 


15.5 


Heart Pool 0.0 


|Breast ca. MDA- 
MB-231 


100.0 


Lymph Node Pool 


0.5 


Breast ca. BT 549 


72.7 


Fetal Skeletal Muscle 


0.7 


Breast ca. T47D 


17.1 


Skeletal Muscle Pool 


0.0 


Breast ca. MDA-N 


13.5 


Spleen F'oui 


0.8 


Breast Pool 


0.2 


rhymus Pool 


7.2 


Trachea 


0.4 


CNS cancer 
(glio/astro) U87-MG 


17.6 


Lung 


0.0 


CNS cancer 

(glio astro) U-l 18-MG 


61.6 


Fetal Lung 


j r ICNS cancer 

(neuroanet) SK-N-AS 


14.9 


Lung ca. NCI-N417 


- . ( KS C U1C '! i lSttO) ^} - 

J539 


21 6 


Lung ca. LX-1 


?7 0 jCNS cancer (astro) 
jSNB-75 


30.4 


jLung ca. NCI-H146 



Lung ca. SHP-77 


15.2 

42.0 


CNS cancer (glio) 
SNB-19 

C NJS cancel , SI - 
295 


4.6 
0.8 




28.5 


Brain ( \rnygdala) Pool 


0.1 


Lung ca. NCI-H526 


6.4 


Brain (cerebellum) 


0.0 


T una ra Nri-H?'! 


21.3 


Brain (fetal) 


1.1 


Lung ca. NCI-H460 


0.7 


Brain (Hippocampus) 
Pool 


0.2 


Lungca. HOP-62 


4.7 


( erebral Cortex Pool | 


- - 

0.1 


]Lung ca. NCI-H522 


23.8 


Brain (Substantia 
nigra) Pool 


0.1 


i Liver 


0.0 


Bt iif ( 1 haLiniu-) P. h)[ 


6.6 1 


1 etal Liver 


0.8 


Brain (whole) 


0.0 


Liver ca . HepG2 
Kidney Pool 
Fetal "kidney 


7.7 


^jifii.J ( old P. ml 


0.1 

0.2 1 
0.0 l 


0.1 


Vdrenal Gland 


4.1 


Pit ill ir; .■ md Pool 


Renal ca. 786-0 


52.5 Salivary Gland 


0.0 ! 


Renal ca. A498 


6.5 (Thyroid (female) | 


0.2 " H 


Renal ca. ACHN 


14.0 


Pancreatic ca. | 
CAPAN2 


43.2 


Renal ca.UO-31 


20.2 


Pancreas Pool 


0.9 —\ 



Table ANC . General_screening_panel_vl.5 
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Tissue Name 


Rel. Exp.(%) 
Ag3558, Run 
248592792 


Tissue Name 


Rel. Exp.(%) 
Ag3558, Run 
248592792 


Adipose 0.4 


Renal ca. TK-10 


16.6 


Melanoma* 
Hs688(A).T 


2.0 


Bladder 


3.3 


Melanoma* 

HS065(B).l 


3.1 


Gastric ca. (liver met.) 
NCI-N87 


6.7 


Melanoma 7 M 1 4 


34.9 


Gastric ca K A R) III 


— — -— — — 
93.3 


Melanoma* 
LOXIMVI 


26.4 


Colon ca. SW-948 


13.0 


Melanoma* SK- 
MEL-5 


29.5 


Colon ca. SW480 


76.3 


Squamous cell 
carcinoma SCC-4 


33.0 


Colon ca.* (S W480 
met) S W620 


25.5 


[ e lis Pool 


7.2 






Prostate ca.* (bone 
met) PC-3 


10.3 


Colon ca.HCT-1 16 


55.5 


Prostate Pool 


0.1 


Colon ca. CaCo-2 


23.5 


Placenta 


0.1 


Colon cancer tissue 


5.0 


Lterus 3 ool 


0.2 


Colon ca. SW1116 


4.6 


Ovarian ca. 
OVCAR-3 


29.5 


Colon ca. Colo-205 


5.6 


Ovarian ca. SK- 
OV-3 


40.6 


Colon ca. SW-48 


6.8 


Ovarian ca. 
cwrc^ ad /i 
UVLAK-4 


29.5 


Colon Pool 


0.2 


Ovarian ca. 
OVCAR-5 


11.7 


Small Intestine Pool 


0.2 


Ovsjrifln C3. 
IGROV-l 


4.6 


Stomach Pool 


0.2 


Ovarian ca. 
OVCAR-8 


5.2 


Bone Marrow Pool 


0.2 


Ovary 


0.2 


Fetal Heart 


2.0 


Breast ca. MCF-7 


18.0 


Heart Pool 


0.0 


Breast ca. MDA- 
MB-231 


85.3 


Lymph Node Pool 


0.5 


Breast ca. BT 549 


100.0 


Fetal Skeletal Muscle 


0.8 


Breast ca. T47D 


23.3 


Skeletal Muscle Pool 


0.0 


Breast ca. MDA-N 


20.2 


Spleen Pool 


1.0 


Breast Pool 


0.3 


i hymus Pool 


_3.7 


Trachea 


0.4 


CNS cancer ; 
(glio/astro) U87-MG | 


15.4 


Lung 


0.0 


CNS cancer 
(glio/astro) U- 11 8-MG ■ 


64.6 



jFetal Lung 


3.5 


CNS cancer 
(neuro;met) SK-N-AS 


24.7 


Lung ca. NCI-N417 


; » 


CNS cancer (astro) SF- 
539 


21.6 


Lung ca. LX-1 


j 

32.1 


CNS cancer (astro) 
SNB-75 


30.4 


Lung ca.NCI-H146 


13.6 


CNS cancer (glio) 
SNB-19 


4.8 


Lung ca. SHP-77 


q JCNS cancer (glio) SF- 

J295 


3.3 


Lung ca. A549 


35.4 Bta.iif \tri) j Jala; Pool 


0 0 


Lung ca. NCI-H526 


3.4 (Brain (cerebellum) 




Lung ca. NCI-H23 


15.9 {Brain (fetal) 


0.7 


Lung ca. NCI-H460 


ft . Brain (Hippocampus) 
JPool 


0.0 


[Lung ca. HOP-62 


4.5 Cerebral Cortex Pool 


0.0 


Lung ca. NCI-H522 


22 5 jBrain (Substantia 
(nigra) Pool 


0.0 

__J 


Liver 


0.0 |Brain (Thalamus) Pool 


I— <">.-. J 


Fetal Liver 
Liver ca. HepG2 


1.3 

6.6 


Brain (whole) 


0.0 


spinal < . »rd Pool 


0.0 


Kidney Pool 


0.1 


\dt -n i! « .1 ii, I 




F Ltd] kiJl'C 


4.6 


Pslui1ar\ gland Poo! 


0.0 

(Ml 


Renal ca 7X6-0 


44.1 


Sail van Gland 


Renal ca. A498 


4.2 


rh\ioiJ [ female) 


0.1 


Renal ca. ACHN 


15.2 


Pancreatic ca. 
CAPAN2 


48.3 


Renal ca. UO-31 


20.4 


Pancreas Pool 


0.5 1 


Table AND. Panel 2.2 



Tissue Name 


ReL Exp.(%) 
Ag3558, Run 
173762113 


Rel. Exp.(%)j 

Ag3558, Run! Tissue Name 
174924057 j 


Rel. Exp.(%) 
Ag3558, Run 
173762113 


Rel. Exp.(%) 
Ag3558, Run 
174924057 


Normal Colon 


12.5 


- , Kidney Margin 
(OD04348) 


3.3 




Colon cancer 
(OD06064) 


100.0 


Kidney 

100.0 j mali 8 nant 
Icancer 
:(()I)()62(.)4B) 


5.6 


8.9 


Colon Margin 
(OD06064) 


27.5 


Kidney normal 
17.7 adjacent tissue ^ 
(OD06204D 


0.6 


0.3 
2.8 


Colon cancer 
(OD06159) 


3.1 


. r. (Kidney Cancer \ 
(OD04450 01) 


6.3 



jColon Margin 
(OD06159) 


1 " 

| 5.6 


1 73 


Kidney Margin 
(OD04450-03) 


0.6 


0.0 


jColon cancer 
|(OD06297-04) 


1 11.5 


16.6 


Kidney Cancer 
8120613 


0.0 


0 0 


jColon Margin 
[(OD06297-05) 


1 12.5 


7 „ (Kidney Margin 
J8120614 


0.3 


0.3 


CC Gr.2 ascend 
colon 

(OD03921) 


1 6.9 


- 7 IKidney Cancer 
I [9010320 


0.2 


1.9 


ICC Margin 
(OD03921) 


6.8 


r a IKidney Margin 
J90 10321 


2.1 


2.3 


jColon cancer 
metastasis 
(OD06 04) 


6.8 


4 g |Kidney Cancer 
18120607 


0.8 


1.7 


(Lung Margin 
|(OD06104) 


17 9 


1 1 r Kidney Margin 
18120608 


0 3 




jColon mets to 
ilung 

(OD04451-01) 




— - — \~ 

23.3 (Normal Uterus 


1.6 


0.5 


jLung Margin 
(OD04451-02) 


2.5 


. , j Uterine Cancer 
0640 n 


3.5 


3.6 


Normal Prostate 


2.1 


0 q INormal 
Th\ roul 


0.3 


0.0 


JProstate Cancer 
(OD04410) 


0.0 




n j Thyroid Cancer 
,064010 


1.1 


0.7 


(Prostate Margin 
(01)04410) 


0.4 


0.3 


Thyroid Cancer 
A302I52 


1.2 


1.0 


Normal Ovary 


2.7 


0.4 


Thyroid Margin! 
A302153 


0.3 


0.3 


Ovarian cancer 
(OD06283-03) 


30.1 


32.8 Normal Breast 


2 7 


1.9 


jOvarian Margin 
(OD0f>283-07) 


1.4 


. , jBreast Cancer 
.01)04566) 


74 


8 5 


Ovarian Cancer 
064008 


7.0 


, 0 iBreast Cancer 
J1024 


4.3 


6.5 


Ovarian cancer \ 
(OD06145) 


1.2 j 


1.9 


Breast Cancer 
(OD04590-01) j 


™" ~ f 

11.8 


13.9 


Ovarian Margin) 
(OD06145) 


" ' 1 
0.9 


0.6 


Breast Cancer i 
Mets 

(OD04590-03) 


| 

8.0 


6.8 


Ovarian cancer 
(OD06455-03) j 


28.1 


jBreast Cancer j 
30.4 Metastasis 1 
I(OD04655-05) j 


7.9 


11.0 


Ovarian Margin I 
(OD06455-07) I 


0.7 I 


0.6 


Breast Cancer 
064006 




5.8 


Normal Lung 


— — f 


1.3 


Breast Cancer 


0.7 I 


0.3 
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9100266 




• 


Invasive poor 
diff. lung adeno 
(ODO4945-01 


I 25.0 


| 20 - 3 

L 


Breast Margin 
19100265 

— J 


1.8 


j 

1 1.1 


Lung Margin 
(ODO4945-03) 


i i.o 


j 1.7 


'Breast Cancer 
|A209073 


2.5 




Lung Malignant 
Cancer 
(OD031 26) 


1 63 


| 6.4 


(Breast Margin 
jA2090734 


3.0 


! 

1.2 

i. 


jLung Margin 
1(OD03126) 


1.3 


1.0 


iBreast cancer 
(OD06083) 


15.7 


24.7 


Lung Cancer 
(OD05014A) 


13.5 


10.3 


(Breast cancer 
(node metastasis 
J(Op06083) 


16.5 


15.0 


Lung Margin 
(OD05014B) 


3.1 


4.8 


Normal Liver 


0.0 


0.0 


Lung cancer 
(OD0608I) 


38.4 


28.7 


Liver Cancer 
1026 


1.2 


0.0 


Lung Margin 
(OD06081) 


1.2 


, , iLiver Cancer 
,1025 


4.1 




Lung Cancer 
(OD04237-01) 


7.4 


3.8 


ILiver Cancer 
16004-T 


0.9 


0.9 


Lung Margin 
(OD04237-02) 


0.8 


1.9 


ILiver Tissue 
6004-N 


2.2 


1.4 


Ocular 

Melanoma 

Metastasis 


0.2 


0.6 


Liver Cancer 
(6005-T 


0.8 


1.0 


Ocular 
Melanoma 
Margin (1 i\cr) 


0.0 

— — - 


0.0 


iLiver Tissue 
J6005-N 


1.3 


0.3 


Melanoma 
Metastasis 


13.2 


12.5 


Liver Cancer 
1064003 


1 6 
i.o 


2 0 


Melanoma 
Margin (Lung) 


„ 

_ 


1.1 


'Normal 
(Bladder 


4.4 




Normal Kidney 


0.3 


0.0 


Bladder Cancer 
1023 - 


4.6 


1.9 


Kidney Ca, 
Nuclear grade 2 
$ a) 


0.0 


1.0 


SB ladder Cancer 
IA302173 


23.0 


15.4 


Kidney Margin 
(OD04338) 


0.8 


0.3 


Normal 1 
1 Stomach 


6.5 , 


9.5 


Kidney Ca 
Nuclear grade 
1/2 (OD04339) 


0.6 


2.6 


IGastric Cancer I 
19060397 

-J - - - 


2.9 


3.2 


Kidney Margin I 
(OD04339) 


0.8 | 


0.6 


j Stomach 
(Margin 


- — ^-.^ 

4.9 


1.0 
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9060396 




i 


Kidney Ca, 
Clear cell type 
(OD04340) 


0.6 


1.2 


iGastric Cancer 
J9060395 


6.3 


4.6 


Kidney Margin 
(OD04340) 


0.7 

... — „ 


M 


Stomach 
JMargin 
19060394 


3.5 


3.0 


Kidney Ca, 
Nuclear grade 3 
(OD04348) 


36.9 


31.0 


j 

| Gastric Cancer 
1064005 

„,.L_™_ 


22.1 

™ ■ 


20.0 



Table ANE . Panel 4D 



1 Tissue Name 


Rel. Exp.(%) 
I Ag3558, Run 
j 166488678 


1 Tissue Name 


! Rel. Exp.(%) 
Ag3558, Run 
166488678 


Secondary Th 1 act 


[ 17.9 


iHUVEC IL-lbeta 


' 10.3 


{Secondary Th2 act 


12 1 


;rFUVEClFN gamma 


12.2" 


Secondary Trl act 


12.3 


jHUVEC TNF alpha + 
(IFN gamma 


12.8 


(Secondary Thl rest 


1.7 


iHUVEC TNF alpha + 
ylL4 


14.9 


|lecondary Th2 rest 


2.6 


III \ 1 ( II -1 1 


8.6 


(Secondary Trl rest 


2.5 


Lung Microvascular EC 
none 


5.2 " j 


Primary Thl act 


9.1 


.Lung Microvascular EC 
rNFalpha4 IL-lbeta 


4.8 


Jprimary Th2 act 


11.6 


iMicrovascular Dermal 
[EC none 


— — 1 
19.5 


(Primary Trl act 


11.4 


iMicrosvasular Dermal 
|EC TNFalpha + IL-lbeta 


8.5 


(Primary Thl rest 


32.8 


(Bronchial epithelium 
INFalpha ILIbaa 


1.0 


(Primary Th2 rest 


10.4 


jSmall airway epithelium 
none 


0.5 


Primary Trl rest 


13.4 


JSmall airway epithelium 
jTNFalpha t- IL-lbeta 


5.8 


CD45RA CD4 
jlymphocyte act 


10.7 


jCoronery artery SMC rest 


2.3 


(CD45RO CD4 
[lymphocyte act 


17.2 


ICoronery artery SMC 
JTNFalpha + IL-lbeta j 


1.3 


jCD8 lymphocyte act 


12.2 


{Astrocytes rest 


1.4 


(Secondary CD8 
[lymphocyte rest 


11.7 


[Astrocytes TNFalpha h 
JlL-lbeta 


0.7 


(Secondary CD 8 


10.4 


Kl 1-812 (Basophil) resi 


3.1 
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lymphocyte act 



JCD4 lymphocyte none 



KU-8 12 (Basophil) 
iPMA/ionomycin 



2ryThl/Th2/Trl_anti- 
CD95CH11 



LAK cells rest 



LAK cells IL-2 



(CCD 1106 

(Keratinocytes) none 



CCD1106 
(Keratinocytes) 
TNFalpha + IL-lbeta 



Liver cirrhosis 



LAK cells IL-2+IL-12 



Lupus kidney 



LAK cells IL-2+IFN 
gamma 



NCI-H292 none 



LAK cells IL-2+ IL-18 



NCI-H292 IL-4 



LAK cells 
PMA/ionomycin 



NCI-H292 IL-9 



NK Cells IL-2 rest 



NCI-H292IL-13 



Two Way MLR 3 day 
I wo \\ ay MLR 5 da) 



Two Way MLR 7 day 




N( 1-H2v2 I FN gamma 
HPAEC none 

HPAEC TNF alpha I II - 
1 beta 



PBMC PWM I 

gBN^PliAZ'''""' 
Ramos Ji <„dlj none 
.' amos [B cell) 

fi onomyc in | 

B lymphocytes PWM ] 

B l\mphoc\tcs CD40L 
and IL-4 

lEOL-1 dbcAMP 



I ung fibroblast none 



(Lung fibroblast TNF 
[alpha + IL-1 beta 



Lung fibroblast IL-4 



Lung fibroblast IL-9 



EOL-1 dbcAMP 
PM -V'ionotnycin 



Dendritic cells none 



Lung fibroblast IL- 13 

Lung fibroblast IFN | 
gamma ] 
JDermal fibroblast 
CCD 1 070 rest 
j Dermal fibroblast 
CCD 1070 TNF alpha 
iDermal fibroblast 
JCCD1070 IL-1 beta 



Dermal fibroblast IFN 
gamma 



Dendriti c cells LPS 



Dermal fibroblast IL-4 



Dendritic cells anti- 
CD40 



Macrophages rest 



0.0 
0.0 

3.0 



Colon 

jLung 



Macrophages LPS 



[Thymus 



724 



HUVEC starved 



18.7 

0.0 



(Kidney 



Si 
a 
W 



CNS_neurodegeneration_vl.O Summary: Ag3558 Expression of the CG59371-01 gene 
is low/undetectable in all samples on this panel (CTs>35). (Data not shown.) 

General screening panel_vl.4 Summary: Ag3558 Highest expression of the CG59371- 
01 gene is seen in a breast cancer cell line (CT=23.4). Overall, expression of this gene is 
significantly higher in cancer cell lines and fetal derived tissues than in samples derived 
from normal adult tissues. There are significant levels of expression in clusters of cell lines 
derived from pancreatic, brain, colon, gastric, renal, lung, ovarian, breast and melanoma 
cancers. Thus, expression of this gene in could be used to differentiate between the cancer 
derived samples and fetal tissues from other samples on this panel and as a marker to detect 
the presence of cancer. Furthermore, the much higher levels of expression in proliferative 
tissue suggest that this gene may be involved in cell proliferation. Therefore, therapeutic 
modulation of the expression or function of this gene may be effective in the treatment of 
these cancers. 

Among tissues with metabolic function, this gene is expressed at moderate to low 
levels in pituitary, adipose, adrenal gland, pancreas, thyroid, and adult and fetal skeletal 
muscle, heart, and liver. This widespread expression among these tissues suggests that this 
gene product may play a role in normal neuroendocrine and metabolic and that disregulated 
expression of this gene may contribute to neuroendocrine disorders or metabolic diseases, 
such as obesity and diabetes. 

This molecule is a novel protein phosphatase expressed at moderate to low levels in 
all regions of the CNS examined. Therefore, therapeutic modulation of the expression or 
function of this gene may be useful in the treatment of neurologic disorders, such as 
Alzheimer's disease, Parkinson's disease, schizophrenia, multiple sclerosis, stroke and 
epilepsy. 

General_screening_panel_vl.5 Summary: Ag3558 Results from this experiment are in 
excellent agreement with results from Panel 1 .4. Please see that panel for discussion of 
utility of this gene in cancer, metabolic disorders and the central nervous system. 
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Panel 2.2 Summary: Ag3558 Two experiments with the same probe and primer produce 
results that are in excellent agreement, with highest expression of the CG59371-01 gene in 
colon cancer (CTs— 30). Furthermore, expression is higher in kidney, lung, ovary and colon 
cancers when compared to normal adjacent tissue. In addition, significant expression is also 
seen in gastric, breast, and bladder cancer. Thus, , expression of this gene in could be used 
to differentiate between the cancer derived samples and other samples on this panel and as 
a marker to detect the presence of cancer. Furthermore, therapeutic modulation of the 
expression or function of this gene may be effective in the treatment of these cancers. 

Panel 4D Summary: Ag3558 The CG59371-01 gene is widely expressed among the 
samples on this panel, with highest expression in dermal fibroblasts treated with TNF- 
alpha. Significant levels of expression are also seen in treated and untreated samples from 
skin, lung, T-cells and B-cells. Therefore, modulation of the expression or activity of the 
protein encoded by this transcript through the application of antibodies or peptides 
therapeutics may be beneficial for the treatment of lung inflammatory diseases such as 
asthma, and chronic obstructive pulmonary diseases, inflammatory skin diseases such as 
psoriasis, atopic dermatitis, ulcerative dermatitis, and ulcerative colitis, autoimmune 
diseases such as Crohn's disease, lupus erythematosus, rheumatoid arthritis and 
osteoarthritis and in other diseases in which T cells and B cells are activated. 

AO. CG59346-01: Cortactin-binding protein 1 

Expression of gene CG59346-01 was assessed using the primer-probe set Ag3550, 
described in Table AOA. Results of the RTQ-PCR runs are shown in Tables AOB, AOC 
and AOD. 



Table AOA . Probe Name Ag3550 



(Primers 


Sequences 


Length 


Start Position 


SEQID 
NO: 


jForward 
probe"" 


5 ' -gccaacagagatgaacaaagag-3 1 




>459 
3508" 


520 


TET-5 ■ -accgcctctcct.tctcccgctct-3 ' -TAMRA 


23 


521 


Reverse 


5 ' -ttggaaggctaaagacatctga-3 ' 




3532 


522 



Table AOB . CNS_neurodegeneration_vl.O 



Tissue Name 


Rel. Exp.(%) Ag3550, 
Run 210641081 


Tissue Name 


Rel. Exp.(%) Ag3550, 
Run 210641081 
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AD 1 Hippo 


12.8 


Control (Path) 3 


5.1 


AD 2 Hippo 


38.7 


Temporal Ctx 


40.3 


AD 3 Hippo 


10.4 


AD 1 Occipital Ctx 


18.8 


AD 4 Hippo 


15.8 


AD 2 Occipital Ctx 
(Missing) 


0.0 


AD 5 Hippo 


79.6 


AD 3 Occipital Ctx 


7 2 


\l ) 6 I lippo 


49.3 


AD 4 Occipital Ctx 


25.9 


Control 2 Hippo 


37.4 


AD 5 Occipital Ctx 


37.6 


Control 4 Hippo 


10.3 


At) 6 Occipital Ctx 


19.6 


Control (Path) 3 
jHippo 


q (■ IControl 1 Occipital 
(Ctx 


2.1 


AD 1 Temporal 
( i- 


15.7 


Control 2 Occipital 
Ctx 


56.6 


Al ) 2 Temporal 
Ctx 


37.1 


Control 3 Occipital 
Ctx 


26.8 


AD 3 Temporal 
Ctx 


8.6 


Control 4 Occipital 
Ctx 


5.0 


AD 4 Temporal 1 , 
C ix I 30 6 


Control (Path) 1 
Occipital Ctx 


93.3 


AD 5 Irif Temporal 

Ctx 


66.9 


Control (Path) 2 
Occipital Ctx 


14.6 


AD 5 Sup 
Temporal Ctx 


38.7 


Control (Path) 3 
Occipital ( t.\ 


2.6 


AD 6 Inf Temporal 
Ctx 


45.4 


Control (Path) 4 
Occipital Ctx 


23.8 


AD 6 Sup 
Temporal Ctx 


53.2 


Control 1 Parietal 
Ctx 


8.7 


Control 1 
temporal Ctx 


7.3 


Control 2 Parietal 
Ctx 


48.0 


Control 2 
1 empoial Ctx 


36.6 


Control 3 Parietal 
Ctx 


17.2 


Control 3 
Temporal Ctx 


29.7 


Control (Path) 1 
Parietal Ctx 


84.1 

— , 1 


Control 3 
Temporal ( t\ 


14.6 


Control (Path) 2 
Parietal Ctx 


28.5 


Control (Path) 1 
Temporal Ctx 


100.0 


( ontrol (Path) 3 
Parietal Ctx 


4.6 


Control (Path) 2 
Temporal Ctx 


65.5 


Control (Path) 4 
Parietal Ctx 


56.6 



Table AOC . General_screening_panel_vl.4 



Tissue Name Rel. Exp.(%) j Tissue Name Rel. Exp.(%) | 
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Ag3550, Run 
217048931 




Ag3550, Run 
217048931 


Adipose 


0.5 (Renal ca. TK- 10 


27.7 


Melanoma* 

riS006(AJ. 1 


1.4 1 Bladder 


13.7 


Melanoma* 

U n <COC>/'D\ T 1 
X1S0O5(d). 1 


, , iGastric ca. (liver met.) 
JNCI-N87 


25.0 


Melanoma* Ml 4 


0.0 IGastric ca. KATO III 


24.5 


Melanoma* 

Ll ).\I .M\ 1 


0.0 Colon ca. S W-948 


1.4 


Melanoma* SK- 
MEL-5 


0.8 Colon ca. SW480 


8.8 


j Scjuamous cell 
[carcinoma SCC-4 


? - jColon ca* (SW480 
(met) SW620 


7.4 




2.0 !Colon ca. HT29 


2.0 


Prostate ca.* (bone 
[met) PC-3 


15.2 Colon ca.HCT-1 16 


7.1 


Prostate Pool 


6.7 Colon ca. CaCo-2 


02.0 


(Placenta 


1 8.9 jColon cancer tissue 


6.0 

is 


[Uterus Pool 


0.0 


Colon ca. SW1116 


jOvarian ca. 
OVCAR-3 


7.6 


Colon ca. Colo-205 


3,0 


i Ovarian ca. SK- I 

OV-3 14,9 


Colon ca. SW-48 


3.1 


Ovarian ca. 

OVCAR-4 3A 


Colon Pool 


2.2 


Ovarian ca. 
OVCAR-5 


Small Intestine Pool 


5.0 


Ovarian ca. 

IGROV-1 [ 21 


Stomach Pool 


6.9 

— - I 


Ovarian ca. j 
OVCAR-8 24 


Bone Marrow Pool 


0.2 


Ovary 1.4 


Fetal Heart 


0.1 i 


Breast ca. MCF-7 34.6 


Heart Pool 


— i 


Breast ca. MDA- 
MB-231 


8.2 


Lymph Node Pool 


1 

2.8 ! 

-^^..^.J 


Breast ca. BT 549 


0.2 


Fetal Skeletal Muscle 


0.2 


Breast ca. T-M) 


57.4 


Skeletal Muscle Pool 


0.0 


Breast ca. MDA-N 0.0 


Spleen Pool 


1.8 ! 


Breast Pool 


4.6 


Thymus Pool 


7.2 


Trachea 


14.1 


CNS cancer 
(glio,astm) U87-MG 


0.1 


Lung 


0.0 


CNS cancer 
(glio/astro)U-118-MG| 


0.1 
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Fetal Lung 




19.5 


CNS cancer 
(neuro;met) SK-N-AS 


0.0 


Lung ca. NCI-N417 




1.5 


CNS cancer (astro) SF- 


0.0 


Lung ca. LX-1 




24.7 


CNS cancer (astro) 
SNB-75 


0.0 


Lung ca. NCI-H146 


7.4 


CNS cancer (glio) 
SNB-19 


2.1 


Lung ca. SHP-77 


0.2 


v^ino eaiieci or- 
295 


0.1 


Lung ca. A549 




26.1 


Brain (Amygdala) Pool 


22.1 


Lung ca. NCI-H526 




13.9 




63 3 


jLung ca. NCI-H23 


6.6 


Brain (fetal) 




Lung ca. NCI-H460 


11.1 


Brain (Hippocampus) 
Pool 


— 

28.1 


Lung ca. HOP-62 


0.2 


Cerebral Cortex Pool 


34.2 


Lung ca. NCI-H522 




0.5 


Brain (Substantia 
nigra) Pool 


26.2 


Liver 


3.6 


Brain t Thalamus) poo] 


37.9 


Fetal Liver 


19.1 


Brain (whole) 


57.8 


Liver ca. HepG2 


26.4 


Spinal Cord Pool 


2.8 


Kidney Pool 


0.3 


Adrenal Gland 


2.6 


f tT'jl K clnc> 


1 1.7 


Pituitary gland Pool 


3.6 


Renal c i. 786-0 


23.3 


Salivarv ( .land 




25.5 


Renal ca. A498 


11.0 


Tl;\n»:d (female) 




6.9 


Renal ca. ACHN 


27.7 


Pancreatic ca. 
CAPAN2 


15.3 


Renal ca. 1 IQ-31 


12.7 


Pancreas Pool 




12.9 


Table AOD. Panel 4D 


Tissue Name 


Rel. Exp.(%) 
Ag3550, Run 
166453850 


Tissue Name 


Rel. Exp.(%) 
Ag3550, Run 
166453850 


Secondary Thl act 


0.2 


HUVEC IL-lbeta 




0.0 


Secondary Th2 act 


0.1 


HUVEC IFN gamma 




0.0 


Secondary Trl act 


0.0 


HUVEC TNF alpha + 
IFN gamma 


0.0 


Secondary Thl rest 


0.0 


HUVEC TNF alpha + 
IL4 


0.0 


Secondary Th2 rest 


II. 0 


HUVEC IL-l 1 


0.0 


Secondary Trl rest 


0.0 


Lung Microvascular EC 
none 


0.0 


Primary Thl act 


0.0 


Lung Microvascu lar I '( ' 


0.0 
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TNFalpha + IL-lbeta ' 


Primary Th2 act 


0.0 


(Microvascular Dermal 
|EC none 


L 


Primary Trl act 


0.0 


fMicrosvasular Dermal 
EC TNFalpha + IL-lbeta 


0.0 


Primary Thl rest 


0.0 


Bronchial epithelium 
TNFalpha + IL1 beta 


9.5 


Primary Th2 rest 


0.0 


Small airway epithelium 
none 


8.1 


Primary Trl rest 


0.0 


Small airway epithelium 
rNFalpha i (L-lbeta 


27.2 


CD45RA CD4 
lymphocyte act 


1.4 


Coronery artery SMC rest 


0.7 


CD45RO CD4 
lymphocyte act 


0.0 


Coronery artery SMC 
TNFalpha + IL-lbeta 


,.0 


CD8 lymphocyte act 


0.0 


Astrocytes rest 


5.2 


Secondary CD8 
lymphocyte rest 


0.0 


Astrocytes TNFalpha + 
IL-lbeta 


5.9 


Secondary CD8 
lymphocyte act 


0.0 


KU-812 (Basophil) rest 


0.0 


CD4 lymphocyte none 


0.0 


KU-812 (Basophil) 
PMA/ionomycin 


" o, 


2ry Thl/Th2/Trl_anti- 
CD95 CHI 1 


0.0 


CCD1106 

(Keratinocytes) none 


10.5 


LAK cells rest 


0.0 


CCD 1106 

(Keratinocytes) 

1 NFalplu - f L-lbeta 


23.0 


1 AK relk II -1 

' ./YIN. CCU:> \ I ^ Z, 


0.0 


Liver cirrhosis 


22.8 


LAK cells IL-2+IL-12 


0.1 


Lupus kidney 


22.5 1 


LAK cells IL-2+IFN 
gamma 


0.0 


NCI-H292 none 


66.0 


] \[< cdk 11-2 ■ JL-I 8 


0.0 


NCI-H292 IL-4 


81.8 


LAK cells 
PMA/ionomycin 


0.0 


NCI-H292 IL-9 


69.7 


NK Cells IL-2 rest 


0.0 


iNll-HzV/ 1 1,-1 j 


58.6 


Two Way MLR 3 day 1 


0.0 


iNCI-H/yz IrN gamma 


61.1 


Two Way MLR 5 day 


0.0 


nr/\EL- none 


0.0 J 


Two Way MLR 7 day 


0.0 


HPAEC TNF alpha + IL- 
1 beta 


0.0 


PBMC rest 


0.0 


Lung fibroblast none 


57.4 


PBMC PWM 


0.0 


Lung fibroblast TNF 
alpha + IL-1 beta 


45.7 


PBMC PH \-l 


0.0 


Lung fibroblast IL-4 


36.6 


Ramos (B cell) none 


0.0 


Lium fibroblast 11.-9 


33.9 


Ramos (B cell) 


0.0 


1 uns fibroblast IL-13 


1 7.2 



ionomycin 


! _ 




B lymphocytes PWM 


2.4 


Lung fibroblast IFN 
gamma 


34.9 


B lymphocytes CD40L 
and IL-4 


11.3 


Dermal fibroblast 
CCD1 070 rest 


28.5 


EOL-1 dbcAMP 


0.0 


Dermal fibroblast 
CCD 1 070 TNF alpha 


11.3 


FOT -1 rH-i^AlVTP 
wL-i uuc/\ivir 

PMA/ionomycin 


0.0 


Dermal fibroblast 
* ( 1)10 0 IL-I beta 


4.8 


Dendritic cells none 


0.0 


Dermal fibroblast IFN 
gamma 


0.3 


Dendritic cells LPS 


- 2;° 


Dermal fibroblast 1 1 ,-4 


0.0 


Dendritic cells anti- 
CD40_ 


0.3 


IBD Colitis 2 


6 9 


Monocytes rest 


36.3 


IBD Crohn's 


2.0 


Monocytes LPS 


0.0 


Colon 


36.6 


Macrophages rest 


0.0 


Lung 


14.3 


Macrophages LPS 


0 0 


Thymus 


HHhO 


HUVEC none 


"o'.o 


Kidney 


0.1 


I'l \ K curved 


o.o " 1 





CNS_neurodegeneration_vl.O Summary: Ag3550 This panel does not show differential 
expression of the CG5 9346-01 gene in Alzheimer's disease. However, this expression 
profile confirms the presence of this gene in the brain. Please see Panel 1 .4 for discussion 
of utility of this gene in the central nervous system. 



5 General_screening_panel_vl.4 Summary: Ag3550 Highest expression of the CG59346- 
01 gene is seen in the brain. Expression of this gene is seen at high levels in the cerebellum, 
cerebral cortex, and thalamus and at moderate levels in the amygdala, hippocampus, and 
thalamus. This CG59346-01 gene encodes ahomologue of Proline-rich synapse-associated 
protein- 1/cortactin binding protein 1 (ProSAPl/CortBPl). ProSAPl is PDZ-domain protein 
10 highly enriched in the postsynaptic density (PSD) and involved in in the assembly of the 
PSD during neuronal differentiation that may function with contactin, in the recruitment 
and activation of neural intracellular signaling pathways. Therefore, this gene may play a 
role in central nervous system disorders such as Alzheimer's disease, Parkinson's disease, 
epilepsy, multiple sclerosis, schizophrenia and depression. 

1 5 In addition, moderate levels of expression are seen in colon, gastric, renal, 

pancreatic, lung, ovarian, breast and prostate cancer cell lines. Thus, expression of this 

gene could be used to detect the presence of cancer. Furthermore, therapeutic modulation 
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of the expression or function of this gene may be effective in the treatment of these 
cancers, 

.Among tissues with metabolic function, this gene is expressed at moderate to low 
levels in pituitary, adipose, adrenal gland, pancreas, thyroid, and adult and fetal and liver. 
5 This widespread expression among these tissues suggests that this gene product may play a 
role in normal neuroendocrine and metabolic and that disregulated expression of this gene 
may contribute to neuroendocrine disorders or metabolic diseases, such as obesity and 
diabetes. 

In addition, this gene is expressed at higher levels in fetal lung and kidney 
1 0 (CTs=29) when compared to expression in adult lung and kidney (CTs=35-40). Thus, 

expression of this gene could be used to differentiate between the two sources of lung and 
kidney tissue. 

References: 

1 . Peles E, Nativ M, Lustig M, Grumet M, Schilling J, Martinez R, Plowman GD, 
1 5 Schlesslnger J. Identification of a novel contactin-associated transmembrane receptor with 

multiple domains implicated in protein-protein interactions. EMBO J 1997 Mar 
3;16(5):978-88. 

2. Boeckers TM, Kreutz MR, Winter C, Zuschratter W, Smalla KH, Sanmarti-Vila 
L, Wex H, Langnaese K, Bockmann J, Garner CC, Gundelfinger ED. (1999) Proline-rich 

20 synapse-associated protein- 1/cortactin binding protein 1 (ProSAPl/CortBPl) is a PDZ- 
domain protein highly enriched in the postsynaptic density. J Neurosci 1999 Aug 
1;19(15):6506-18. 

Panel 4D Summary: Ag3550 Highest expression of the CG59346-01 gene is seen in 
thymus (CT=27). In addition, significant levels of expression are seen in IL-4, IL-9, IL-13 
25 and IFN gamma activated-NCI-H292 mucoepidermoid cells as well as untreated NCI-H292 
cells. Moderate/low expression is also detected in IL-4, IL-9, IL-13 and IFN gamma 
activated lung fibroblasts, small airway epithelium (treated and untreated), and treated 
bronchial epithelium. The expression of this gene in cells derived from or within the lung 
suggests that this gene may be involved in normal conditions as well as pathological and 
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inflammatory lung disorders that include chronic obstructive pulmonary disease, asthma, 
allergy and emphysema. 

In addition, significant levels of expression are seen in treated and untreated dermal 
fibroblasts and keratinocytes, suggesting that modulation of the expression or function of 
this gene may also reduce symtptoms in inflammatory skin diseases such as psoriasis, 
atopic dermatitis, and ulcerative dermatitis. 

AP. CG57814-01 and CG57814-02: Basic 1 19 protein 

Expression of gene CG578 14-01 and varian CG578 14-02 was assessed using the 
primer-probe set Ag791, described in Table APA. 



Table APA . Probe Name Ag791 



[Primers Sequences 


Length 


Start 
Position 


SEQID 
NO: 


(Forward 
(Probe 


5 1 -aaatgtgatgaccaaggttctg-3 ' 


22 


1290 


523 


TET-5' -agcacacattatccagcgaaagcatg-3 ' - 
TAMRA 


26 


1319 


524 


iReverse 


5 ' -tgtcaaagaaacccttgttgtc-3 1 


22 


1368 


525 



Panel 1.2 Summary: Ag791 Expression of the CG57814-01 gene is low/undetectable in 
all samples on this panel (CTs>35). (Data not shown.) 

AQ. CG59327-01: MONOCARBOXYLATE TRANSPORTER 1 like protein 

Expression of gene CG59327-01 was assessed using the primer-probe set Ag3548, 
described in Table AQA. Results of the RTQ-PCR runs are shown in Tables AQB and 
AQC. 

Table AQA . Probe Name Ag3548 



Primers; 


Sequences 


Length 


Start 
Position 


SEQID 
NO: 


Forwarc 


5 ' -atttgcatacagcagctttgtc-3 1 


2: 


517 


526 


Probe 


TET-5 1 -ttcatctcccagaaatcgtcaatttg-3 ' - 

TAMRA 


26 


549 


527 


Reverse 


5 ' -accttcgtttgctccaataagt-3 ' 


22 


579 


528 



Table AQB . General_screening_panel_vl.4 
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i issue iN3rnc 


Rel. Exp.(%) 
AgjD'to, Jvun 
217048438 


Tissue Name 


Rel. Exp.(%) 
Ago34o, Kun 
217048438 


[Adipose 


0 0 


Kenai ca. 1 iv- 1 u 


3.6 


Melanoma* 

riSooo(/\). 1 


0.0 


Bladder 


0.0 


Melanoma* 

riSOoo^o^. 1 


0.0 


Gastric ca. (liver met.) 
JNCl-No/ 


0.0 


Melanoma* Ml 4 


0.0 


Uastnc ca. K . \ I ( ) 111 


0.0 


(Melanoma* 

IJLVJAIJVJ. V 1 


0.0 


Colon ca. SW-948 


0.0 


jMelanoma* SK- 
MEL-5 


0.0 


Colon ca. SW480 


1.3 


Squamous cell 
carcinoma SCC-4 


0.0 


Colon ca.* (SW480 
met) SW620 


0.0 


Testis Pool 


0.0 


Colon ca. HT29 




Prostate ca.* (bone 
met) PC -3 


0.0 


Colon ca. HCT-116 


0.0 


Prostate Pool 


0.0 


Colon ca. CaCo-2 


0.0 


Placenta 


0.0 jColon cancer tissue 


0.0 


Uterus Pool 


1.2 Colon ca. SWU 16 


0.0 


|Ovarian ca. 
[OVCAR-3 


0.0 (colon ca. Colo-205 


0.0 


lOvarian ca. SK- 
lOV-3 


0.0 jColon ca. SW-48 


0.0 


lOvarian ca. 

r ad a 


0.0 IColon Pool 


2.2 


lOvarian ca. 
OVCAR-5 


0.0 Small Intestine Pool 


0.0 


Ovarian ca. 
IGROV-1 


0.0 I Stomach Pool 


0.0 


Ovarian ca. 
OVCAR-8 


0.0 jBone Marrow Pool 


1 

0.0 | 


Ovary 


0.0 Fetal Heart 


0.0 


Breast ca. MCF-7 


0.0 


1 F-an Pool 


0.0 


Breast ca. MDA- 
MB-231 


0.0 


Lymph Node Pool 


0.0 


Breast ca. BT M l > 


' i.9 


Fetal Skeletal Muscle 


0.0 


Breast ca. 1 47D 


0.0 


Skeletal Muscle Pool 


0.0 


Breast ca. MDA-N 


0.0 


Spleen Pool 


0.0 


Breast F'ool 


0.0 jThymus Pool 


0.0 


Trachea 


0 0 jCNS cancer 

(glio/astro) U87 \ld 


0.0 


Lung 


2.4 


CNS cancer 

(ulio 'astro) U-l 18-MG 


6.5 
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Fetal Lung 


CNS cancer 
j(neuro;met) SK-N-AS 


0.0 


Lung ca. NCI-N417 


n A |CNS cancer (astro) SF- 
°-° 1539 


0.0 


Lung ca. LX- 1 


0.0 


CNS cancer (astro) 
SNB-75 


6.4 


Lung ca. NCI-H146 


0.0 


CNS cancer (glio) 
SNB-19 


— — — — — 
0.0 


Lung ca. SHP-77 


0.0 


CNS cancer (glio) SF- 

295 


0.0 


Lung ca. A549 


0.0 JBrain (Amygdala) Pool 


0.0 


Lung ca. NCI-H526 


0.0 jBrain (cerebellum) 


0.0 


Lung ca. NCI-H23 


0.0 [Brain (fetal) 


0.0 


Lung ca. NCI-H460 


A n jBrain (Hippocampus) 
00 JPool 


0.0 


Lung ca. HOP-62 


0.0 Ceiebral Cortex Pool 


0.0 


Lung ca.. JNCl-rljzz 


^ ^ jBrain (Substantia 
jnigra) Pool 


0.0 


Liver 
Fetal Fiver 


0.0 Brain (Thalamus) Pool 
0.0 Brain (whole) 


0.0 
0.0 


Liver ca. HepG2 


0.0 Spinal Cord Pool 


0.0 


[Kidney Pool 


0.0 Adrenal Gland 


0.0 


Fetal Kidnev 


3.4 Pituitary gland Pool 


0.0 


Renal ..i ~So-i» 


0.0 iSalivary Gland 


0.0 


Renal ca. A498 


100.0 Thyroid (female) 


0.0 


Renal ca. ACHN 


0.0 


Pancreatic ca. 
CAPAN2 


0.0 


Renal c.i. 1; 0-31 


0.0 


Pancreas Pool 


0.0 



Table AOC . Panel 4D 



! 

Tissue Name 

j 


Rel. Exp.(%) 
Ag3548, Run 
166453848 


Tissue Name 


Rel. Exp.(%) 
Ag3548, Run 
166453848 


Secondary Th 1 act 


0.0 


HUVEC IL-lbeta 


0.0 


Secondary Th2 act 


0.0 


HUVEC I FN gamma 


0.0 


Secondary Trl act 


0.0 


III VECTNF alpha H 
■11- N gamma 


0.0 


Secondary Thl rest 


0.0 


HUVEC TNF alpha + 
II 1 


0.0 


Second in fh2 ie>i 


0.0 


HUVE< II -1 1 


0.5 


Secondeiry Trl rest 


0.0 


Fung Microvascular EC 
none 


0.0 


Primary Th 1 act 


0.0 


;Lung Microvascular EC 


0.0 
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jTNFalpha + IL-lbeta | 


Primary Th2 act 


^ q Microvascular Dermal 

1 C none ' ! 


Primary Trl act 


0.0 


Microsvasular Dermal 
EC TNFalpha + IL-lbeta 


0.0 


Primary Thl rest 


0.0 


Bronchial epithelium 
TNFalpha i ILlbeta 


0.2 


Primary Th2 rest 


0.0 


Small airway epithelium 
none 


1.2 


Primary Trl rest 


0.0 

^ 


Small airway epithelium 
TNFalpha + IL-lbeta 


0.0 


CD45RA CD4 
lymphocyte act 


0.0 


Coronery artery SMC rest 


0.0 


CD45ROCD4 
lymphocyte act 


0.0 


Coronery artery SMC 
TNFalpha + IL-lbeta 


0.0 


CDS lymphocyte act 


0.0 


Astrocytes rest 


0.3 


Secondary CD8 
lymphocyte rest 


0.0 


Astrocytes TNFalpha + 
IL-lbeta 


0.0 


Secondary CD 8 
lymphocyte act 


0.0 


KU-812 (Basophil) rest 


0.0 


CD4 lymphocyte none 


0.0 


KU-812 (Basophil) 
PMA/ionomycin 


0.3 


2ry Thl/Th2/Trl anti- 

( iws cm i 


0.0 


CCD 1106 

(Keratinocytes) none 


0.0 


LAK cells rest 


0.0 


CCD1106 
(Keratinocytes) 
TNFalpha + IL-lbeta 


0.0 


LAK celts IL-i 


0.0 Liver cirrhosis 


6.5 


1 AK relk II - 0 +II - 1 ~> 


0.0 


Lupus kidney 


0.7 


LAK cells IL-2+IFN 
gamma 


0.0 


NCI-H292 none 


1.8 


LAK cells IL-2+ II -1 8 


0.0 


NCI-H292 IL-4 


1.7 


LAK cells 
PMA/ionomycin 


0.0 


NCI-H292 IL-9 


0.2 


NK Cells IL-2 rest 


0.0 


NCI-H292 IL-13 


0.0 


I wo V. ,o- Ml iv 1 da\ 


0.0 


N( I-H292 1FN gamma 


0.0 


Two Way MLR 5 day 


0.0 |hPAEC none 


0.5 


Two Way MLR 7 day 


1 IPAECTNF alpha + IL- 
00 1 beta 


0.0 


PBMC rest 


0.0 


Lung fibroblast none 


0.0 


PBMC PWM 


0.0 


Lung fibroblast TNF 
alpha + IL-1 beta 


0.0 


PBMC PHA-L 


O.D 


Lung fibroblast IL-4 


0.0 


Ramos B cell) none 


'Ml 


Lung fibroblast IL- 1 ) 


0.0 


R; mi > 'B cell) 


0.0 


Lung fibroblast IL-13 


0.0 
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ionomycin 




» ■ „ 




B lymphocytes PWM 


0.0 


jLung fibroblast IFN 
gamma 


0.0 


B lymphocytes CD40L 

an H TT A 
allu IJu-^t- 


0.0 


; Dermal fibroblast 
iLLUiu/u rest 


0.0 


EOL-1 dbcAMP 


0.0 


Dermal fibroblast 
jLCUlu/u liNr alpna 


0.0 


FOT 1 .Hi,- \ \ IP 

PMA/ionomycin 


0.0 


iDermal fibroblast 
ICCD1070 IL-1 beta 


0.0 


Dendriti c cells none 


0.0 


Dermal iihroblast IFN 

jgamma 


0.0 


Dendritic cells LPS 


0.0 


Dermal fibroblast II -4 


0.0 


Dendritic cells anti- 
CD40 


0.0 


[iBD Colitis 2 


2.0 


Monocytes rest 


0.0 


mi ) ( ioh,i\ 


3.5 


Monocytes LPS 


0.0 


JColon 


1.0 


Macrophages rest 


0.0 


|Lung 


2.3 


Macrophages LPS 


0.0 


iThymus 


6.7 


HUVEC none 


0.0 


iKidney 


0.0 


HUVEC starved 


0.4 ! 





CNSneurodegenerationvl.O Summary: Ag3548 Expression of the CG59327-01 gene 
is low/undetectable in all the samples on the panel (CTs>35). (Data not shown.) 



General_screening_panel_vl.4 Summary: Ag3548 Significant expression of the 
CG59327-01 gene is restricted to a sample derived from a kidney cancer cell line 
(CT=33.34). Thus, expression of this gene could be used to differentiate between this 
sample and other samples on this panel and as a marker to detect the presence of kidney 
cancer. Furthermore, therapeutic modulation of the expression or function of this gene may 
be effective in the treatment of kidney cancer. 

Panel 4D Summary: Ag3548 Significant expression of the CG59327-01 gene is restricted 
to a samples derived from untreated microvascular dermal endothelial cells (CT=30.3). 
Thus, expression of this gene could be used as a marker of these cells. 

AR. CG59494-01: Myelin P2 

Expression of gene CG59494-01, which represents a full length physical clone, 
was assessed using the primer-probe set Ag3206, described in Table ARA. Results of the 
RTQ-PCR runs are shown in Tables ARB and ARC. 



737 



Table ARA . Probe Name Ag3206 



Primers 


Sequences 


Length 


Start ! SEQ ID 
Position j NO: 


Forward 


5 ' -agtgttgatgggaaaatgatga-3 ' 


22 


160 455 


Probe 


TET-5 ' -ccataagaacagaaagttctttccaggaca- 
3 1 -TAMRA 


30 


182 j 758 


Reverse 


5 ' -ccccagcttgaaggagatc-3 ' 


19 


216 759 



Table ARB . Panel 1.3D 



Tissue Name 


Rpl Fvn 
Ag3206, Run 
165527079 


Tissue Name 


rvei. rLiXp.( /o ) 
Ag3206, Run 
165527079 


Liver adenocarcinoma 


0.0 


Kidney (fetal) 


10.5 


Pancreas 


oo 


IVCIlal La. / oD-U 




Pancreatic ca. CAP AN 
2 


17.8 


Renal ca. A498 


15.6 


[Adrenal gland 


0.0 


Renal ca. RXF 393 


0.0 


jThyroid 


0.0 


Renal ca. ACHN 


14.8 


jSalivary gland 


0.0 


Renal ca. UO-31 


0.0 


Pituitary gland 


0.0 ~~ 


Renal ca. TK-fo 


0.0 


BraTnlfeteF) 


""o.o 


Liver 


0.0 


Brain (whok ■ 


" 9~9 


l.i\er (fetal) 




Brain (amygdala) 


0.0 


Liver ca. 

(hepatoblast) HepCi2 


0.0 


Brain (cerebellum) 


o.o 


1 mi- 


o";o " 


Brain (hippocampus) 


o.o 


i ung i fetal) 


" " o.o 


Brain (substantia nigra) 


0.0 


Lung ca. (small cell) 
LX-1 


4.5 


Brain (thalamus) 


0.0 


Lung ca. (small cell) 
NCI-H69 





Cerebral Cortex 


0.0 


Lung ca. (s.cell var.) 
SHP-77 


18.0 1 


Spinal cord 


33.4 


Lung ca. (large 
cell)NCI-H460 


41.2 


glio/astro U87-MG 


0.0 


Lung ca. (non-sm. 
cell) A549 


0.0 


glio/astro U-118-MG 


0.0 


Lung ca. (non-s.cell) 
NCI-H23 


0.0 


astrocytoma SW1783 


0.0 


Lung ca. (non-s.cell) 
HOP-62 


0.0 


neuro*; met SK-N-AS \ 


0.0 


Lung ca. (non-s.cl) 
NCI-H522 


0.0 
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astrocytoma SF-539 


0.0 


Lung ca. (squam.) 
SW 900 


0.0 




astrocytoma SNB-75 


...7 


Lung ca. (squam.) 
NCI-H596 


0.0 


glioma SNB-19 


0.0 iMammary gland 


14.4 


glioma U251 


q q Breast ca.* (pl.ef) 
[MCF-7 


0.0 


glioma SF-295 


q q fereast ca.* (pl.ef) 
JMDA-MB-23 1 


0.0 


Heart (fetal) 


0 q jBreast ca.* (pl.ef) 
[T47D 


0.0 


|Heart 


15.5 Breast ca. BT-549 


0.0 


Skeletal muscle (fetal) 


0.0 ! Breast ca. MDA-N 


0.0 


Skeletal muscle 


0.0 jOvary 


0.0 


Bone marrow 


n n Ovarian ca. 

JOVCAR-3 


14.0 


Thymus 


„ ft Ovarian ca. 

OVCAR-4 


0.0 


Spleen 


n n lOvarian ca. 

IOVCAR-5 


0.0 


Lymph node 


n „ jOvarian ca. 

JOVCAR-8 


0.0 


Colorectal 


Q Q lOvarian ca. IGROV- 


11.6 


Stomach 


n n Ovarian ca.* 

[(ascites) SK-OV-3 


0.0 


Small intestine 


0.0 


Uterus 


0.0 


Colon ca. SW480 


0.0 


Placenta 


0.0 


Colon ca.* 
SW620(SW480 met) 


0.0 


Prostate 


0.0 


Colon ca. HT29 


0.0 


Prostate ca.* (bone 
met)PC-3 


100.0 


Colon ca.HCT-1 16 


0.0 


Testis 


27.5 


Colon ca. CaCo-2 


42.0 


Melanoma 
Hs688(A).T 


0.0 


Colon ca. 
tissue(OD03866) 


0.0 


Melanoma* (met) 
Hs688(B).T 


0.0 


Colon ca. HCC-2998 


0.0 


Melanoma UACC- 
62 


0.0 


( iastric ca. | liver met) 
NCI-N87 


0.0 


Melanoma Ml 4 


0.0 


Bladder 


0.0 


Melanoma LOX 
IMVI 


0.0 


Trachea 


0.0 


Melanoma* (met) 
SK-MEL-5 


0.0 
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Table ARC . Panel 4D 



Tissue Name 


Rel. Exp.(%) 
Ag3206, Run 
164531735 


Tissue Name 


Rel. Exp.(%) 
Ag3206, Run 
164531735 


Secondary ih 1 act 


0.0 


HUVEC IL-lbeta 


0.0 


m <-i r 1 h2 act 


11.9 


HUVEC I FN gamma 


0.0 


Secondziry Trl act 


0.0 


HUVEC TNF alpha ! - 
IFN gamma 


12.6 


(Secondary Thl rest 


11.9 


HUVEC TNF alpha + 
IL4 


15.9 


Secondary 1 h2 rest 


0 0 


III \ 1 ( II - ' 1 


0.0 


jSecondary Trl rest 


q q ||Lung Microvascular EC 
jnone 


75.8 


Primary Thl act 


« „ (Lung Microvascular EC 
.TNFalpha l IL-lbeta 


100.0 


Primary Th2 act 


q q (Microvascular Dermal 
FC none 


72.2 

~ — — — — 

0.0 


Primary Trl act 


„ » iMicrosvasular Dermal 

EC I NI alpha IL-lbeta 


Primary Th 1 rest 


„ _ iBronchial epithelium 
TNFalpha + IL1 beta 


0.0 


Primary Th2 rest 


q Q Small airway epithelium 
none 


0.0 


Primary Trl rest 


„ n Small airway epithelium 
UN 1- alpha + IL-lbeta 


0.0 


CD45RA CD4 
lymphocyte act 


0.0 jCoronery artery SMC rest 


0.0 


CD45RO CD4 
lymphocyte act 


n „ Coronery artery SMC 
TNFalpha + IL-lbeta 


0.0 


CDS lymphocyte act 


0.0 Astrocytes rest 


0.0 


Secondary CD8 
[lymphocyte rest 


n n lAstrocytes TNFalpha + 
IL-lbeta 


0.0 


'Secondary CD8 
[lymphocyte act 


0.0 KU-812 (Basophil) rest 


0.0 


CD4 lymphocyte none 


Q0 kll -8 12 (Basophil) 
:PMA/ionomycin 


0.0 


2ry ThlATh2nrl anti- 
1CD95 CH11 


00 jCCD1106 

l(Keratinocytes) none 


0.0 


LAK cells rest 


0.0 


CCD 1106 
(Keratinocytes) 
TNFalpha i IL-lbeta 


0.0 


LAK cells IL-2 


0.0 


Liver cirrhosis 


29.7 
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LAK cells II -2 1 11 -12 


0.0 


Lupus kidney 


0 0 


L AK cells II -2+IFN 
gamma 


0.0 (NCI-H292 none 


97.3 


LAK cells IL-2+ IL-1 8 


0.0 


NCI-H292 IL-4 


0.0 


LAK cells 
PMA/ionomycin 


0.0 


NCI-H292 IL-9 


43.8 


NK Cells IL-2 rest 


7.8 


NCI-H292 IL-1 3 


24.0 


Two Way MLR 3 day 


0.0 


NCI-H292 I FN gamma 


12.7 


Two Wa\ Ml R 5 da\ 


Ml. 


HPAEC none 


14.3 


Two Way MLR 7 day 


0.0 


HPAEC TNF alpha + IL- 
1 beta 


33.2 


PBMC rest 


0.0 


Lung fibroblast none 


0.0 


PBMC PWM 


0 0 fLung fibroblast TNF 
alpha + IL-1 beta 


0.0 


DO A 1 13 TT A T 

rtilvlL, rHA-JL 


H .- 


Lung fibroblast IL-4 


0.0 


Ramos i B cell) n. >ne 


0.0 


Lung fibroblast 1L- 0 


16.2 


Ramos (B cell) 
ionomycin 


0.0 


Lung fibroblast IL-1 3 


15.9 


]B lymphocytes PWM 


0.0 


Lung fibroblast IFN 
gamma 


0.0 


B lymphocytes CD40L 
and IL-4 


0.0 


Dermal fibroblast 
CCD 1070 rest 


0.0 


EOL-1 dbcAMP 


0.0 


Dermal fibroblast 
CCD 1070 TNF alpha 


0.0 


xiUL-1 dbcAMr 
]'\ | \ i(in< -ni v cm 


0.0 


Dermal fibroblast 
CCD 1070 IL-1 beta 


0.0 


Dendritic cells none 


0.0 


Dermal fibroblast IFN 
gamma 


0.0 


Dendritic cells LPS 


0.0 


Dermal fibroblast IL-4 


15.0 


Dendritic cells anti- 
|CD40 


0.0 


IBD Colitis 2 


27 4 


Monocytes rest 


0.0 


1BD Crohn's 


0.0 


Monocytes LPS 


0.0 


Colon 


6.7 i 


Macrophages rest 


0.0 


Lung 


0.0 


Macrophages LPS 


o'o 


Thymus 


0.0 


III \ 1 l ,!• K 


27.7 


Kidney 


b.o 


HUV1 ( ' starved 


20 -° 


z 



Panel 1.3D Summary: Ag3206 Expression of the CG59494-01 gene is restricted to a 
sample derived from a prostate cancer cell line (CT=34.9). Thus, expression of this gene 
could be used to differentiate between this sample and other samples on this panel and as a 
marker to detect the presence of prostate cancer. Furthermore, therapeutic modulation of 
the expression or function of this gene may be effective in the treatment of prostate cancer. 
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Panel 4D Summary: Ag3206 Expression of the CG59494-01 gene is primarily restricted 
to a cluster of samples derived from microvasculature of the lung and the dermis 
suggesting a role for this gene in the maintenance of the integrity of the microvasculature. 
Therefore, therapeutics designed for this putative protein could be beneficial for the 
treatment of diseases associated with damaged microvasculature including heart diseases or 
inflammatory diseases, such as psoriasis, asthma, and chronic obstructive pulmonary 
diseases. 

AS. CG59432-01 and CG59432-02: Chloride Channel 

Expression of gene CG59432-01 and CG59432-02 was assessed using the primer- 
probe set Ag5938, described in Table ASA. Results of the RTQ-PCR runs are shown in 
Tables ASB and ASC. Please note that CG59432-02 represents a full-length physical clone 
of CG59432-01 gene, validating the prediction of the gene sequence. 



Table ASA . Probe Name Ag5938 



Primers 


Sequences 


Length 


Start Position 


SEQ n> 
NO: 


Forward 


5' -ttgtgtcagtctataccattaa-3 ' 


22 


626 


529 


Probe 


TET-5 ' -accagcttggcctctgtccagt-3 ' -TAMRA 


22 


658 


530 


Reverse 


5 1 -tcctggagttcagagtatatct-3 ' 


22 


710 


531 



Table ASB . General_screening_panel_vl .5 



Tissue Name 


Rel. Exp.(%) 
Ag5938, Run 
248102142 


Tissue Name 


Rel. Exp.(%) 
Ag5938, Run 
248102142 


Adipose 


6.3 


Renal ca. TK-10 


2.7 


Melanoma* 
Hs688(A).T 


0.0 


Bladder 


17.6 


Melanoma* 
Hs688(B).T 


. 0.0 


Gastric ca. (liver met.) 
NCI-N87 


100.0 


Melanoma* Ml 4 


0.0 


(iastric ca. KATO III 


8.2 


Melanoma* 
LOXIMV1 


0.0 


Colon ca. SW-948 


0.0 


Melanoma* SK- 
MEL-5 


0.0 


Colon ca. SW480 


7.5 


Squamous cell 
carcinoma SCC-4 


21.5 


Colon ca.* (SW480 
met) SW620 


0.6 


Testis Pool 


21.0 


Colon ca. HT29 


7.1 
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Prostate ca.* (bone 
met) PC-3 


0.0 


Colon ca. HCT-116 


j 

11.0 


Prostate Pool 


6.9 


Colon ca. CaCo-2 


25.2 


Placenta 


0.0 


Colon cancer tissue 


4.7 


Uterus Pool 


1.5 


Colon ca. SW1116 


2.7 


Ovciriiin C3.. 
OVCAR-3 


20.2 


Colon ca. Colo-205 


0.0 


OV-3 


0.0 


Colon ca. SW-48 


0.0 


OVCAR-4 


0.0 


Colon Pool 


5.7 


Ovarian ca. 
OVCAR-5 


24.7 


Small Intestine Pool 


7.6 


Ovarian ca. 
IGROV-1 


0.0 


Stomach Pool 


3.2 


Ovarian ca. 
OVCAR-8 


2.8 


Bone Marrow Pool 


6.6 


Ovary 


0.0 


Fetal Heart 


1.0 


Breast ca. MCF-7 


4.2 


Heart Pool 


4.3 


Breast ca. MDA- 
MB-231 


0.0 


Lymph Node Pool 


3.8 


Breast ca. BT 549 


0.0 


Fetal Skeletal Muscle 


59 9 


Breast ca. T47D 


0.0 


Skeletal Muscle Pool 


93.3 


Breast ca. MDA-N 


0.0 


Cnlppn Pnnl 
opiccii rovi 




lire M Fool 


10.4 


Thymus Pool 


3.6 


Trachea 


15.1 


CNS cancer 
(glio/astro) U87-MG 


0.0 


Lung 


0.6 


CNS cancer 
(glio/astro) U-118-MG 


3.1 


Fetal Lung 


31.6 


CNS cancer 
(neuro;met) SK-N-AS 


0.0 


Lung ca.. NCI-N417 


0.0 


CNS cancer (astro) SF- 
539 


0.0 


Lung ca, LX-1 


0.0 


CNS cancer (astro) 
SNB-75 


0.0 


Lungca. NCI-H146 


3.2 


CNS cancer (glio) 
SNB-19 


3.6 


Lung ca. SHP-77 


0.0 


CNS cancer (glio) SF- 

295 


0.0 


Lung ca. A549 


0.0 


Brain (Amygdala) Pool 


47.3 


Lung ca. NCI-H526 


32.8 


Brain (cerebellum) 
Brain (fetal) 


15.3 


Lung ca. NCI-H23 


0.0 j 


1.8 


Lung ca. NCI-H460 


n n iBrain (Hippocampus) 1 

! p ° o1 1 


15.8 
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Lung ca. HOP-62 


0.0 


Cerebral Cortex Pool 


27.0 


Lung ca. NCI-H522 


0.0 


Brain (Substantia 
nigra) Pool 


25.2 


Liver 


0.0 


Brain (Thalamus) Pool 


29 1 


Fetal Liver 


7.0 


Brain (whole) 


9.5 


Liver ca. HepG2 


0.0 


Spinal Cord Pool 


1 1 0 


Kidney Pool 


6.8 


Adrenal Gland 


8.4 


Fetal Kidney 


17.1 


Pituitar\ _ Unci Pcn.l 


6.3 


Renal ca. 786-0 


0.0 


Salivary Gland 


4.8 


Renal ca. A498 


0.0 


Thyroid (female) 


0.0 


Renal ca. ACHN 


0.0 


Pancreatic ca. 
CAPAN2 


1.4 


Renal ca. UO-31 


0.0 


Pancreas Pool 


4.5 



Table ASC . Panel 5 Islet 



Tissue Name 


Rel. Exp.(%) 
Ag5938,Run 
248045753 


Tissue Name 


Rel. Exp.(%) 
Ag5938, Run 
24S045753 


97457_Patient- 

02go_adipose 


0.0 


94709_Donor 2 AM - A_adipose 


0.0 


97476_Patient- 
07sk_skeletal muscle 


0.0 


94710_Donor 2 AM - B_adipose 


0.0 


97477_Patient- 
07ut_uterus 


0.0 


9471 l_Donor 2 AM - C_adipose 


0.0 


97478_Patient- 
07pl_placenta 


u 


94712_Donor 2 AD - A_adipose 


0.0 


99 167_Bayer Patient 


0.0 


94713_Donor 2 AD - B_adipose 


0.0 


97482_Patient- 
08ut_uterus 


0.0 


94714_Donor 2 AD - C_adipose 


0.0 


97483_Patient- 
08pl_placenta 


0.0 


94742_Donor3U- 

A Mesenchymal Stem Cells 


0.0 


97486_Patient- 
09sk_skeletal muscle 


0.0 


94743_Donor 3 U - 

B Mesenchymal Stem Cells 


0.0 


97487_Patient- 
09ut_uterus 


0.0 


94730_Donor 3 AM - A_adipose 


0.7 


97488_Patient- 
09pl_placenta 


0.5 


9473 I Donor 3 AM - B_adipose 


0.0 


97492_Patient- 
1 Oututerus 


0.0 


94732_Donor 3 AM - C_adipose 


0.0 


97493_Patient- 
1 0pl_placenta 


0.4 


94733_Donor 3 AD - A_adipose 


0.0 


97495_Patient- 
1 lgo_adipose 


1.0 




94734_Donor 3 AD - B_adipose 


0.0 
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97496 Patient- 

1 lskskeletal muscle 


2.4 


94735_Donor 3 AD - C_adipose 


0.0 


97497 Patient- 
1 lut_uterus 


0.0 77138_Liver_HepG2untreated 


0.0 


97498 Patient- 
1 lpl_placenta 


0.0 


73556_Heart_Cardiac stromal 
cells (primary) 


0.0 


07^HA Patient 

y / D\j\j i aiicm- 
12go_adipose 


0.7 


81735_Small Intestine 


100.0 


97501_Patient- 

1 2sk skdet&l muscle 


6.8 


72409_Kidney_Proximal 
Convoluted Tubule 


0.0 


97502_Patient- 
12ut uterus . 


0.0 


82685_Small 
intestine Duodenum 


4.6 


97503_Patient- 
12pljplacenta 


0.0 


9065 0_Adrenal_Adrenocortical 
adenoma 


0.0 


94721_Donor2U- 
A_Mesenchymal 
Stem Cells 


0.0 


r ~ — 

72410_Kidney_HRCE 


0.0 


94722_Donor2U- 
B_Mesenchymal 
Stem Cells 


0.0 


7241 l_Kidney_HRE 


0.0 


94723_Donor2U- 
C_Mesenchymal 
Stem Cells 


g ^ 731 39_Uterus_Uterine smooth 
smuscle cells 


0.0 



General_screening_panel_vl.5 Summary: Ag5938 Highest expression of the CG59432- 
01 gene is seen in a gastric cancer cell line (CTK32.5). Thus, expression of this gene could 
be used to differentiate between this sample and other samples on this panel. In addition, 
low expression of this gene is seen in colon cancer CaCo-2, lung cancer NCI-H526, 
5 ovarian cancer OVCAR-5, and squamous cell carcinoma SCC-4 cell lines. Therefore, 

therapeutic modulation of the activity of this gene or its protein product, through the use of 
small molecule drugs, protein therapeutics or antibodies, might be beneficial in the 
treatment of these cancers. 

Significant expression is also detected in fetal skeletal muscle and adult skeletal 
10 muscle (CT=32.5). At least 50 disease-causing mutations in the skeletal muscle voltage- 
gated chloride channel gene (CLCN 1 ), almost all of which originate from Caucasian 
families, have been identified. Therefore, therapeutic modulation of this gene product, a 
chloride channel homolog, may be a treatment for myotonia congenita and other muscle 
channel opathies. 
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In addition, this gene is expressed at low levels in most regions of the central 
nervous system examined, including amygdala, substantia nigra, thalamus, and cerebral 
cortex. Therefore, this gene may play a role in central nervous system disorders such as 
Alzheimer's disease, Parkinson's disease, epilepsy, multiple sclerosis, schizophrenia and 
5 depression. 

References: 

1 . Sasaki R, Ito N, Shimamura M, Murakami T, Kuzuhara S, Uchino M, Uyama E. 
A novel CLCN1 mutation: P480T in a Japanese family with Thomsen's myotonia 
congenita. Muscle Nerve. 2001 Mar;24(3):357-63. 

Panel 5 Islet Summary: Ag5938 Expression of the CG59432-01 is restricted to a sample 
from small intestine (CT=3 1 .6). Thus, expression of this gene could be used to differentiate 
between this sample and other samples on this panel and as a marker for this tissue. 

AT. CG59383-01: D6MM5E 

Expression of gene CG59383-01 was assessed using the primer-probe set Ag3427, 
described in Table ATA. Results of the RTQ-PCR runs are shown in Tables ATB, ATC 
and ATD. 



Table ATA . Probe Name Ag3427 



Primers 


Sequences 


Length 


Start 
Position 


SEQID 
NO: 


Forward 


5 ' -cagtggaacagaccaagaaca-3 ' 




784 


532 


Probe 


TET-5 * -tctttcttcacagtgtttcagcaaca-3 ' - 
TAMRA 




817 


533 


Reverse 


5 ' -ggattatctctgggtctggaa-3 ' 


2, 


844 


534 



Table ATB . CNS_neurodegeneration_vl.O 



Tissue Name 


Rel. Exp.(%) Ag3427, 
Run 210351187 


Tissue Name 


Rel. Exp.(%) Ag3427, 
Run 210351187 


AD 1 Hippo 


11.4 


Control (Path) 3 
Temporal Ctx 


2.0 


AD 2 Hippo 


50.3 


Control (Path) 4 
Temporal Ctx 


23.3 


AD 3 Hippo 


10.2 


AD 1 Occipital Ctx 


9.8 
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AD 4 Hippo 


17.6 


AD 2 Occipital Ctx 
(Missing) 


0.0 


AD 5 Hippo 


59.5 


AD 3 Occipital Ctx 


0.0 


AD 6 Hippo 


42.9 


AD 4 Occipital Ctx 


17.0 


Control 2 Hippo 


21.2 


AD 5 Occipital Ctx 


25.3 


Control 4 Hippo 


6.1 


AD 6 Occipital Ctx 


29.9 


control (ratn ) 5 
Hippo 


15.1 


Control 1 Occipital 
Ctx 


3.0 


AD 1 Temporal 
Ctx 


22.7 


Control 2 Occipital 
Ctx 


32.3 


AD 2 Temporal 
Ctx 


45.1 


Control 3 Occipital 
Ctx 


26.2 


AD 3 Temporal 
Ctx 


6.0 


Control 4 Occipital 
Ctx 


8.4 


AD 4 Temporal 

Ctx 


39.0 


Control (Path) 1 
Occipital Ctx 


84.1 


AD 5 Inf Temporal 
Ctx 


100.0 


Control (Path) 2 
Occipital Ctx 


11.7 


AD 5 S up 
Temporal Ctx 


53.6 


Control (Path) 3 
Occipital Ctx 


0.9 


AD 6 Inf Temporal 
Ctx 


57.0 


Control (Path) 4 
Occipital Ctx 


10.3 


AD 6 Sup 
Temporal Ctx 


58.2 


Control 1 Parietal 
Ctx 


6.4 


Control 1 
Temporal Ctx 


18.8 


Control 2 Parietal 
Ctx 


48.6 


Control 2 
Temporal Ctx 


51.1 


Control 3 Parietal 
Ctx 




38.7 


Control 3 
Temporal Ctx 


16.5 


Control (Path) 1 
Parietal Ctx 




54.7 


Control 3 
Temporal Ctx 


5.5 


Control (Path) 2 
Parietal Ctx 


8.4 


Control (Path) 1 
Temporal Ctx 


82.9 


Control (Path) 3 
Parietal Ctx 


1.4 


Control (Path) 2 
Temporal Ctx 


33.0 


Control (Path) 4 
Parietal Ctx 


16.2 


Table ATC. General screenine_panel vl.4 


Tiss ue Name 


Rel. Exp.(%) 
Ag3427, Run 
216821480 


Tissue Name 


Rel. Exp.(%) 
Ag3427, Run 
216821480 


Adipose 


0.5 


Renal ca. TK- 10 


8.4 


Melanoma* 
Hs688(A).T ■ 


0.0 


Bladder 


1.7 
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Melanoma* 
Hs688(B).T 


0.1 


Gastric ca. (liver met.) 


0.0 


Nlelanonia* iM14 




oasxric ca. kja iu in 




Melanoma* 

T CYYTMVT 


0.0 


Colon ca. SW-948 


0.0 


IvicioilL/Iila OPv - 

MEL-5 


0.2 


Colon ca. SW480 


0.4 


Squamous cell 
carcinoma SCC-4 


14.5 


Colon ca.* (SW480 
met) SW620 


0.1 


Testis F'ool 


10.7 


Colon ca. HT29 


0.0 


Prostate ca.* (bone 
met) PC-3 


0.0 


Colon ca. HCT-116 


100.0 


Prostate Pool 


0.0 


Colon ca. CaCo-2 


0.1 


Placenta 


0.2 


Colon cancer tissue 


1.0 


Uterus Pool 


0.0 


Colon ca. SW1116 


0.0 


Ovarian ca. 
OVCAR-3 


2.9 


Colon ca. Colo-205 


0.0 


Ovarian ca. SK- 
OV-3 


50.7 


Colon ca. SW-48 


0.0 


Ovarian ca. 
OVCAR-4 


2.1 


Colon Pool 


0.4 


Ovarian ca. 
OVCAR-5 


1.1 


Small Intestine Pool 


0.3 


Ovarian ca. 
IGROV-1 


9.7 


Stomach Pool 


0.9 


Ovarian ca. 
OVCAR-8 


13.7 


Bone Marrow Pool 


0.4 


Ovary 


0.2 


Fetal Heart 


0.0 


Breast ca. MCF-7 


0.1 


Heart Pool 


0.0 


Breast ca. MDA- 
MB-23 1 


0.0 


Lymph Node Pool 


0.4 


Breast ca. BT 549 


0.0 


Fetal Skeletal Muscle 


0.2 


Breast ca. T47D 


2.1 


Skeletal Muscle Pool 


0.0 


■Breast ca. jvilj/y-in 




Spleen Pool 


0.2 


Breast Pool 


1.7 


Thymus Pool 


1.4 


Trachea 


1.7 


CNS cancer 
(glio/astro) U87-MG 


0.0 


Lung 


0.0 


CNS cancer 
(glio/astro) U-118-MG 


0.4 


Fetal Lung 


0.4 


CNS cancer 
(neuro;met) SK-N-AS 


0.1 


Lung ca. NCI-N417 


0.0 


CNS cancer (astro) SF- 
539 


0.0 


Lung ca. LX-1 


0.3 


CNS cancer (astro) 
SNB-75 


0.8 
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Lung ca. NCI-H146 


0.0 


CNS cancer (glio) 
SNB-19 


13.9 




Lung ca. SHP-77 


0.0 


CNS cancer (glio) SF- 
295 


0.0 




Lung ca. A549 


0.0 


Brain (Amygdala) Pool 


1.7 




Lung ca. NCI-H526 


0.0 


Brain (cerebellum) 


0.9 




Lungca. NCI-H23 


0.2 


Brain (fetal) 


0.1 




Lung ca. NC1-H460 


0.0 


Brain (Hippocampus) 
Pool 


2.2 




Lung ca. HOP-62 


0.5 


Cerebral Cortex Pool 






Lung ca. NCI-H522 


0.1 


Brain (Substantia 
nigra) Pool 


1 0 


o 


Liver 


0.0 


Brain (Thalamus) Pool 


2.9 




Fetal Liver 


0.1 


Brain (whole) 


1.7 




Liver ca. HepG2 


0.0 


Spinal Cord Pool 


0.2 


1 


Kidney Pool 




0.2 


Adrenal Gland 


04 




! ■ i.-! Kidney 




0.8 


Pituitar gland Poi il 


1.3 




Renal ca. 786-0 




0.1 


Salivary Gland 


0.6 


s 


Renal ca. A498 




0.1 


Thyroid (female) 


1.9 




Renal ca. ACHN 


0.0 


Pancreatic ca. 
CAPAN2 


0.1 


o 


Renal ca. 1 ! < >-3 I 


0.5 


Pancreas Pool 


2.3 


fli 


Table ATD. Panel 4D 












Tissue Name 


Rel. Exp.(%) 
Ag3427, Run 
166396769 


Tissue Name 


Rel. Exp.(%) 
Ag3427, Run 
166396769 




Secondary Thl act 


1.4 


HUVEC IL-lbeta 


0.0 




Secondary Th2 act 


0.0 


HUVEC I FN gamma 


0.0 




Secondary Trl act 


0.8 


HUVEC TNF alpha + 
I FN gamma 


0.0 




Secondary Thl rest 


1.0 


HUVEC TNF alpha + 
IL4 


0.0 




Secondary Th2 rest 


0.0 


HUVEC IL-13 


0.0 




Secondary Trl rest 


2.9 


Lung Microvascular EC 
none 


0.0 




Primary Thl act 


0.0 


Lung Microvascular EC 
TNFalpha + IL-lbeta 


0.0 




Primary Th2 act 


0.0 


Microvascular Dermal 
EC none 


0.0 




Primary Trl act 


4.9 


Microsvasular Dermal 
EC TNFalpha + IL-lbeta 


0.0 




Primary Th 1 rest 


2.5 


Bronchial epithelium 


7.9 
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1 ____ _ 


TNFalpha + ILlbeta 




\ 

Primary Th2 rest 


2.0 


none 


0.0 


Primary Trl rest 


0.6 


Small airway epithelium 
TNFalpha + IL-lbeta 


0.0 


CD45RA CD4 
lymphocyte act 


0.0 


Coronery artery SMC rest 


0.0 


CD45RO CD4 
lymphocyte act 


0.9 


Coronery artery SMC 
TNFalpha + IL-lbeta 


0.0 


CD8 lymphocyte act 


0.0 


Astrocytes rest 


5.0 


Secondary CD8 
lymphocyte rest 


1.0 


Astrocytes TNFalpha + 
IL-lbeta 


2.4 


Secondary CD8 
lymphocyte act 


0.8 


KU-812 (Basophil) rest 


0.0 


CD4 lymphocyte none 


0.0 


Js-U-olZ (oasopnilj 
PMA/ionomycin 


0.0 


2ry Thl/Th2/Trl_anti- 

CDSO I H 1 1 


2.0 


CCD 1106 

(Keratinocytes) none 


28.5 


LAK cells rest 


0.0 


CCD 1 106 

(Keratinocytes) 
TNFalpha H IL-lbeta 


100.0 


LAK cells IL-2 


0.0 


Liver cirrhosis 


23.8 


LAK cells IL-2+IL-12 


1.6 


Lupus kidney 


3.4 


LAK cells IL-2+IFN 
gamma 


3.7 


NCI-H292 none 


1.8 


LAK cells IL-2+ IL-18 


0.9 


NCI-H292 IL-4 


3.8 


LAK cells 
PMA/ionomycin 


0.0 


NCI-H292 IL-9 


2.0 


NK Cells IL-2 rest 


0.0 


NCI-H292 IL-13 


4.0 


Two Way MLR 3 day 


0.0 


NCI-H292 IFN gamma 


1.1 


Two Way MLR 5 day 


0.0 


HPAEC none 


0.0 


Two Way MLR 7 day 


0.9 


HPAEC TNF alpha + IL- 
1 beta 


0.0 




0.0 


Lung fibroblast none 


0.0 


PBMC PWM 


0.7 


Lung fibroblast TNF 
alpha + IL-1 beta 


0.8 


PBMC PHA-L 


0.0 


Lung fibroblast IL-4 


0.0 


Ramos (B cell) none 


0.0 


Lung fibroblast IL-9 


0.7 


Ramos (B cell) 
ionomyein 


0.0 


Lung fibroblast IL-13 


0.0 

.... .... 


B lymphocytes PWM 


3.4 


Lung fibroblast IFN 
gamma 


0.0 


B lymphocytes CD40L 
and IL-4 


4.0 


Dermal fibroblast 
CCD 1070 rest 


0.9 


EOL-1 dbcAMP 


0.0 


Dermal fibroblast 


5.6 
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■CCD 1070 TNF alpha 




EOL-1 d be AMP 
PMA/ionomycin 


1.1 


iDermsl fibroblast 
(CCD1070 IL-1 beta 


0.0 


Dendritic cells none 


0.0 


|Dermal fibroblast IFN 
[gamma 


0.0 


Dendritic cells LPS 


0.0 


jDermal fibroblast IL-4 


1.6 


Dendritic cells anti- 
CD40 


0.0 


JlBD Colitis 2 


5.9 


Monocytes rest 


0.0 


jlBD Crohn's 


2.4 


Monocytes LPS 


0.0 


jColon 


4.1 


Macrophages rest 


0.0 


(Lung 


1.7 


Macrophages LPS 


0.0 


{Thymus 


12.4 


HUVEC none 


0.0 


(Kidney 


102 


HUVEC starved 


00 





CNS_neurodegeneration_vl.O Summary: Ag3427 This panel confirms the expression of 
CG59383-01 gene at low levels in the brains of an independent group of individuals. 
However, no differential expression of this gene was detected between Alzheimer's 
diseased postmortem brains and those of non-demented controls in this experiment. Please 
5 see Panel 1 .4 for a discussion of the potential utility of this gene in treatment of central 
nervous system disorders. 

General_screening_panel_vl.4 Summary: Ag3427 Highest expression of the CG59383- 
01 gene is seen in a colon cancer cell line (CT=27.2). Significant expression is also seen in 
a cluster of samples derived from ovarian cancer cell lines. Thus, expression of this gene 
10 could be used to differentiate between these samples and other samples on this panel and as 
a marker for the presence of these cancers. Furthermore, therapeutic modulation of the 
expression or function of this gene may be effective in the treatment of ovarian or colon 
cancers. 

This molecule is also expressed at low levels in all regions of the CNS examined. 
1 5 Therefore, therapeutic modulation of the expression or function of this gene may be useful 
in the treatment of neurologic disorders, such as Alzheimer's disease, Parkinson's disease, 
schizophrenia, multiple sclerosis, stroke and epilepsy. 

Among tissues with metabolic function, this gene is expressed at low levels in 
adipose and pancreas. This expression suggests that this gene product may play a role in 
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normal neuroendocrine and metabolic and that disregulated expression of this gene may 
contribute to neuroendocrine disorders or metabolic diseases, such as obesity and diabetes 



Panel 4D Summary: Ag3427 Highest expression of the CG593 83-01 gene is seen in 
keratinocytes treated with the inflammatory cytokines TNF-alpha and IL-1 beta (CT=30.3). 
Therefore, modulation of the expression or activity of the protein encoded by this transcript 
through the application of small molecule therapeutics may be useful in the treatment of 
asthma, COPD, emphysema, psoriasis and wound healing. 

AU. CG58526-01: Scramblase 

Expression of gene CG58526-01 was assessed using the primer-probe set Ag3366, 
described in Table AUA. Results of the RTQ-PCR runs are shown in Table AUB. 



Table AUA . Probe Name Ag3366 



Primers 


Sequences 


Length 


Start 
Position 


I SEQID 

J NO: 


Forward 


5 1 -tgtcttcacaaatgctgacaat-3 1 


22 


729 


535 


Probe 


TET-5 1 -ttcggaattcatgttcctgcagatct-3 ' - 
TAMRA 


26 


751 


j 536 


Reverse 


5 ' -gatcattgctgctttgactgtt-3 ' 


22 


783 


! 537 



Table AUB . General screening_panel_v 1.4 



Tissme Name 


Rel. Exp.(%) 
Ag3366, Run 
217042585 


Tissue Name 


Rel. Exp.(%) 
Ag3366, Run 
217042585 


Adipose 


0.0 


Renal ca. IK- 10 


0.0 


Melanoma* 
Hs688(A).T 


0.0 


Bladder 


0.0 


Melanoma* 
Hs688(B).T 


0.0 


Gastric ca. (liver met.) 
NCI-N87 


0.0 


Melanoma* M 1 4 


0.0 


Gastric ca. KATO III 


0.0 


Melanoma* 
LOXIMYI 


0.0 


Colon ca. SW-948 


0.0 


Melanoma* SK- 
MEL-5 


0.0 


Colon ca. SW480 


0.0 


Squamous cell 
carcinoma SCC-4 


0.0 


Colon ca.* (SW480 
met) SW620 


100.0 


Testis Pool 


69.3 


Colon ca. HT29 


0.0 


Prostate ca.* (bone 


0.0 


Colon ca. HCT-116 


25.2 
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met) PC -3 


1 




Prostate Pool 


0 0 




43.2 


Placenta 


0.0 


Colon cancer tissue 


0.0 


Uterus Pool 


0 0 


Colon ca. SWT 116 


0.0 


Ovarian ca. 
OVCAR-3 


0.0 


Colon ca. Colo-205 


0.0 


Ovarian ca. SK- 
OV-3 


0.0 


Colon ca. SW-48 


0.0 


Ovarian ca. 
\j v /Yrv ^ 


0.0 


Colon Pool 


0.0 


Ovarian ca. 
OVCAR-5 


0.0 


Small Intestine Pool 

_______ 


0.0 


Ovarian ca. 
IGROV-1 


— — 

0.0 


Stomach Pool 


49.0 


Ovarian ca. 
OVCAR-8 


0.0 


Bone Marrow Pool 


0.0 


Ovary 


0.0 


Fetal Heart 


0.0 


Breast ca. MCF-7 


0.0 


Heart Pool 


0.0 


Breast ca. MDA- 
MB-231 


0.0 


Lymph Node Pool 


0.0 


Breast ca. BT 549 


0.0 


Fetal Skeletal Muscle 


0.0 


Breast ca. T47D 


0.0 


Sk< letal Muscle Pool 


0.0 


Breast ca. MDA-N 


0.0 


Spleen Pool 


3.9 


Breast Pool 


0.0 


Thymus Pool 


0.0 


Trachea 


0.0 


CNS cancer 
(glio/astro) U87-MG 


3.5 


Lung 


0.0 


CNS cancer 

Is. ti ) U-118 vlG 


0.0 


Fetal Lung 


0.0 


CNS cancer 
(neurormet) SK-N-AS 


0.0 


Lung ca.NCI-N417 


0.0 


CNS cancer (astro) SF- 

539 - ~— 


0.0 


Lung ca. LX-1 


0.0 


CNS cancer (astro) 
SNB-75 


0.0 


Lung ca. NCI-H146 


0.0 


CNS cancer (glio) 
SNB-19 


0.0 


Lung ca. SHP-77 


0.0 


CNS cancer (glio) SF- 
295 


0.0 


Lung ca. A549 


0.0 


Brain (Amygdala) Pool 


[ 12.5 


Lung ca. NCI-H526 


0.0 


Brain (cerebellum) 


f 0.0 


Lung ca. NCI-H23 


0.0 


Brain (fetal) 


; 13.8 


Lungca. NCI-H460 


0.0 


Brain (Hippocampus) 
Pool 


| 15.0 


Lung ca. HOP-62 


0.0 


Cerebral Cortex Pool 


2.6 
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0.0 


Brain (Substantia 
nigra) Pool 


7.5 


Liver 


0.0 


Brain (Thalamus) Pool 


15.7 


Fetal Liver 


0.0 


Brain (whole) 


15.2 


Liver ca. HepG2 


0.0 


Spinal Cord Pool 


7.5 


Kidney Pool 


0.0 


Adrenal Gland 


0.0 


Fetal Kidney 


0.0 


Pituitary gland Pool 


22.8 


Renal ca. 786-0 


0.0 


Salivary Gland 


0.0 


Renal ca. A498 


0.0 


Thyroid (female) 


0.0 


Renal ca. ACHN 


0.0 


Pancreatic ca. 
CAPAN2 


0.0 


Renal ca. UO-31 


0.0 


Pancreas Pool 


0.0 



CNSjneurodegenerationvl.O Summary: Ag3366 Expression of the CG58526-01 gene 
is low/undetectable in all samples on this panel (CTs>35). (Data not shown.) 



General_screening_panel_vl.4 Summary: Ag3366 Expression of the CG58526-01 gene 
is restricted to a sample derived from a colon cancer cell line (CT=34.5) and the testis. 
5 Thus, expression of this gene could be used to differentiate between these samples and 
other samples on this panel and as a marker to detect the presence of colon cancer. 
Furthermore, therapeutic modulation of the expression or function of this gene may be 
effective in the treatment of colon cancer. 

Panel 4D Summary: Ag3366 Results from one experiment with the CG58526-01 gene 
10 are not included. The amp plot indicates that there were experimental difficulties with this 
run. 

AV. CG57851-01: sulfotransferase 

Expression of gene CG57851-01 was assessed using the primer-probe set Ag3349, 
described in Table AVA. Results of the RTQ-PCR runs are shown in Tables AVB, AVC 
15 andAVD. 



Table AVA . Probe Name Ag3349 



Primers 


Sequences 


{Length 


Start 
Position 


SEQID 
NO: 


Forward 


5 * -acaaaatgatggcgatattgag-3 ' 


I 22 


237 


538 


Probe 


TET-5 ' -cgcttccattcaacttcaacaccct-3 ' - 
TAMRA 


I 25 


270 


539 



754 



[Reverse {5 ' - tcattcttatccactccaggaa-3 ' | 22 ) 295 _ ) 540 

Table AVB . CNS_neurodegeneration_vl.O 



Tissue Name 


Rel. Exp.(%) Ag3349, 
Run 210141483 


Tissue Name 


Rel. Exp.(%) Ag3349, 
Run 210141483 


AD 1 Hippo 


32.8 


Control (Path) 3 
Temporal Ctx 


0.0 


AD 2 Hippo 


61.6 


Control (Path) 4 
Temporal Ctx 


48.6 


AD 3 Hippo 


18.0 


AD 1 Occipital 
Ctx 


10.5 


AD 4 Hippo , 


8.9 


AD 2 Occipital 
Ctx (Missing) 


0.0 


AD 5 hippo 


18.0 


AD 3 Occipital 
Ctx 


0.0 


AD 6 Hippo 


11.7 


AD 4 Occipital 
Ctx 


0.0 


Control 2 Hippo 


27.4 


AD 5 Occipital 
Ctx 


8.6 


Control 4 Hippo 


17.9 


AD 6 Occipital 
Ctx 


0.0 


Control (Path) 3 
Hippo 


12.7 


Control 1 Occipital 
Ctx 


0.0 


AD 1 Temporal Ctx 


14.8 


Control 2 Occipital 
Ctx 


0.0 


AD 2 Temporal Ctx 


8.7 


Control 3 Occipital 
Ctx 


51.4 


AD 3 Temporal Ctx 


8.2 


Control 4 Occipital 
Ctx 


5.6 


AD 4 Temporal Ctx 


10.4 


Control (Path) 1 
Occipital Ctx 


100.0 


AD 5 Inf Temporal 
Ctx 


7.2 


Control (Path) 2 
Occipital Ctx 


17.8 


AD 5 SupTemporal 

Ctx 


7.4 


Control (Path) 3 
Occipital Ctx 


0.0 


AD 6 Inf Temporal 

Ctx 


9.1 


Control (Path) 4 
Occipital Ctx 


41.2 


AD 6 S up Temporal 
Ctx 


27.9 


Control 1 Parietal 
Ctx 


3.3 


Control 1 Temporal 
Ctx 


9.2 


Control 2 Parietal 
Ctx 


70.7 


Control 2 Temporal 
Ctx 


25.9 


Control 3 Parietal 
Ctx 


14.3 


Control 3 Temporal 


13.4 


Control (Path) 1 


35.8 
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Ctx 




Parietal Ctx 




Control 4 Temporal 
Ctx 


3.7 


Control (Path) 2 
Parietal Ctx 


17.7 


Control (Path) 1 
Temporal Ctx 


53.2 


Control (Path) 3 
Parietal Ctx 


0.0 


Control (Path) 2 
Temporal Ctx 


51.1 


Control (Path) 4 
Parietal Ctx 


52.1 



Table AVC . General_screening_panel_vl.4 



Tissue Name 


Rel. Exp.(%) 
Ag3349, Run 
215620671 


Tissue Name 


Rel. Exp.(%) 
Ag3349, Run 
215620671 


Adipose 


3.6 


Renal ca. TK-10 


1.9 


Melanoma* 
Hs688(A).T 


4.5 


Bladder 


35.4 


Melanoma* 
HS05&(r$). 1 


0.3 


Gastric ca. (liver met.) | 

M/~*T "MOT 


0.4 


Melanoma* Ml 4 


0.0 


Gastric ca. KATO III 


0.7 


Melanoma* 
LOXIMVI 


0.5 


Colon ca. SW-948 


2.8 


Melanoma* SK- 
MEL-5 


8.0 


Colon ca. SW480 


10.2 


Squamous cell 
carcinoma SCC-4 


0.0 


Col< n ca ' 1<\V480 
met) SW620 


13.5 


Testis Pool 


5.4 


Colon ca. HT29 


0.0 


Prostate ca.* (bone 
met) PC-3 


0.8 


Colon ca.HCT-1 16 


0.7 


Prostate Pool 


15.1 


Colon ca. CaCo-2 


3.0 


Placenta 


0.0 


Colon cancer tissue 


7.0 


Uterus Pool 


0.4 


Colon ca.SWl 116 


0.2 


Ovarian ca. 
OVCAR-3 


0.9 


Colon ca. Colo-205 


0.8 


Ovarian ca. SK- 
OV-3 


18.4 


Colon ca. SW-48 


0.0 


Ovarian ca. 
OVCAR-4 


0.0 


Colon Pool 


5.3 


Ovarian ca. 
OVCAR-5 


10.4 


Small Intestine Pool 


2.7 


Ovarian ca. 
IGROV-1 


0.3 


Stomach Pool 


5.8 


Ovarian ca. 
OVCAR-8 


1.3 


Bone Marrow Pool 


,, 


Ovary 


5.0 


Fetal Heart 


2.0 


Breast ca. MCF-7 


1.0 


Heart Pool 


2.4 
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Breast ca. MDA- 
MB-23 1 


1.3 


Lymph Node Pool 


8.9 


Breast ca. BT 549 


0.4 


Fetal Skeletal Muscle 


2.7 


Breast ca. T47D 


0.0 


Skeletal Muscle Pool 


0.0 


Breast ca. MDA-N 


1.6 


Spleen Pool 


0.4 


Breast Pool 


10.5 


Thymus Pool 


10.5 


Trachea 


1.2 


CNS cancer 
(glio/astro) U87-MG 


11.4 


Lung 


1.3 


CNS cancer 
(glio/astro) U-118-MG 


2.4 


Fetal Lung 


7.6 


CNS cancer 
(neuro;met) SK-N-AS 


0.1 


Lungca.NCI-N417 


0.3 


CNS cancer (astro) SF- 
539 


0.2 


Lung ca. LX-1 


100.0 


CNS cancer (astro) 
SNB-75 


4.6 


Lung ca.NCI-H146 


0.8 


CNS cancer (glio) 
SNB-19 


3.1 

. ._ 


Lung ca. SHP-77 


0.0 


CNS cancer (glio) SF- 
295 


5.3 


Lung ca. A549 


0.8 


Brain (Amygdala) Pool 


0.4 


Lung ca. NCI-H526 


0.0 


Brain (cerebellum) 


0.9 


Lung ca. NCI-H23 


7.1 


Brain (fetal) 


1.6 


Lung ca. NCI-H460 


0.8 


Brain (Hippocampus) 
Pool 


0.8 


Lung ca. HOP-62 


2.6 


Cerebral Cortex Pool 


2.3 


l^ung ca. iNci-ri jzz 


0.4 


Brain (Substantia 
nigra) Pool 


" 


Liver 


0.0 


Brain (Thalamus) Pool 


\ 8 


Fetal Liver 


10.5 


Brain (whole) 


3.1 


L iver ca. HepG2 


0.8 


Spinal Cord Pool 


4.1 


Kidney Pool 


7.9 


Adrenal Gland 


2.4 


Fetal Kidney 


47.3 


Pituitary gland Pool 


1.4 


Renal ca. 786-0 


2.6 


Salivary Gland 


1.4 


Renal ca. A498 


0.9 


Thyroid (female) 


0.9 


Renal ca. ACHN 


1.3 


Pancreatic ca. 
CAPAN2 


3.7 


Renal ca. UO-31 


2.8 


Pancreas Pool 


9.3 



Table A VP . Panel 4D 





Rel. Exp.(%) 




Rel. Exp.(%) 


Tissue Name 


Ag3349, Run 


Tissue Name 


Ag3349, Run 




165222879 




165222879 
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Secondary Th 1 act 


0.0 [HUVEC IL-lbeta 


0.0 




0.0 iHUVEC I FN gamma 


0.0 


Secondary Trl act 


0.8 


HUVEC TNF alpha + 
IFN gamma 


0.0 


Secondary Thl rest 


0.0 


HUVEC TNF alpha + 
IL4 


0.0 


Secondary Th2 rest 


0.0 


HUVEC IL-11 


0.0 


Secondary Trl rest 


0.0 


Lung Microvascular EC 
none 


2.3 


Primary Thl act 


0.0 


Lung Microvascular EC 
TNFalpha + IL-lbeta 


0.0 


Primary Th2 act 


0.0 


Microvascular Dermal 
EC none 


2.3 


Primary Trl act 


0.0 


Microsvasular Dermal 
EC TNFalpha + IL-lbeta 


0.6 


Primary Thl rest 


0.0 


Bronchial epithelium 
TNFalpha + 1L1 beta 


0.0 


Primary Th2 rest 


0.7 


Small airway epithelium 
none 


0.4 


Primary Trl rest 


0.0 


Small airway epithelium 
TNFalpha + IL-lbeta 


2.1 


CD45RA CD4 
lymphocyte act 


0.0 


Coronery artery SMC rest 


0.6 


CD45RO CD4 
lymphocyte act 


0.0 


Coronery artery SMC 
TNFalpha + IL-lbeta 


0.0 


CD8 lymphocyte act 


0.0 


Astrocytes rest 


0.0 


Secondary CD8 
lymphocyte rest 


0.0 


Astrocytes TNFalpha + 
IL-lbeta 


1.0 


Secondary CD8 
lymphocyte act 


0.0 


KU-812 (Basophil) rest 


0.0 


CD4 lymphocyte none 


0.0 


KU-8 12 (Basophil) 
PMA/ionomycin 


0.0 


2ry Thl/Th2/Trl_anti- 


0.0 


CCD 1106 

(Keratinocytes) none 


0.6 


LAK cells rest 


1.4 


CCD 1106 
(Keratinocytes) 
TNFalpha + IL-lbeta 


0.0 


LAK cells IL-2 


0.0 jLiver cirrhosis 


3.3 


LAK cells IL-2+IL-12 


0.0 jLupus kidney 


5.8 


LAK cells IL-2+1FN 
gamma 


1.0 


NCI-H292 none 


0.6 


LAK cells IL-2+ IL-18 


0.0 


NCI-H292 1L-4 


0.0 


LAK cells 
PMA/ionomycin 


1.3 


NCI-H292 IL-9 


1.5 


NK Cells IL-2 rest 


0.0 


NCI-H292 IL-13 


0.8 
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Two Way MLR 3 day 


0.0 


(NCI-H292 IFN gamma 


0.3 


Two Way MLR 5 day 


0.0 


iHPAEC none 


0.7 


Two Way MLR 7 day 


1.0 


|HPAEC TNF alpha + IL- 
1 beta 


0.0 


I'HMl* rest 


0.0 


jLung fibroblast none 


0.0 


PBMC PWM 


0.0 


jLung fibroblast TNF 
falpha + IL-1 beta 


0.6 




0.0 


iLung fibroblast IL-4 


0.0 


Ramos (B cell) none 


0.0 


iLung fibroblast IL-9 


0.9 


Ramos (B cell) 
ionomycin 


0.6 


JLung fibroblast IL-1 3 


0.0 


B lymphocytes PWM 


0.0 


JLung fibroblast IFN 

jgamma 


0.0 


B lymphocytes CD40L 
ana ili-h 


0.0 


(Dermal fibroblast 


0.6 


EOL-1 dbcAMP 


0.0 


1 Dermal fibroblast 

iK^\-*LJ IV J \J 1 i\r dipila 


0.8 


JbUJL-1 ubcAMr 
PMA/ionomycin 


0.0 


jDermal fibroblast 
ICCD1070 IL-1 beta 


0.0 


Dendritic cells none 


7.4 


(Dermal fibroblast IFN 

'gamma 


0.0 


Dendritic cells LPS 


2.1 


JDermal fibroblast IL-4 


0.7 


Dendritic cells anti- 
CD40 


2.9 


|lBD Colitis 2 


0.0 


Monocytes rest 


0.0 


jlBD Crohn's 


0.0 


Monocytes LPS 


1.3 


|Colon 


0.6 


Macrophages rest 


0.9 


,Lung 


0.7 


Macrophages L PS 


0.2 


IThymus 


100.0 


HU\ i < none 


0.0 


Kidney 


1.7 


HUVEC starved 


0.0 | 





CNS_neurodegeneration_vl.0 Summary: Ag3349 This panel confirms the expression of 
CG57851-01 gene at low levels in the brains of an independent group of individuals. 
However, no differential expression of this gene was detected between Alzheimer's 
diseased postmortem brains and those of non-demented controls in this experiment. The 
5 expression of this gene in the brain suggests that therapeutic modulation of the expression 
or function of this gene may be useful in the treatment of neurologic disorders, such as 
Alzheimer's disease, Parkinson's disease, schizophrenia, multiple sclerosis, stroke and 
epilepsy. 

General_screening_panel_vl.4 Summary: Ag3349 Highest expression of the CG57851- 

10 01 gene is seen in a lung cancer cell line (CT=30). Thus, expression of this gene may be 
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used to differentiate between this sample and other samples on this panel and as a marker 
for lung cancer. This gene encodes a sulfotransferase homolog. Sulfotransferases are 
involved in the metabolism of drugs and endogenous compounds in the body and also 
synthesize the complex glycoproteins found on the cell surface of cancer cells. Therefore, 
therapeutic modulation of the expression or function of this gene may be effective in the 
treatment of lung cancer. 

Among tissues with metabolic function, this gene is expressed at moderate to low 
levels in adipose and pancreas. This expression among these tissues suggests that this gene 
product may play a role in normal metabolic function and that disregulated expression of 
this gene may contribute to metabolic diseases, such as obesity and diabetes. 

Panel 4D Summary: Ag3349 Highest expression of the CG57851-01 gene is seen in the 
thymus (CT=29.7). The putative protein encoded by this gene could therefore play an 
important role in T cell development. Small molecule therapeutics designed against the 
protein encoded by this gene could be utilized to modulate immune function (T cell 
development) and be important for organ transplant, AIDS treatment or post chemotherapy 
immune reconstitiution. 

Panel 5 Islet Summary: Ag3349 Expression of the CG57851-01 gene is low/undetectable 
in all samples on this panel (CTs>35). (Data not shown.) 

AW. CG59258-01: KIAA1608 protein 

Expression of gene CG59258-01 was assessed using the primer-probe set Ag3520, 
described in Table AWA. 



Table AWA . Probe Name Ag3520 



Primers 


Sequences 


1 

Length! 


Start 
Position 


SEQID 
NO: 


Forward 


5 ' -cctcacagatgaggacacaga-3 1 


21 i 


717 


541 


Probe 


TET-5 1 -acttgcttgccaaagtcactcagcaa-3 ' - 
TAMRA 


26 | 


752 


542 


Reverse 


5 ' - tttctgagagccagacagacat- 3 1 


22 f 


781 


543 



CNSneurodegenerationvl.O Summary: Ag3520 Expression of the CG59258-01 gene 
is low/undetectable in all samples on this panel (CTs>35). (Data not shown.) 



760 



Genera]l_screening_panel_vl.4 Summary: Ag3520 Expression of the CG59258-01 gene 
is low/uitidetectable in all samples on this panel (CTs>35). (Data not shown.) 

Panel 4D Summary: Ag3520 Expression of the CG5 925 8-01 gene is low/undetectable in 
all samples on this panel (CTs>35). (Data not shown.) 

AX. CG59564-01: Sorting nexin 6 

Expression of gene CG595 64-01 was assessed using the primer-probe set Ag3471, 
described in Table AXA. Results of the RTQ-PCR runs are shown in Tables AXB, AXC 
and AXD. 



Table AXA . Probe Name Ag3471 



Primers 


Sequences 


Length 


Start 
Position 


SEQID 
NO: 


Forward 


5 ' -gtgcctggcagacgattata-3 ' 


20 


820 


544 


Probe 


TET-5 ' -ctatctcagctgcgctgagcagtctg-3 ' - 
TAMRA 


26 


843 


545 


Reverse 


5 ' -gtccttagctggttgacttcct-3 ' 


22 


876 


546 



Table AXB . CNS_neurodegeneration_vl.O 



Tissue Name 


Rel. Exp.(%) Ag3471, 
Run 210376963 


Tissue Name 


Rel. Exp.(%) Ag3471, 
Run 210376963 


AD 1 Hippo 


10.8 


Control (Path) 3 
Temporal Ctx 


2.4 


AD 2 Hippo 


27.0 


Control (Path) 4 
Temporal Ctx 


28.5 


AD 3 Hippo 


5.9 


AD 1 Occipital Ctx 


10.3 


AD 4 Hippo 


10.8 


AD 2 Occipital Ctx 
(Missing) 


0.0 


AD 5 H ippo 


64.6 


AD 3 Occipital Ctx 


3.6 


AD 6 Hippo 


43.8 


AD 4 Occipital Ctx 


30.4 


Control 2 Hippo 


56.3 


AD 5 Occipital Ctx 


100.0 


Control 4 Hippo 


4.0 


AD 6 Occipital Ctx 


14.5 


Control (Path) 3 
Hippo 


2.3 


Control 1 Occipital 
Ctx 


1.1 


AD 1 Temporal 
Ctx 


11.5 


Control 2 Occipital 
Ctx 


82.9 


AD 2 Temporal 
Ctx 


36.3 


Control 3 Occipital 
Ctx 


13.3 


AD 3 Temporal 


4.7 


Control 4 Occipital 


5.4 
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Ctx 




Ctx 




A Pi A Tpmnnrol 

j\d h i temporal 
Ctx 


27.9 


control (Fatnj i 
Occipital Ctx 


87.7 


A Pi ■> Tn f TVm 1-1 1 

j\u j mi icmpordi 
Ctx 


85.9 


v^ontroi [train) z 
Occipital Ctx 


10.3 


Temporal Ctx 


37.1 


control ^ratn ) j 
Occipital Ctx 


1.5 


AD 6 Inf Temporal 
Ctx 


46.3 


Control (Path) 4 
Occipital Ctx 


12.2 


AD 6 Sup 
Temporal Ctx 


52.1 


Control 1 Parietal 
Ctx 


3.1 


Control 1 
Temporal Ctx 


3.6 


Control 2 Parietal 
Ctx 


38.4 


Control 2 
Temporal Ctx 


81.2 


Control 3 Parietal 
Ctx 


17.4 


Control 3 
Temporal Ctx 


19.1 


Control (Path) 1 
Parietal Ctx 


88.9 


Control 3 
Temporal Ctx 


8.0 


Control (Path) 2 
Parietal Ctx 


22.2 


Control (Path) 1 
Temporal Ctx 


88.9 


Control (Path) 3 
Parietal Crx 


1.7 


Control (Path) 2 
Temporal Ctx 


48.0 


Control (Path) 4 
Parietal Ctx 


38.7 


Table AXC. General screenins_panel vl.4 


Tissue Name 


Rel. Exp.(%) 
Ag3471, Run 
222691297 


Tissue Name 


Rel. Exp.(%) 
Ag3471, Run 
222691297 


Adipose 


2.3 


Renal ca. TK- 10 


2.7 


Melanoma* 
Hs688(A).T 


3.1 


Bladder 


4.2 


Melanoma* 
Hs688(B).T 


4.2 


Gastric ca. (liver met.) 
NCI-N87 


6.3 


Melanoma* Ml 4 


1 > (i 


Gastric ca. KATO III 


4.1 


Melanoma* 
LOXIMVI 


0.7 


Colon ca. SW-948 


0.7 


Melanoma* SK- 
MEL-5 


0.8 


Colon ca. SW480 


2.6 


Squamous cell J 
carcinoma SCC-4 | 


Colon ca.* (SW480 
met) SW620 


4.5 


Testis Pool 


6.2 


Colon ca. HT29 


2.2 


Prostate ca.* (bone 
met) PC-3 


3.8 


Colon ca. HCT-116 


3.9 


Prostate Pool 


0.7 


Colon ca. CaCo-2 


3.5 
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Placenta. 


2.3 


Colon cancer tissue 


1.0 


Uterus Pool 


0.7 


CobnTrSW1116 


1.9 


Ovarian ca. 
OVCAR-3 


5.3 


Colon ca. Colo-205 


0.6 


Ovarian ca. SK- 
OV-3 


2.1 


Colon ca. SW-48 


2.3 


Ovarian ca. 
OVCAR-4 


2.8 


Colon Pool 


7.0 


Ovsrisn c<i. 
OVCAR-5 


5.8 


Small Intestine Pool 


5.8 


Ov3ris.n C3. 
IGROV-l 


5.3 


Stomach Pool 


5.5 


Ovarian ca. 
OVCAR-8 


3.0 


Bone Marrow Pool 


3.5 


Ovary 


7 I 


Fetal Heart 


1.7 


Breast ca. MCF-7 


22 


Heart Pool 


3.1 


Breast ca. MDA- 
MB-23 1 


2.4 


Lymph Node Pool 


9.5 


Breast ca. BT 549 


98.6 


Fetal Skeletal Muscle 


1 . \ 


Breast ca. T47D 


8.4 


Skeletal Muscle Pool 


4.2 


Breast ca. MDA-N 




Spleen Pool 


1 .8 


Brca-.t 1',h>I 


7.5 


Thymus Pool 


6.2 


Trachea 


2.7 


CNS cancer 
(glio/astro) U87-MG 


7.0 


Lung 


1.9 


CNS cancer 
(glio/astro) U-118-MG 


4.9 


Fetal Lung 


13.4 


CNS cancer 
(neuro;met) SK-N-AS 


11.3 


Lung ca. NCI-N417 


2.2 


CNS cancer (astro) SF- 
539 


1.9 


Lung ca. LX-1 


2.2 


SNB-75 ( ) 


23.8 


Lung ca. NCI-H146 


2.1 


CNS cancer (glio) 
SNB-19 


3.6 


Lung ca. SHP-77 


3.7 


CNS cancer (glio) SF- 
295 


11.3 


Lung ca. A549 


3.7 


Brain (Amygdala) Pool 


22.1 


Lung ca. NCI-H526 


2.5 


Brain (cerebellum) 


42 9 


Lung ca. NCI-H23 


17.6 


Brain (fetal) 


100.0 


Lung ca. NCI-H460 


1.8 


Brain (Hippocampus) 
Pool 


26.6 


Lung ca. HOP-62 


3.6 


Cerebral Cortex Pool 


33.0 


Lung ca. NCI-H522 


5.7 


Brain (Substantia 
nigra) Pool 


29.3 
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Liver 




0.1 


Brain (Thalamus) Pool 




37.4 


Fetal Liver 


1.3 


Brain (whole) 


55.1 


Liver ca. HepG2 


1.3 


Spinal Cord Pool 


7.7 


Kidney iPool 


10.6 


Adrenal Gland 


1.5 


Fetal Kidney 


6.7 


Pituitary gland Pool 


0.6 


Renal ca. 786-0 


0.9 


Salivary Gland 


0.8 


Renal ca. A498 


0.8 


Thyroid (female) 


0.8 


Renal ca. ACHN 


2.1 


Pancreatic ca. 
CAPAN2 


3.0 


Renal ca. UO-31 


3.4 


Pancreas Pool 


7.4 


Table AXD. Panel 4D 


Tissue Name 


Rel. Exp.(%) 
Ag3471, Run 
loo417Izo 


Tissue Name 


Rel. Exp.(%) 
Ag3471, Run 
loo4171Z6 


Secondsiry Thl act 


10.2 


HUVEC IL-lbeta 


2.3 


Secondary Th2 act 


11.2 


HUVEC IFN gamma 


7.8 


Secondary Trl act 


19.2 


HUVEC TNF alpha + 
IFN gamma 


4.7 


Secondary Th 1 rest 


28.7 


HUVEC TNF alpha + 
IL4 


6.3 


Secondary Th2 rest 


18.4 


HUVEC IL-11 


6.2 


Secondary Trl rest 


24.5 


Lung Microvascular EC 
none 


6.4 


Primary Th 1 act 


8.7 


Lung Microvascular EC 
TNFalpha + IL-lbeta 


7.1 


Primary Th2 act 


20.4 


Microvascular Dermal 
EC none 




6.9 


Primary Trl act 


31.0 


Microsvasular Dermal 
EC TNFalpha + IL-lbeta 


2.5 


Primary Th 1 rest 


45.7 


Bronchial epithelium 
TNFalpha + ILlbeta 


2.6 


Primary Th2 rest 


23.3 


Small airway epithelium 
none 


3.1 


Primary Trl rest 


25.3 


Small airway epithelium 
TNFalphaH- IL-lbeta 


3.3 


CD45R\ CD4 
lymphocyte act 


9.7 


Coronery artery SMC rest 


3.9 


CD45RO CD 4 
lymphocyte act 


23.2 


Coronery artery SMC 
I NI :xlpha + IL-lbeta 




4.6 


CD 8 lymphocyte act 


9.0 


Astrocytes rest 




12.7 


Secondary CD8 
lymphocyte rest 


24.8 


Astrocytes TNFalpha + 
IL-lbeta 




18.9 
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Secondary CD 8 

^^^^ 


19.9 


KU-812 (Basophil) rest 


26.2 


CD4 lymphocyte none 


10.6 


k I 1 (in /Rocrn-iMU 

JVU-olZ ^JDasopnn^ 

PMA/ionomycin 


50.0 


2ryThl/Th2/Trl anti- 
CD95 CH11 


22.7 


CCD 11 06 

(Keratinocytes) none 


6.9 


LAK cells rest 


6.4 


CCD 11 06 
(Keratinocytes) 
TNFalpha + IL-lbeta 


10.9 


LAK cells IL-2 


27.5 


Liver cirrhosis 


15.3 


LAK cells IL-2+IL-12 


21.3 


Lupus kidney 


4.2 


LAK cells IL-2+1FN 
gamma 


27.4 


NCI-H292 none 


5.8 


LAK cells IL-2+IL-18 


22.7 


NCI-H292 II -1 


7.2 


LAK cells 
PMA/ionomycin 


8.2 


NCI-H292 IL-9 


3.6 


NK Cel ls IL-2 rest 


13.6 


NCI-H292 IL-13 


3.7 


Two Way MLR 3 day 


23.7 


NCI-H292 I FN gamma 


3.2 


Two Way MLR 5 day 


5.6 


HPAEC none 


3.7 


Two Way MLR 7 day 


7.7 


HPAEC TNF alpha + IL- 
1 beta 


2.3 


PBMC rest 


6.1 


Lung fibroblast none 


18.3 


PBMC PWM 


7.3 


Lung fibroblast TNF 
alpha + IL-1 beta 


20.6 


PBMC PHA-L 7.1 


Lung fibroblast IL-4 


16.6 


Ramos (B cell) none j 6.3 


Lung fibroblast IL-9 


9.2 


Ramos (B cell) 
ionomycin 


3.2 


Lung fibroblast IL-13 


7.9 


B lymphocytes PWM 


12.2 


Lung fibroblast IFN 
gamma 


6.2 


B lymphocytes CD40L 
and IL-4 


11.1 


Dermal fibroblast 
CCD 1070 rest 


11.4 


EOL-1 dbcAMP 


7.4 


Dermal fibroblast 
CCD 1070 TNF alpha 


28.5 


EOL-1 dbcAMP 
PMA/io nomyc in 


12.5 


Dermal fibroblast 
CCD 1070 IL-1 beta 


7.2 


Dendritic cells none 


13.4 


Dermal fibroblast IFN 
gamma 


69 


Dendritic cells LPS 


14.0 


Dermal fibroblast IL-4 


12.8 


Dendritic cells anti- 
CD40 


15.9 


IBD Colitis 2 


1.7 


Monocytes rest 


21.3 


IBD Crohn's 


4.8 


Monocytes LPS 


11.4 


Colon 


100.0 


Macrophages rest 


23.5 


Lung 


12.6 
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Ma rophages LPS 


3.7 


Thymus 


8.9 


HUVEC none 


6.7 


Kidney 


34.4 


HUVEC starved 


11.0 







CNS_neurodegeneration_vl.O Summary: Ag3471 This panel does not show differential 
expression of the CG59564-01 gene in Alzheimer's disease. However, this expression 
profile confirms the presence of this gene in the brain. Please see Panel 1.4 for discussion 
of utility of this gene in the central nervous system. 



General_screenin^j>anel_vl.4 Summary: Ag3471 The CG59564-01 gene, a sorting 
nexin homolog, shows highly brain preferential expression. Moderate levels of expression 
are seen in all brain regions examined, with highest expression in the fetal brain (CT=28.5). 
Thus, this gene would be useful for distinguishing brain tissue from non-neural tissue, and 
may be beneficial as a drug target in neurologic disease, such as Alzheimer's disease, 
Parkinson's disease, schizophrenia, multiple sclerosis, stroke and epilepsy. 

Among tissues with metabolic function, this gene is expressed at low levels in 
pituitary, adipose, adrenal gland, pancreas, and adult and fetal skeletal muscle, heart, and 
liver. This widespread expression among these tissues suggests that this gene product may 
play a role in normal neuroendocrine and metabolic and that disregulated expression of this 
gene may contribute to neuroendocrine disorders or metabolic diseases, such as obesity and 
diabetes. 

In addition, this gene is expressed at significant levels in a breast cancer cell line 
(CT=28 .6). Thus, expression of this gene could be used to differentiate this sample from 
other samples on this panel and as a marker for breast cancer. 

20 Panel 4B Summary: Ag3471 The CG59564-01 gene, a sorting nexin homolog, is most 
highly expressed in normal colon (CT=30). In addition, this gene is expressed at high to 
moderate levels in a wide range of cell types of significance in the immune response in 
health and disease. These cells include members of the T-cell, B-cell, endothelial cell, 
macrophage/monocyte, and peripheral blood mononuclear cell family, as well as epithelial 

25 and fibroblast cell types from lung and skin, and normal tissues represented by colon, lung, 
thymus and kidney. This ubiquitous pattern of expression suggests that this gene product 
may be i nvolved in homeostatic processes for these and other cell types and tissues. This 
pattern is in agreement with the expression profile in General_screening_panel_vl.4 and 
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also suggests a role for the gene product in cell survival and proliferation. Therefore, 
modulation of the gene product with a functional therapeutic may lead to the alteration of 
functions associated with these cell types and lead to improvement of the symptoms of 
patients suffering from autoimmune and inflammatory diseases such as asthma, allergies, 
5 inflammatory bowel disease, lupus erythematosus, psoriasis, rheumatoid arthritis, and 
osteoarthritis. 

AY. CG 59553-01: Secretory protein SEC8 

Expression of gene CG59553-01 was assessed using the primer-probe set Ag3465, 
described in Table AYA. Results of the RTQ-PCR runs are shown in Tables AYB, AYC 
10 andAYD. 



Table AYA . Probe Name Ag3465 



Primers. 


Sequences 


Length 


Start 
Position 


SEQID 
NO: 


Forward 


5 ' - ttcacagcaagaagatgaacct-3 ' 


22 


616 


547 


Probe 


TET-5 ' -tcatagatgaactacaccggcacctg-3 ' - 
TAMRA 


26 


649 


548 


Reverse 


5 1 -ctcggctagtcgatttgatgt-3 ' 


21 


676 


549 



Table AYB . CNS_neurodegeneration_vl.O 



Tissue Name 


Rel. Exp.(%) Ag3465, 
Run 210376516 


Tissue Name 


Rel. Exp.(%) Ag3465, 
Run 210376516 


AD 1 Hippo 


21.3 


Control (Path) 3 
Temporal Ctx 


8.2 


AD 2 H ippo 


33.0 


Control (Path) 4 
Temporal Ctx 


40.9 


AD 3 H ippo 


11.0 


AD 1 Occipital Ctx 


20.7 


AD 4 H ippo 


11.6 


AD 2 Occipital Ctx 
(Missing) 


0.0 


AD 5 Hippo 


87.7 


AD 3 Occipital Ctx 


10.3 


AD 6 H ippo 


46.7 


AD 4 Occipital Ctx 


24.8 


Control 2 Hippo 


29.7 


AD 5 Occipital Ctx 


40.6 


Control 4 Hippo 


20.4 


AD 6 Occipital Ctx 


25.3 


Control (Path) 3 
Hippo 


14.2 


Control 1 Occipital 

Ctx 


6.7 


AD 1 Temporal 
Ctx 


21.3 


Control 2 Occipital 
Ctx 


59.9 


AD 2 Temporal 


38.7 


Control 3 Occipital 


21.8 
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Ctx 




Ctx 




AD 3 Temporal 
Ctx 


8.2 


Control 4 Occipital 
Ctx 


11.8 


AD 4 Temporal 
Ctx 


30.8 


control (rain) 1 
Occipital Ctx 


79.6 


AD 5 Inf Temporal 
Ctx 


100.0 


Control (Path) 2 
Occipital Ctx 


18.8 


AD 5 Sup 
Temporal Ctx 


58.2 


control (ratn) 5 
Occipital Ctx 


4.0 


AD o in i I emporal 
Ctx 


47.6 


Control (Path) 4 
Occipital Ctx 


25.7 


AD o bup 
Temporal Ctx 


52.1 


Control 1 Parietal 
Ctx 


14.2 


Control 1 
Temporal Ctx 


11.8 


Control 2 Parietal 
Ctx 


56.6 


Control 2 
Temporal Ctx 


42.0 


Control 3 Parietal 
Ctx 


23.8 


Control 3 
Temporal Ctx 


22.8 


Control (Path) 1 
Parietal Ctx 


75.3 


Control 3 
Temporal Ctx 


14.0 


Control (Path) 2 
Parietal Ctx 


29.7 


Control (Path) 1 
Temporal Ctx 


64.6 


Control (Path) 3 
Parietal Ctx 


8.5 


Control (Path) 2 
Temporal Ctx 


47.0 


Control (Path) 4 
Parietal Ctx 


_ _„ _ 
52.9 

_ _ . _ 


Table A YC. General screening_panel vl.4 


Tissue Name 


Rel. Exp.(%) 
Ag3465, Run 
217118990 


Tissue Name 


Rel. Exp.(%) 
Ag3465, Run 
217118990 


Adipose 


13.1 


Renal ca. TK-10 


60.7 


Melanoma* 
Hs688(A).T 


22.5 


Bladder 


37.9 


Melanoma* 
Hs688(B).T 


30.1 


Gastric ca. (liver met.) 
NCI-N87 


42.9 


Melanoma* M14 


54.7 


Gastric ca. KATO III 


48.0 


Melanoma* 
LOXIMVI 


15.5 


Colon ca. SW-948 


5.5 


Melanoma* SK- 
MEL-5 


j 42.0 


Colon ca. SW480 


57.4 


Squamous cell 
carcinoma SCC-4 


8.7 


Colon ca.* (SW480 
met) SW620 


37.4 


r< us Pool 


12.8 


Colon ca. HT29 


25.0 


Prostate ca.* (bone 


63.7 


Colon ca. HCT-116 


28.3 
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met) PC -3 






Prostate Pool 


13.0 


Colon ca CaCo-2 


46.7 


Placenta 


7.1 


Colon cancer tissue 


26.4 


Uterus Pool 


12.0 


Colon ca. SW1116 


8.6 


Ovarian ca. 
OVCAR-3 


37.4 


Colon ca. Colo-205 


6.9 


Ovarian ca. SK- 
OV-3 


21.5 


Colon ca. S W-48 


7.5 


Ovarian ca. 
OVCAR-4 


26.1 


Colon Pool 


21.8 


Ovarian ca. 
OVCAR-5 


42.0 


Small Intestine Pool 


25.5 


Ov<iriEii] cu 
IGROV-l 


23.5 


Stomach Pool 


15.4 


Ovarian ca. 
OVCAR-8 


24.7 


Bone Marrow Pool 


9.4 


Ovary 


14.0 


Fetal Heart 


6.9 


Breast ca. MCF-7 


38.4 


Heart Pool 


11.1 


Breast ca. MDA- 
MB-23I 


49.0 


Lymph Node Pool 


23.8 


Breast ca. BT 549 


45.4 


Fetal Skeletal Muscle 


11 9 


Breast ca. T47D 


74.7 


^kiL'tai Muscle Pool 




Breast ca. MDA-N 


20.4 


Spleen Pool 


39.2 


Breast Pool 


22.8 


Thymus Pool 


39.8 


Trachea 


15.4 


CNS cancer 
(glio/astro) U87-MG 


100.0 


Lung 


6.8 


CNS cancer 

(gli aoruf U-118 KV} 


54.7 


Fetal Lung 


41.5 


CNS cancer 
(neuro;met) SK-N-AS 


50.0 


Lung ca. NCI-N417 


12.2 


CNS cancer (astro) SF- 
539 


19.3 


Lung ca. LX-1 


26.1 


CNS cancer (astro) 
SNB-75 


75.3 


Lungca. NCI-H146 


12.6 


CNS cancer (glio) 
SNB-19 


23.8 


Lung ca. SHP-77 


33.9 


CNS cancer (glio) SF- 
295 


95.3 


Lung ca. A549 


43.8 


Brain < \m\ gdala) Poo! 


11.6 


Lung ca. NCI-H526 


7.6 


Brain (cerebellum) 


12.2 


Lung ca. NCI-H23 


78.5 


Brain (fetal) 


32.5 


Lung ca. NCI-H460 


25.0 


Brain (Hippocampus) 
Pool 


12.7 


Lungca. HOP-62 


28.5 


Cerebral ( 'ortex Pool 


15.8 
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Lung ca. NCI-H522 


25.0 


Brain (Substantia 
nigra) Pool 


11.7 


Liver 


2.0 


Brain (Thalamus) Pool 


17.7 


Fetal Liver 


18.7 


Brain (whole) 


15.6 


Liver ca. HepG2 


15.5 


Spinal Cord Pool 


12.5 


Kidney Pool 


36.6 


Adrenal Gland 


17.7 


Fetal Kidney 


26.8 


Pituitar} gland Pool 


6 0 


Renal ca. 786-0 


55 5 


Salivary Gland 


7.0 


Renal ca. A498 


19.8 


Thyroid (female) 


6.6 


Renal ca. ACHN 


31.0 


Pancreatic ca. 
CAPAN2 


40.3 


Renal ca. UO-31 


48.0 


Pancreas Pool 


28.7 


Table A YD. Panel 4D 


Tissue Name 


Rel. Exp.(%) 
Ag3465, Run 


Tissue Name 


Rel. Exp.(%) 
Ag3465, Run 


Secondary Th 1 act 


22.1 


HUVEC IL-lbeta 


14.6 


Secondary Th2 act 


33.9 


HUVEC IFN gamma 


17.8 


Secondary Trl act 


44.4 


HUVbC lNr alpha + 
IFN gamma 


10.6 


Secondary Thl rest 


33.4 


HUVEC TNF alpha + 
IL4 


8.3 


Secondary Th2 rest 


25.0 


HUVEC IL 1 1 


8.2 


Secondary Trl rest 


29.7 


Lung Microvascular EC 
none 


12.3 


Primary Thl act 


14.3 


Lung Microvascular EC 
TNFalpha + IL-lbeta 


14.8 


Primary Th2 act 


41.2 


Microvascular Dermal 
EC none 


15.5 


Primary Trl act 


46.7 


Microsvasular Dermal 
EC TNFalpha + IL-lbeta 


14.7 


Primary Thl rest 


88.9 


Bronchial epithelium 
TNTalpln -r IT \hcu 


15.5 


Primary Th2 rest 


39.2 


Small airway epithelium 
none 


14.0 


Primary Trl rest 


31.0 


Small airway epithelium 
1 Nl- alpha + IL-lbeta 


65.5 


CD45RA CD4 
1) m| h >c>te act 


20.6 


Coronery artery SMC rest 


18.3 


CD45RO CD4 
lymphocyte act 


29.9 


Coronery artery SMC 
TNFalpha + IL-lbeta 


12.5 


CD8 lymphocyte act 


23.0 


Astrocytes rest 


28.7 
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lymphocyte rest 


24.1 


IL-lbeta 


31.6 


Secondary CD8 
lymphocyte set 


18.7 


KU-812 (Basophil) rest 


19.8 


CD4 lymphocyte none 


19.5 


KU-8 12 (Basophil) 
PMA/ionomycin 


42.6 


2ryThl/Th2/Trl anti- 
CD95 CHI 1 


37.4 


CCD 1106 

(Keratinocytes) none 


21.8 


LAK cells rest 


(CCD 1106 
17.1 jj(Keratinocytes) 

|TNFalpha + IL-1 beta 


100.0 


LAK cells IL-2 


35.8 


Liver cirrhosis 


16.5 


LAK cells IL-2+IL-12 


32.3 


Lupus kidne> 


23.5 


LAK cells IL-2+IFN 
gamma 


38.4 


NCI-H292 none 


48.6 


LAK cells IL-2+ IL-18 


32.5 


NCI-H292 IL-4 


45.1 


LAK cells 
PMA/ionomycin 


12.0 


NCI-H292 IL-9 


49.7 


NK Cell s IL-2 rest 


24.7 


NCI-H292 IL-13 


26.4 


Two Way MLR 3 day 


31.4 


NCI-H292 IFN gamma 


25.3 


Two Way MLR 5 day 


19.6 


HPAEC none 


17.9 


Two Way MLR 7 day 


14.9 


HPAEC TNF alpha + IL- 
1 beta 


20.2 


PBMC rest 


18.4 


Lung fibroblast none 


39.2 


PBMC PWM 


18.7 


Lung fibroblast TNF 
alpha + IL-1 beta 


32.8 


PBMC PHA-L 


10.2 


Lung fibroblast IL-4 


28.3 


Ramos (B cell) none 


61.6 


Lung fibroblast IL-9 


20.4 


Ramos ( B cell) 
ionomycin 


46.7 


Lung fibroblast IL-13 


19.5 


B lymphocytes PWM 


28.1 


Lung fibroblast IFN 
gamma 


26.6 


d iyiiipiiuLyiCa KsLJ'tVL, 
and IL-4 


44.8 


Dermal fibroblast 
CCD1 070 rest 


26.8 


EOL-1 dbcAMP 


33.2 


Dermal fibroblast 
CCD1070 TNF alpha 


50.7 


EOL-1 dbcAMP 
PMA/ionomycin 


25.5 


Dermal fibroblast 
CCD1070 IL-1 beta 


18.4 


Dendritic cells none 


30.1 


Dermal fibroblast IFN 
gamma 


19.2 


Dendritic cells LPS 


19.1 


Dermal fibroblast IL-4 


34.6 


Dendritic cells anti- 
CD40 


33.7 


IBD Colitis 2 


9.0 


Monoc) tes rest 


25.0 


IBD Crohn's 


12.4 


Monocytes LPS 


16.3 


Colon 


56.3 
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Macrophages rest 


44.4 


Lung 


16.4 


Macrophages LPS 


14.4 


Thymus 


49.3 


HUVKC none 


20.4 


Kidney 


52.1 


HUVEC starved 


37.1 







CNS_neurodegeneration_vl.O Summary: Ag3465 This panel does not show differential 
expression of the CG59553-01 gene in Alzheimer's disease. However, this expression 
profile confirms the presence of this gene in the brain. Please see Panel 1 .4 for discussion 
of utility of this gene in the central nervous system. 



5 General_screening_panel_vl.4 Summary: Ag3465 Highest expression of the CG59553- 
01 gene is seen in a brain cancer cell line (CTs=24). Expression of this gene is ubiquitous 
throughout this panel, with significant levels of expression in clusters of cell lines derived 
from brain, renal, colon, lung, breast, ovarian, and melanoma cancers. These high levels of 
expression in all the samples on this panel suggest a role for this gene in cell growth and 
10 proliferation. 

This molecule is also expressed at high levels in all regions of the CNS examined. 
Therefore, therapeutic modulation of the expression or function of this gene may be useful 
in the treatment of neurologic disorders, such as Alzheimer's disease, Parkinson's disease, 
schizophrenia, multiple sclerosis, stroke and epilepsy. 

1 5 Among tissues with metabolic function, this gene is expressed at high to moderate 

levels in pituitary, adipose, adrenal gland, pancreas, thyroid, and adult and fetal skeletal 
muscle, heart, and liver. This widespread expression among these tissues suggests that this 
gene product may play a role in normal neuroendocrine and metabolic and that disregulated 
expression of this gene may contribute to neuroendocrine disorders or metabolic diseases, 

20 such as obesity and diabetes. 



Panel 4B Summary: Ag3465 The CG59553-01 gene is expressed at high to moderate 
levels in a wide range of cell types of significance in the immune response in health and 
disease. These cells include members of the T-cell, B-cell, endothelial cell, 
macrophage/monocyte, and peripheral blood mononuclear cell family, as well as epithelial 
25 and fibroblast cell types from lung and skin, and normal tissues represented by colon, lung, 
thymus and kidney. This ubiquitous pattern of expression suggests that this gene product 
may be involved in homeostatic processes for these and other cell types and tissues. This 
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pattern is in agreement with the expression profile in General_screening_panel_vl.5 and 
also suggests a role for the gene product in cell survival and proliferation. Therefore, 
modulation of the gene product with a functional therapeutic may lead to the alteration of 
functions associated with these cell types and lead to improvement of the symptoms of 
5 patients suffering from autoimmune and inflammatory diseases such as asthma, allergies, 
inflammatory bowel disease, lupus erythematosus, psoriasis, rheumatoid arthritis, and 
osteoarthritis. 



AZ. CG59435-01 and CG59435-02: Human Neddl 

Expression of gene CG59435-01 and CG59435-02 was assessed using the primer- 
10 probe set Ag3437, described in Table AZA. Results of the RTQ-PCR runs are shown in 
Tables AZB, AZC and AZD. Please note that CG59435-02 represents a full-length physical 
clone of the CG59435-01 gene, validating the prediction of the gene sequence. 



Table AZA . Probe Name Ag3437 



Primers 


Sequences 


Length 


Start 
Position 


SEQID 
NO: 


Forward 


5 ' -tggtgctgaaagtggaaatc-3 * 


20 


1536 


550 


Probe 


TET-5 1 -cctctccatcatctaaccaaacaaga- 3 ' - 
TAMRA 


26 


1562 


551 


Reverse 


5 ' -tgggcttcaatttcattctct-3 ' 


21 


1611 


552 



Table AZB . CNS neurodegeneration vl.O 



Tiss ue Name 


Rel. Exp.(%) Ag3437, 
Run 210374394 


Tissue Name 


Rel. Exp.(%) Ag3437, 
Run 210374394 


AD 1 Hippo 


8.9 


Control (Path) 3 
Temporal Ctx 


6.9 


AD 2 Hippo 


25.7 


Control (Path) 4 
Temporal Ctx 


27.9 


AD 3 Hippo 


18.2 


AD 1 Occipital 
Ctx 


26.6 


AD 4 Hippo 


13.2 


AD 2 Occipital 
Ctx (Missing) 


0.0 


AD 5 hippo 


52.9 


AD 3 Occipital 
Ctx 


7.6 


AD 6 Hippo 


100.0 


AD 4 Occipital 
Ctx 


26.6 


Control 2 Hippo 


26.1 


AD 5 ■ )ccipital 

Ctx 


26.8 
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Control 4 Hippo 


26.2 


AD 6 Occipital 
Ctx 


21.8 


control (ram) 5 
Hippo 


14.7 


Control 1 Occipital 
Ctx 


9.0 


AD 1 Temporal Ctx 


35.8 


Control 2 Occipital 
Ctx 


23.5 


AD 2 Temporal Ctx 


27.7 


Control 3 Occipital 
Ctx 


17.8 


AD 3 Temporal Ctx 


14.6 


Control 4 Occipital 
Ctx 


14.6 


AD 4 Temporal Ctx 


23.3 


Control (Path) 1 
Occipital Ctx 


70.2 


AD 5 Inf Temporal 
Ctx 


65.5 


Control (Path) 2 
Occipital Ctx 


12.9 


AD 5 SupTemporal 
Ctx 


47.0 


Control (Path) 3 
Occipital Ctx 


4.9 


AD 6 Inf Temporal 
Ctx 


78.5 


Control (Path) 4 
Occipital Ctx 


22.2 


AD 6 Sup Temporal 
Ctx 


92.0 


Control 1 Parietal 
Ctx 


13.3 


Control 1 Temporal 
Ctx 


10.9 


Control 2 Parietal 
Ctx 


50.0 


Control 2 Temporal 
Ctx 


23.5 


Control 3 Parietal 
Ctx 


13.1 


Control 3 Temporal 
Ctx 


17.9 


Control (Path) 1 
Parietal Ctx 


35.4 


Control 4 Temporal 

Ctx 


12.8 


Control (Path) 2 
Parietal Ctx 


26.6 


Control (Path) 1 
Temporal Ctx 


37.4 


Control (Path) 3 
Parietal Ctx 


5.4 


Control (Path) 2 
Temporal Ctx 


44.4 


Control (Path) 4 
Parietal Ctx 


29.3 



Table AZC . General_screening_panel_vl.4 



Tissue Name 


Rel. Exp.(%) 
Ag3437, Run 
217066730 


Tissue Name 


Rel. Exp.(%) 
Ag3437, Run 
217066730 


Adipose 


10.0 


Renal ca.TK-10 


24.0 


Melanoma* 
Hs688(A).T 


25.7 


Bladder 


18.6 


Melanoma* 
Hs688(B).T 


27.5 


Gastric ca. (liver met.) 
NCI-N87 


100.0 


Melanoma* Ml 4 


34.9 


Gastric ca. KATO III 


60.3 


Melanoma* 


3 1 .9 


Colon ca. SW-948 


9.7 
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Melanoma* SK- 
MEL-5 


8.7 


Colon ca. S W480 


61.6 


Squamous cell 
carcinoma SCC-4 


24 8 


Colon ca.* (SW480 
met) SW620 


46.3 


Testis Pool 


25.9 


Colon ca. HT29 


22.7 


Prostate ca.* (bone 
met) PC-3 


84.1 


Colon ca. HCT-116 


72.2 


Prostate Pool 


12.3 


Colon ca. CaCo-2 


32.8 


Placenta 


0.3 


Colon cancer tissue 


38.2 


Uterus Pool 


11.4 


Colon ca. SW1116 


7.5 


Ovarian ca. 
OVCAR-3 


33.2 


Colon ca. Colo-205 


6.7 


Ovarian ca. SK- 
OV-3 


92.7 


Colon ca. S W-48 


7.1 


Ovarian ca. 
OVCAR-4 


9.7 


Colon Pool 


27.2 


Ovarian ca. 
OVCAR-5 


26.4 


— —— 

Small Intestine Pool 


22.8 


Ovarian ca. 
IGROV-1 


16.5 


Stomach Pool 


12.3 


Ovarian ca. 
OVCAR-8 


6.5 


Bone Marrow Pool 


13.5 


Ovary 


7.6 


Fetal Heart 


21.9 


Breast ca. MCF-7 


24.3 


Heart Pool 


11.7 


Breast ca. MDA- 
MB-231 


84.1 


Lymph Node Pool 


30.4 


Breast ca. BT 549 


68.3 


Fetal Skeletal Muscle 


1^ 2 


Breast ca. T47D 


52.1 


Skeletal Muscle Pool 


28.5 


Breast ca. MDA-N 


18.9 


Spleen Pool 


15.8 


Breast Fool 


26.6 


Thymus Pool 


21.2 


Trachea 


9.2 


CNS cancer 
(glio/astro) U87-MG 


14.0 


Lung 


4.9 


CNS cancer 
(glio/astro) U-118-MG 


91.4 


Fetal Lung 


49.0 


CNS cancer 
(neuro;met) SK-N-AS 


55.5 


Lung ca. NCI-N417 


4.9 


539 


14.5 


Lung ca. LX-1 


37.4 


CNS cancer (astro) 
SNB-75 


33.0 


Lungca. NCI-H146 


6.8 


CNS cancer (glio) 
SNB-19 


12.5 


Lung ca. SHP-77 


51.4 


CNS cancer (glio) SF- 
295 


51.4 
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Lung ca. A549 


33.9 


Brain (Amygdala) Pool 


0.0 


Lung ca. NCI-H526 


8.4 


Brain (cerebellum) 


1.5 


Lung ca. NCI-H23 


34.4 


Brain (fetal) 


6.2 


Lung ca. lNd-rl4oU 


98.6 


Brain (Hippocampus) 
Pool 


3.0 


Lung ca. HOP-62 


15.2 


Cerebral Cortex Pool 


3.0 


Lung ca. NCI-H522 


37.9 


Brain (Substantia 
nigra) Pool 


3.6 


Liver 


0.4 


Brain (Thalamus) Pool 


3.9 


Fetal Liver 


26.8 


Brain (whole) 


1.1 


Liver ca. HepG2 


11.3 


Spinal Cord Pool 


5.0 


Kidney Pool 


23.2 


Adrenal Gland 


3.3 


Fetal Kidney 


42.9 


Pituitary gland Pool 


4.5 


Renal ca. 786-0 


41.5 


Salivary Gland 


0.9 


Renal ca. A498 


12.9 


Thyroid (female) 


4.2 


Renal ca. ACHN 


20.2 


Pancreatic ca. 
CAPAN2 


20.7 


Renal ca.UO-31 


29.7 


Pancreas Pool 


19.8 



Table AZD . Panel 4. ID 



Tissue Name 


Rel. Exp.(%) 
Ag3437, Run 
169839068 


Tissue Name 


Rel. Exp.(%) 
Ag3437, Run 
169839068 


Secondary Thl act 


44.8 


HUVEC 11,-1 beta 


23.2 


Secondary Th2 act 


57.8 


HUVEC I FN gamma 


26.2 


Secondary Trl act 


60.7 


HUVEC TNF alpha + 
IFN gamma 


18.6 


Secondary Thl rest 


10.2 


HUVEC TNF alpha + 
IL4 


16.7 


Secondary Th2 rest 


14.3 


HUVEC IL-11 


10.9 


Secondary Trl rest 


13.9 


Lung Microvascular EC 
none 


28.1 


Primary Thl act 


37.4 


Lung Microvascular EC 
TNFalpha + IL-lbeta 


25.3 


Primary Th2 act 


34.9 


Microvascular Dermal 
EC none 


19.9 


Primary Trl act 


39.0 


Microsvasular Dermal 
EC TNFalpha + IL-lbeta 


17.3 


Primary Thl rest 


17.8 


Bronchial epithelium 
TNFalpha + IL1 beta 


20.9 


Primary Th2 rest 


14.7 


Small airway epithelium 
none 


4.9 


Primary Trl rest 


23.2 


Small airway epithelium 


20.4 
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TNFalpha + IL-lbeta 




CD45RA CD4 
lymphocyte act 


39.0 


Coronery artery SMC rest 


10.7 


CD45RO CD4 
lymphocyte set 


37.4 


Coronery artery SMC 
TNFalpha + IL-lbeta 


10.7 


CD8 lymphocyte act 


31.9 


Astrocytes rest 


9.2 


Secondary CD8 
lymphocyte rest 


33.7 


Astrocytes TNFalpha + 
IL-lbeta 


6.4 


Secondary CD8 
lymphocyte act 


21.8 


KU-812 (Basophil) rest 


36.6 


CD4 lymphocyte none 


10.4 


KU-812 (Basophil) 
PIVlA/ionomycin 


100.0 


2ry Thl/Th2/Trl_anti- 
pnos phi 1 


15.7 


CCD 1106 

(Keratinocytes) none 


21.9 


LAK cells rest 


21.8 


(Keratinocytes) 
TNFalpha + IL-lbeta 


29.9 


LAK cells IL-2 


24.1 


Liver cirrhosis 


6.0 


LAK cells IL-2+IL-12 


33.2 


NCI-H292 none 


13.0 


LAK cells IL-2+IFN 
gamma 


38.4 


NCI-H292 IL-4 


25.2 


LAK cells IL-2+ IL-18 


33.9 


NCI-H292 IL-9 


32.8 


LAK cells 
PMA/ionomycin 


9.3 


NCI-H292IL-13 


26.2 


NK Cells IL-2 rest 


27.9 


NCI-H292 IFN gamma 


37.9 


Two Way MLR 3 day 


13.2 


HPAEC none 


13.7 


Two Way MLR 5 day 


25.3 


HPAEC TNF alpha + IL- 
1 beta 


30.8 


Two Way MLR 7 day 


23.8 


Lung fibroblast none 


12.1 


PBMC rest 


9.1 


Lung fibroblast TNF 
alpha + IL-1 beta 


9.5 


PBMC PWM 


25.9 


Lung fibroblast IL-4 


11.7 


PBMC PHA-L 


27.7 


Lung fibroblast IL-9 


19.3 


Ramos (B cell) none 


23.5 


Lung fibroblast IL-1 3 


11.2 


Ramos (B cell) 
ionomycin 


23.0 


Lung fibroblast IFN 
gamma 


19.5 


B lymphocytes PWM 


36.3 


Dermal fibroblast 
CCD 1070 rest 


66.9 


B lymphocytes CD40L 
and IL-4 


21.5 


Dermal fibroblast 
CCD1070 TNF alpha 


70.2 


EOL-1 dbcAMP 


21.0 


Dermal fibroblast 
CCD 1070 IL-1 beta 


46.3 


EOL-1 dbcAMP 
PMA/ionomycin 


19.8 


Dermal fibroblast IFN 
gamma 


17.1 


Dendritic cells none 


10.2 


Dermal fibroblast IL-4 


21.5 
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Dendritic cells LPS 


10.8 


Dermal Fibroblasts rest 


8.9 


Dendritic cells anti- 
CD40 


9.1 


Neutrophils TNFa+LPS 


0.5 


Monocytes rest 


10.1 


Neutrophils rest 


5.7 


Monocytes LPS 


11.6 


Colon 


5.5 


Macrophages rest 


13.9 


Lung 


10 7 


Macrophages LPS 


15.7 


Thymus 


39.2 


HUVEC none 


11.8 


Kidney 


8.8 


HUVEC starved 


18.7 







CNS_neurodegeneration_vl.O Summary: Ag3437 This panel confirms the expression of 
CG59435-01 gene at low levels in the brains of an independent group of individuals. 
However, no differential expression of this gene was detected between Alzheimer's 
diseased postmortem brains and those of non-demented controls in this experiment. Please 
5 see Panel 1 .4 for a discussion of the potential utility of this gene in treatment of central 
nervous system disorders. 

General_screening_panel_vl.4 Summary: Ag3437 The CG59435-01 is gene is 
ubiquitously exrpressed in this panel, with highest expression in a gastric cancer cell line 
(CT=26.5). In addition, significant levels of expression are evident in cell lines from brain 
1 0 cancer, colon cancer, ovarian cancer, breast cancer, prostate cancer and lung cancer. Thus, 
expression of this gene could be used as a marker to detect the presence of these cancers. 
Furtheraiore, therapeutic modulation of the expression or function of this gene may be 
effective in the treatment of these cancers. 

In addition, this gene is expressed at moderate to low levels in pituitary, adipose, 
1 5 adrenal gland, pancreas, thyroid, and adult and fetal skeletal muscle, heart, and liver. This 
widespread expression among metabolic tissues suggests that this gene product may play a 
role in normal neuroendocrine and metabolic and that disregulated expression of this gene 
may contribute to neuroendocrine disorders or metabolic diseases, such as obesity and 
diabetes. 

20 In addition, the CG59435-01 gene encodes a homologue of mouse NEDD1 protein. 

Nedd is an acronym of "neural precursor cell expressed developmentally and down- 
regulated" (Ref 1) The developmentally regulated mouse gene Neddl encodes a protein 
with similarities to the beta subunit of heterotrimeric GTP-binding proteins that has growth 
suppressing activity when overexpressed in various cultured cell types. Neddl mRNA is 
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shown to be strongly expressed in early embryonic brain and may play a role in the 
differentiation-coupled growth arrest in neuronal cells (Ref. 2). The moderate to low levels 
(CT=30--33) in all regions of the central nervous system examined suggest that this gene 
product may also play a role in the differentiation-coupled growth arrest in neuronal 
5 cells.Fuithermore, this gene may play a role in central nervous system disorders such as 
Alzheimer's disease, Parkinson's disease, epilepsy, multiple sclerosis, schizophrenia and 
depression. 

References: 

1 . Kumar S, Tomooka Y, Noda M. (1992) Identification of a set of genes with 
10 developmentally down-regulated expression in the mouse brain. Biochem Biophys Res 

Commun 185(3): 1155-61 

2. Kumar S, Matsuzaki T, Yoshida Y, Noda M. (1994) Molecular cloning and 
biological activity of a novel developmentally regulated gene encoding a protein with beta- 
transducin-like structure. J Biol Chem 269(1 5): 1 1318-26. 

15 

Panel 4.1D Summary: Ag3437 The CG59435-01 is gene is ubiquitously exrpressed in 
this panel, with highest expression in the basophil cell line KU-812 treated with 
PMA/ionomycin (CT=27.9). This gene is expressed at high to moderate levels in a wide 
range of cell types of significance in the immune response in health and disease. These 

20 cells include members of the T-cell, B-cell, endothelial cell, macrophage/monocyte, and 
peripheral blood mononuclear cell family, as well as epithelial and fibroblast cell types 
from lung and skin, and normal tissues represented by colon, lung, thymus and kidney. 
This ubiquitous pattern of expression suggests that this gene product may be involved in 
homeostatic processes for these and other cell types and tissues. This pattern is in 

25 agreement with the expression profile in General_screening_panel_vl .4 and also suggests a 
role for the gene product in cell survival and proliferation. Therefore, modulation of the 
gene product with a functional therapeutic may lead to the alteration of functions 
associated with these cell types and lead to improvement of the symptoms of patients 
suffering from autoimmune and inflammatory diseases such as asthma, allergies, 

30 inflammatory bowel disease, lupus erythematosus, psoriasis, rheumatoid arthritis, and 
osteoarthritis. 
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BA. CG59439-01 and CG59439-02: Xenobiotic/medium-chain fatty acid:CoA 
ligase form XL-III 

Expression of gene CG59439-01 was assessed using the primer-probe set Ag3438, 
described in Table BAA. Results of the RTQ-PCR runs are shown in Table BAB. Please 
note that CG59439-02 represents a full-length physical clone of the CG59439-01 gene, 
validating the prediction of the gene sequence. 



Table BAA . Probe Name Ag3438 



Primers 


Sequences 


Length 


Start 
Position 


SEQID 
NO: 


Forward 


5 ' -accccattaaccacttttgg-3 1 


20 


938 


553 


Probe 


TET-5 ' -tcatctatatatcgaatgattctgcagca- 
3 ' -TAMRA 


29 


964 


554 


Reverse 


5 ' -gaacctgatgctggtgaaatc-3 1 


21 


994 


555 



Table BAB . Panel 4. ID 



Tissue Name 


Rel. Exp.(%) 
Ag3438, Run 
198383568 


Tissue Name 


Rel. Exp.(%) 
Ag3438, Run 
198383568 


Secondary Thl act 


4.0 


HUVEC IL-lbeta 


0.0 


Secondary Th2 act 


100.0 


HUVEC I FN gamma 


0.0 


Secondary Trl act 


25.9 


HUVEC TNF alpha + 
IFN gamma 


0.0 


Secondary Thl rest 


0.0 


HUVEC TNF alpha + 
IL4 


0.0 


Secondary Th2 rest 


0.0 


HUVEC IL-1 1 


0.0 


Secondary Trl rest 


0.0 


Lung Microvascular EC 
none 


0.0 


Primary Thl act 


0.0 


Lung Microvascular EC 
TNFalpha + IL-lbeta 


0.0 


Primary Th2 act 


0.0 


Microvascular Dermal 
EC none 


0.0 


Primary Trl act 


0.0 


Microsvasular Dermal 
EC TNFalpha + IL-lbeta 


0.0 


Primary Thl rest 


0.0 


Bronchial epithelium 
TNFalpha + ILlbeta 


9.4 


Primary Th2 rest 


0.0 


Small airway epithelium 
none 


7.0 


Primary Trl rest 


0.0 


Small airway epithelium 


13.8 
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TNFalpha + IL-lbeta 




CD45RA CD4 
lymphocyte act 


16.3 


Coronery artery SMC rest 


0.0 


CD45RO CD4 
lymphocyte act 


0.0 


Coronery artery SMC 
TNFalpha + IL-lbeta 


0.0 


CDS lymphocyte act 


0 0 


Astrocytes rest 


0.0 


Secondary CD8 
lymphocyte rest 


0.0 


Astrocytes TNFalpha + 
IL-lbeta 


0.0 


Secondary CD8 
lymphocyte act 


0.0 


KU-812 (Basophil) rest 


25.9 


CD4 lymphocyte none 


12.9 


KU-812 (Basophil) 
P^lA/ionomycin 


10.5 


2ry Thl/Th2/Trl_anti- 
CD95 CH11 


0.0 


CCD1106 


0.0 


LAK cells rest 


6.9 


CCD1 106 
(Keratinocytes) 
TNFalpha + IL-lbeta 


0.0 


LAK cells IL-2 


6.1 


Liver cirrhosis 


20.4 


LAK cells IL-2+IL-12 


7.7 


NCI-H292 none 


0.0 


LAK cells IL-2+IFN 
gamma 


13.6 


NCI-H292 IL-4 


6.7 


LAK cells IL-2+ IL-18 


14.2 


NCI-H292 11,-9 


11 7 


LAK cells 
PMA/ionomycin 


0.0 


NCI-H292IL-13 


25.9 


NK Cells IL-2rest 


0.0 


NCI-H292 IFN gamma 


17.2 


Two Way MLR 3 day 


14.9 


HPAEC none 


0.0 


Two Way MLR 5 day 


0.0 


HPAEC TNF alpha + IL- 
1 beta 


0.0 


Two Way MLR 7 day 


0.0 


Lung fibroblast none 


0.0 


PBMC rest 


4.4 


Lung fibroblast TNF 
alpha + IL-1 beta 


0.0 


PBMC PWM 


2.7 


Lung fibroblast IL-4 


0.0 


PBMC PHA-L 


21.2 


Lung fibroblast IL-9 


0.0 


Ramos (B cell) none 


0.0 


Lung fibroblast IL-I3 


0.0 


Ramos (B cell) 
ionomycin 


0.0 


Lung fibroblast IFN 
gamma 


0.0 


B lymphocytes PWM 


0.0 


Dermal fibroblast 
CCD 1070 rest 


0.0 


B lymphocytes CD40L 
and IL-4 


11.1 


Dermal fibroblast 
CCD 1070 TNFalpha 


0.0 


EOL-1 dbcAMP 


12.2 


Dermal fibroblast 
CCD1 070 IL-1 beta 


0.0 


EOL-1 dbcAMP 
PMA/ionomycin 


0.0 


Dermal fibroblast IFN 
gamma 


0.0 


Dendritic cells none 


69 , 


Dermal fibroblast IL-4 


0.0 
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Dendritic cells LPS 


0.0 


Dermal Fibroblasts rest 


0.0 


Dendritic cells anti- 
CD40 


0.0 


Neutrophils TNFa+LPS 


0.0 


Monocytes rest 


6.4 


Neutrophils rest 


4.8 


Monocytes LPS 


00 


Colon 


5.5 


Macrophages rest 


70 


Lung 


0.0 


Macrophages LPS 


0.0 


Th>mus 


3.6 


HUVEC none 


00 


Kidney 


22 4 


HUVEC starved 


0.0 







CNS_neurodegeneration_vl.O Summary: Ag3438 Expression of the CG59439-02 gene 
is low/undetectable in all samples on this panel (CTs>35). (Data not shown.) 



General_screening_panel_vl.4 Summary: Ag3438 Results from one experiment with 
the CG59439-02 gene are not included. The amp plot indicates that there were 
5 experimental difficulties with this run. 

Panel 4.1D Summary: Ag3438 Expression of the CG59439-02 gene is restricted to a 
sample derived from chronically activated Th2 cells (CT=33). 

Panel 4D Summary: Ag3438 Results from one experiment with the CG59439-02 gene 
are not included. The amp plot indicates that there were experimental difficulties with this 
10 run. 

BB. CG59354-01 and CG59354-02 and CG59354-03: phosducin-like protein 

Expression of gene CG59354-01 and variant CG59354-02 was assessed using the 
primer-probe set Ag3553, described in Table BBA. Results of the RTQ-PCR runs are 
shown in Tables BBB, BBC and BBD. Please note that CG59354-03 represents a full- 
15 length physical clone of the CG59354-01 gene, validating the prediction of the gene 
sequence. 



Table BE 


\A. Probe Name Ag3553 


Primers 


Sequences 


Length 


Start 
Position 


SEQ ID 
NO: 


Forward 


5 1 -tctatttccaggtcgctatcct-3 ' 


22 


1778 


556 


Probe 


TET-5 ' -acgcacagatgtcagcaccaagactt-3 ' - 
TAMRA 


26 


1822 


557 


Reverse 


5 ' -ggaatttggattactcccagaa-3 ' 


22 


1852 


558 



Table BBB . CNS neurodegeneration vl.O 
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Tissue Name 


Run 210641082 


Tissue Name 


Kei. JtiXp.^ /o) AgOSDO, 

Run 210641082 


AD 1 Hippo 


11.3 


control grainy j 
Temporal Ctx 


3.7 


AD 2 Hippo 


17.8 


control (rain) t 
Temporal Ctx 


19.6 


AD 3 Hippo 


4.8 


JAU 1 VJCLipildl 

Ctx 


15.6 


AD 4 Hippo 


4.6 


AD 2 Occipital 
Ctx (Missing) 


0.0 


AD 5 hippo 


70.2 


AD 3 Occipital 
Ctx 


4.5 


AD 6 Hippo 


55.5 


AD 4 Occipital 
Ctx 


12.5 


Control 2 Hippo 


20.3 


AD 5 Occipital 
Ctx 


28.7 


Control 4 Hippo 


14.5 


AD 6 Occipital 
Ctx 


32.1 


Control (Path) 3 
Hippo 


7.0 


Control 1 Occipital 

Ctx 


3.5 


AD 1 Temporal Ctx 


15.1 


Control 2 Occipital 
Ctx 


51.1 


AD 2 Temporal Ctx 


18.8 


Control 3 Occipital 
Ctx 


14.4 


AD 3 Temporal Ctx 


3.4 


Control 4 Occipital 
Ctx 


4.7 


AD 4 Temporal Ctx 


11.7 


Control (Path) 1 
Occipital Ctx 


64.6 


AD 5 Inf Temporal 
Ctx 


100.0 


Control (Path) 2 
Occipital Ctx 
Control (Path) 3 
Occipital Ctx 


8.0 


AD 5 SupTemporal 
Ctx 


50.7 


3.9 


AD 6 In f Temporal 
Ctx 


69.7 


Control (Path) 4 
Occipital Ctx 


11.8 


AD 6 Sup Temporal 
Ctx 


66.9 


Control 1 Parietal 
Ctx 


7.0 


Control 1 Temporal 
Ctx 


. o {Control 2 Parietal 
JCtx 


41.2 


Control 2 Temporal 
Ctx 


_ , , Control 3 Parietal 
26.6 _ 
JCtx 


12.0 


Control 3 Temporal 
Ctx 


[Control (Path) 1 
parietal Ctx 


62.0 


Control 4 Temporal 
Ctx 


7.5 


Control (Path) 2 j ^ j g 
Parietal Ctx j 


Control (Path) 1 
Ft n poral Ctx 


44.8 


( onir< 1 (Path) 3 J 
Parietal Ctx j 
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Control (Path) 2 


24.8 


Control (Path) 4 


41.8 


Temporal Ctx 


Parietal Ctx 



Table BBC . General_screening_panel_vl.4 



Tissue Name 


Rel. Exp.(%) 
Ag3553, Run 
217049381 


Tissue Name 


Rel. Exp.(%) 
Ag3553, Run 
217049381 


Adipose 


5.2 


Renal ca. 1 K-10 


40.6 


Melanoma* 
Hs688(A).T 


40.3 


Bladder 


29.9 


Melanoma* 

rlbOoo^ij ). l 


31.9 


Gastric ca. (liver met.) 
NCI-N87 


39.5 


Melanoma* M14 


41.5 


Gastric ca. KATO III 


— 


Melanoma* 

LUA1M VI 


37.1 


Colon ca. SW-948 


9.0 


Melanoma* SK- 
MEL-5 


25.9 


Colon ca. SW480 


68.3 


Squamous cell 
carcinoma SCC-4 


17.9 


Colon ca.* (SW480 
met) SW620 


23.5 


Testis Pool 


5.4 


Colon ca. HT29 


20.0 


Prostate ca.* (bone 
met) PC-3 


31.4 


Colon ca. HCT-116 


66.0 


Prostate Pool 


9.0 jColon ca. CaCo-2 


32.8 


Placenta 


4.0 |Colon cancer tissue 




Uterus Pool 


5.5 


Colon ca. SW111 6 


4.0 


Ovarian ca. 
OVCAR-3 


40.1 


Colon ca. Colo-205 


11.3 


Ovarian ca. SK- 
OV-3 


47.3 


Colon ca. SW-48 


9.9 


Ovarian ca. 
OVCAR-4 


11.4 


Colon Pool 


18.4 


Ovarian ca. 
OVCAR-5 


37.9 


Small Intestine Pool 


11.7 


Ovarian ca. 
IGROV-1 


25.0 


Stomach Pool 


14.9 


Ovarian ca. 
OVCAR-8 


16.6 


Bone Marrow Pool 


6.9 


Ovary 


10.2 


Fetal Heart 


4.6 


Breast ca. MCF-7 


42.6 


Heart Pool 


6.6 


Breast ca. MDA- 
MB-231 


50.7 


Lymph Node Pool 


21.3 


Breast ca. BT 549 


81.8 


Fetal Skeletal Muscle 


3.8 


Breast ca. T47D 


85.9 


Skeletal Muscle Pool 


6.0 
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Breast ca. MDA-N 


33.0 


Spleen Pool 




13.9 


Breast F'ool 


17.3 


Thymus Pool 


11.1 


Trachea 


16.4 


CNS cancer 
(glio/astro) U87-MG 


44.4 


Lung 


5.3 


CNS c sneer 
(glio/astro) U-118-MG 


45.4 


Fetal Lu ng 


33.7 


CNS cancer 
(neuro;met) SK-N-AS 


44.8 


Lung ca.NCI-N417 


5.8 


i^ino cancer ^asiroj or- 
539 


31.0 


Lung ca. LX-1 


22.7 


CNS cancer (astro) 
SNB-75 


100.0 


Lung ca.NCI-H146 


16.7 


CNS cancer (glio) 
SNB-19 


27.4 


Lung ca. SHP-77 


59.5 


CNS cancer (glio) SF- 
295 


59.5 


Lung ca. A549 


41.5 


Brain (Amygdala) Pool 


8.3 


Lung ca. NCI-H526 


5.4 


Brain (cerebellum) 


8.0 


Lung ca. NCI-H23 


31.0 


Brain (fetal) 


23 8 


Lung ca. NCI-H460 


33.4 


Brain (Hippocampus) 
Pool 


10.7 


Lung ca. HOP-62 


21.9 


Cerebral Cortex Pool 


15.1 


Lung ca. NCI-H522 


17.8 


Brain (Substantia 
nigra) Pool 


11 0 


Liver 




0.5 


Brain (Thalamus) Pool 


16.6 


Fetal Liver 




18.3 


Brain (whole) 


11.8 


Liver ca. HepG2 




9.2 


Spinal Cord Pool 


1 1.7 


Kidney Pool 




29.5 


Adrenal Gland 


6.5 


Fetal Kidney 




16.2 


Pituitary gland Pool 


4.7 


Renal ca. 786-0 




57.0 


Salivary Gland 


9.2 


Renal ca. A498 




7.9 


Thyroid (female) 


8.3 


Renal ca. ACHN 




20.9 


Pancreatic ca. 
CAPAN2 


30.4 


Renal ca. UO-31 




29.1 


Pancreas Pool 
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Table BBD. Panel 4D 


Tissue Name 


Rel. Exp.(%) 
Ag3553, Run 
166487505 


1 Tissue Name 

| 




ReL Exp.(%) 
Ag3553, Run 
166487505 


Secondary Thl act 




45.4 


jHUVEC IL-lbeta 




17.0 


Secondary Th2 act 




42.9 


HUVEC 1FN gamma 


39.2 


Secondary Trl act 


58.6 


HUVEC TNF alpha + 
1J N aamma 


27.4 
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Secondary Th 1 rest 


6.7 


HUVECTNF alpha + 
IL4 


31.4 


Secondary Th2 rest 


14.5 


HUVEC IL-11 


23.7 


Secondairy Trl rest 


15.0 


Lung Microvascular EC 
none 


41.2 


Primary Th 1 act 


38.4 


Lung Microvascular EC 
TNFalpha + IL-lbeta 


28.3 


Primary Th2 act 


24.5 


Microvascular Dermal 
EC none 


59.0 


Primary Trl act 


32.8 


Microsvasular Dermal 
EC TNFalpha + IL-lbeta 


23.5 


Primary Thl rest 


57.0 


Bronchial epithelium 
TNFalpha + ILlbeta 


21.5 


Primary Th2 rest 


43.2 


Small airway epithelium 


5.8 


Primary Trl rest 


39.0 


Small airway epithelium 
TNFalpha + IL-lbeta 


52.5 


CD45RA CD4 


25.5 


Coronery artery SMC rest 


14.3 


CD45RO CD4 


34.6 


Coronery artery SMC 
TNFalpha + IL-lbeta 


12.0 


CD8 lymphocyte act 


28.1 


Astrocytes rest 


18.2 


Secondary CD8 
lymphocyte rest 


30.8 


Astrocytes TNFalpha + 
IL-lbeta 


11.3 


Secondary CD8 
lymphocyte act 


21.9 


KU-812 (Basophil) rest 


19.6 


CD4 lymphocyte none 


6.0 


KU-8 1 2 (Basoph i 1) 
PMA/ionomycin 


71.2 


2ry Thl/Th2/Trl_anti- 
CD95CH11 


21.9 


CCD1106 

(Keratinocytes) none 


31.6 


LAK cells rest 


24.8 


CCD1 106 
(Keratinocytes) 
TNFalpha + IL-lbeta 


9.0 


LAK celislL-2 


32.8 


Liver cirrhosis 


4.1 


LAK cells IL-2+1L-12 


24.0 


Lupus kidney 


4.0 


LAK cells IL-2+1FN 
gamma 


37.6 


NCI-H292 none 


35.1 


LAK cells IL-2+ IL-18 


27.2 


NCI-H292 IL-4 


39.2 


LAK cells 
PMA/ionomycin 


16.8 


NCI-H292 IL-9 


49.0 


NK Cells IL-2 rest 


23.5 


NCI-H292IL-13 


28.7 


Two Way MLR 3 day 


21.5 


NCI-H292 IFN gamma 


29.5 


Two Way MLR 5 day 


16.3 


HPAEC none 


38.2 


Two Way MLR 7 day 


17.6 


HPAEC TNF alpha + IL- 
1 beta 


27.2 
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PBMC rest 


8.0 


Lung fibroblast none 


9.4 




PBMC PWM 


91.4 


Lung fibroblast TNF 
alpha + IL-1 beta 


11.1 




PBMC PH \-L 


33.9 


Lung fibroblast IL-4 


34.6 




Ramos (B cell) none 


29.5 


Lung fibroblast IL-9 


21.3 




ivamos ( t> CcII^ 

ionomyc in 


85.3 


Lung fibroblast IL-1 3 


15.8 




B lymphocytes PWM 


100.0 


Lung fibroblast IFN 
gamma 


39.2 




B lymphocytes CD40L 


33.0 


Dermal fibroblast 


45.1 




and IL-4 


CCD 1070 rest 


s 


EOL-1 dbcAMP 


35.8 


Dermal fibroblast 
CCD 1070 TNF alpha 


75.3 


■J 


EOL-1 dbcAMP 


46.7 


Dermal fibroblast 


' 37.4 


m 


PMA/ionomycin 


CCD1070 IL-1 beta 


o 


Dendritic cells none 


14.3 


Dermal fibroblast IFN 
gamma 


15.7 




Dendritic cells LPS 


15.2 


Dermal fibroblast IL-4 


25.9 




Dendritic cells anti- 
CD40 


22.2 


IBD Colitis 2 


1.4 




ivtnnoe te^ irsi 


10.4 


IBD Crohn's 


1.5 




Monocytes 1 .PS 


16.7 


Colon 


16.7 




Macrophages re>t 


27.9 


Lung 


12.2 




Macrophages LPS 


5.7 


Thymus 


28.7 




HUVEC none 


29.5 


Kidney 


28.5 




HUVEC starved 


67.4 







CNS neurodegeneration vl.O Summary: Ag3553 This panel confirms the expression of 
CG59354-03 gene at low levels in the brains of an independent group of individuals. 
However, no differential expression of this gene was detected between Alzheimer's 
diseased postmortem brains and those of non-demented controls in this experiment. Please 
5 see Panel 1 .4 for a discussion of the potential utility of this gene in treatment of central 
nervous system disorders. 

General_screening_panel_vl.4 Summary: Ag3553 The CG59354-03 gene is 
ubiquitously expressed in this panel, with highest expression in a brain cancer cell line 
(CT=25.9). In addition, significant levels of expression are seen in cell lines derived from 
10 colon, breast, ovarian, renal, lung, prostate, and melanoma cancers. Furthermore, higher 
levels of expression are seen in fetal liver and lung (CTs=27-28) when compared to 
expression in the adult tissues (CTs=30-33). The high levels of expression in fetal tissue 
and cancer cell lines, both of which are highly proliferative, suggests that this gene product 
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may be involved in cell growth and differentiation. Therefore, therapeutic modulation of 
the expression or function of this gene may be effective in the treatment of cancer. 



Among tissues with metabolic or endocrine function, this gene is expressed at high 
to moderate levels in pancreas, adipose, adrenal gland, thyroid, pituitary gland, skeletal 
5 muscle, heart, liver and the gastrointestinal tract. Therefore, therapeutic modulation of the 
activity of this gene may prove useful in the treatment of endocrine/metabolically related 
diseases, such as obesity and diabetes. 

In addition, this gene is expressed at high levels in all regions of the central nervous 
system examined, including amygdala, hippocampus, substantia nigra, thalamus, 

10 cerebellum, cerebral cortex, and spinal cord. The CG59354-03 gene encodes a splice 
variant of phosphoduc in-like protein (PHLP). PDCL is a putative modulator of 
heterotrimeric G proteins. It was initially isolated as the product of an ethanol-responsive 
gene in neural cell cultures (Ref. 1). PDCL shares extensive amino acid sequence 
homology with phosducin (PDC), a phosphoprotein expressed in retina and pineal gland 

15 that inhibits several G protein-coupled signaling pathways by binding to the beta-gamma 
subunits of G proteins. Therefore, this gene may play a role in central nervous system 
disorders such as Alzheimer's disease, Parkinson's disease, epilepsy, multiple sclerosis, 
schizophrenia and depression. 

References: 

20 1 . Miles MF, Barhite S, Sganga M, Elliott M. ( 1 993) Phosducin-like protein: an 

ethanol-responsive potential modulator of guanine nucleotide-binding protein function. 
Proc Natl Acad Sci U S A 90(22): 1083 1-5 

Panel 4D Summary: Ag3553 The CG59354-03 gene is ubiquitously expressed in this 
panel, with highest expression in B cells treated with polk-weed mitogen (CT=27.2). In 

25 addition, this gene is expressd at is expressed at high to moderate levels in a wide range of 
cell types of significance in the immune response in health and disease. These cells include 
members of the T-cell, B-cell, endothelial cell, macrophage/monocyte, and peripheral 
blood mononuclear cell family, as well as epithelial and fibroblast cell types from lung and 
skin, and normal tissues represented by colon, lung, thymus and kidney. This ubiquitous 

30 pattern of expression suggests that this gene product may be involved in homeostatic 

processes for these and other cell types and tissues. This pattern is in agreement with the 
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expression profile in General_screening_panel_vl .4 and also suggests a role for the gene 
product in cell survival and proliferation. Therefore, modulation of the gene product with a 
functional therapeutic may lead to the alteration of functions associated with these cell 
types and lead to improvement of the symptoms of patients suffering from autoimmune and 
inflammatory diseases such as asthma, allergies, inflammatory bowel disease, lupus 
erythematosus, psoriasis, rheumatoid arthritis, and osteoarthritis. 

BC. CG59319-01 and CG59319-02: phosducin-like protein 

Expression of gene CG593 19-01 was assessed using the primer-probe set Ag3544, 
described in Table BCA. Results of the RTQ-PCR runs are shown in Tables BCB and 
BCC. Please note that CG593 19-02 represents a full-length physical clone of the 
CG5931 9-01 gene, validating the prediction of the gene sequence. 



Table BCA . Probe Name Ag3544 



Primers 


Sequences 


Length 


Start 
Position 


SEQ ID NO: 


Forward 


5 ' -tacagatcaagcatcccaatgt-3 ' 


22 


347 


559 


Probe 


TET-5 ' - 

tggttaaccagcatcttagtcttctagca-3 * - 
TAMRA 


29 


375 


560 


Reverse 


5 ' -ttcacgatggctttaacaaatt-3 • 


22 


423 


561 



Table BCB . General_screening_panel_vl.4 



Tissue Name 


Rel. Exp.(%) 
Ag3544, Run 
217048127 


Tissue Name 


Rel. Exp.(%) 
Ag3544, Run 
217048127 


Adipose 


00 


Renal ca.TK- 10 


0.0 


Melanoma* 
Hs688(A).T 


0.0 


Bladder 


0.0 


Melanoma* 
I ls688(B). 1 


0.0 


Gastric ca. (liver met.) 
NCI-N87 


0.7 


Melanoma* M 1 4 


0.0 


Gastric ca KATO III 


0.0 


Melanoma* 
LOXIMVI 


0.0 


Colon ca. SW-948 


0.2 


Melanoma* SK- 
MEL-5 


0.0 


Colon ca. S W480 


0.0 


Squamous cell 
carcinoma SCC-4 


1.4 


Colon ca.* (SW480 
met) SW620 


0.0 


Testis Pool 


100.0 


( < \ >n ca H I 29 


0.0 
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Prostate ca.* (bone 
met) PC-3 


0.0 


Colon ca. HCT-116 


0.3 


Prostate Pool 


0.0 


Colon ca. CaCo-2 


0.0 


Placenta 


0.0 


Colon cancer tissue 


0.0 


Uterus Pool 


0.0 


Colon ca. SW1116 


0.0 


Ovarian ca. 
OVCAR-3 


1.2 


Colon ca. Colo-205 


0.0 


Ovarian ca. SK- 
OV-3 


0.9 


Colon ca. SW-48 


0.0 


Ovarian ca. 
OVCAR-4 


0.0 


Colon Pool 


0.0 


Ovarian ca. 
OVCAR-5 


0.0 


Small Intestine Pool 


0.0 


Ovarian ca. 
IGROV-1 


0.2 


Stomach Pool 


0.0 


Ovarian ca. 
OVCAR-8 


0.0 


Bone Marrow Pool 


0.0 


Ovary 


0.0 


Fetal Heart 


0 0 


Breast ca. MCF-7 


0.0 


Heart Pool 


00 


Breast ca. MDA- 
MB-231 


0.2 


Lymph Node Pool 


0.0 


Breast ca. BT 549 


0.0 


Fetal Skeletal Muscle 


0.0 


Breast ca. T47D 


0.3 


Skeletal Muscle Pool 


0.0 


Breast ca. MDA-N 


o.u 


Spleen Pool 


0.0 


Breast 1*001 


0.0 


l'h) mus Pool 


(111 


Trachea 


0.5 


CNS cancer 
(glio/astro) U87-MG 


0.0 


Lung 


0.0 


CNS cancer 
(glio/astro) U-118-MG 


0.0 


Fetal Lung 


0.0 


CNS cancer 
(neuro;met) SK-N-AS 


0.0 


Lung ca. NCI-N417 


0.0 


ctno cancer (astro) or- 
539 


0.3 

— — - 


Lung ca. LX-1 


0.0 


CNS cancer (<istro) 
SNB-75 


0.0 


Lung ca. NCI-H146 


0.0 


CNS cancer (glio) 
SNB-19 


0.0 


Lung ca. SHP-77 


1 ~o 


CNS cancer (glio) SF- 
295 


0.2 


Lung ca. A 549 


0.0 


Brain (Amygdala) Poo! 


02 


Lung ca. NCI-H526 


_!■? . . 


Brain (cerebellum) 


0.0 


Lung ca. NCI-H23 


0.0 


Brain (fetal) 


0.6 


Lung ca. NCI-H460 


0.0 


Brain (Hippocampus) 
Pool 


0.0 
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Lung ca. HOP-62 


0.0 


Cerebral Cortex Pool 


0 2 


i_.ung ca. iNv^i-riJzz 


' 


Brain (Substantia 
nigra) Pool 


0.0 


Liver 


0.0 


Brain (Thalamus) Pool 


0.0 


Fetal Liver 


0.0 


Brain (whole) 


0.0 


Liver ca. HepG2 


0 0 


Spinal Cord Pool 


0.0 


Kidney Pool 


0.0 


Adrenal Gland 


0.0 


Fetal Kidney 


0.0 


Pituitary gland Pool 


0.0 


Renal ca. 786-0 


1.6 


Salivary Gland 


0.0 


Renal ca. A498 


0.0 


Thyroid (female) 


0.2 


Renal ca. ACHN 


0.0 


Pancreatic ca. 
CAPAN2 


0.0 


Renal ca. UO-31 


0.0 


Pancreas Pool 


0.0 



Table BCC . Panel 4.1 D 



Tis»sue Name 


Rel. Exp.(%) 
Ag3544, Run 
169850546 


Tissue Name 


Rel. Exp.(%) 
Ag3544, Run 
169850546 


Secondary Thl act 


0.0 


HUVEC IL-lbeta 


0.0 


Secondary Th2 act 


0.0 


HUVEC IFN gamma 


0.0 


Secondary Trl act 


0.0 


HUVEC TNF alpha + 
IFN gamma 


0.0 


Secondary Thl rest 


0.0 


HUVEC TNF alpha + 
IL4 


0.0 


Secondary Th2 rest 


0.0 


HUVEC IL-11 


0.0 


Secondary Trl rest 


0.0 


Lung Microvascular EC 
none 


0.0 


Primary Thl act 


0.0 


Lung Microvascular EC 
TNFalpha + IL-lbeta 


0.0 


Primary Th2 act 


0.0 


Microvascular Dermal 
EC none 


0.0 


Primary Trl act 


0.0 


Microsvasular Dermal 
EC TNFalpha + IL-lbeta 


0.0 


Primary Thl rest 


2.6 


Bronchial epithelium 
TNFalpha + IL1 beta 


0.0 


Primary Th2 rest 


0.0 


Small airway epithelium 
none 


0.0 


Primary Trl rest 


0.0 


Small airway epithelium 
TNFalpha + IL-lbeta 


0.0 


CD45RA. CD4 
lymphocyte act 


0.0 


Coronery artery SMC rest 


0.0 


CD45RO CD4 
lymphocyte act 


0.0 


Coronery artery SMC 
TNFalpha + IL-lbeta 


0.0 
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CD8 lymphocyte act 


0.0 


Astrocytes rest 


0.0 


Secondary CD8 
lymphocyte rest 


0.0 


Astrocytes TNFalpha + 
IL-lbeta 


0.0 


Secondary CD 8 
lymphocyte act 


0.0 


KU-812 (Basophil) rest 


100.0 


CD4 lymphocyte none 


0.0 


KU-olz (Basophil) 


61.1 


2ryThlAWTrl_anti- 


0.0 


CCD 1106 

(Keratinocytes) none 


0.0 


LAK cells rest 


0.0 


(Keratinocytes) 
TNFalpha + IL-lbeta 


0.0 


LAK cells IL-2 


00 


Liver cirrhosis 


00 


LAK cells IL-2+IL- 12 


0.0 


NCI-H292 none 


0.0 


LAK cells IL-2+IFN 
gamma 


0.0 


NCI-H292 IL-4 


0.0 


LAK cells 1L-2+ II - IS 


0.0 


NCI-H292 IL-9 


0.0 


LAK cells 
PMA/ic nomycin 


0.0 


NCI-H292 IL-13 


0.0 


NK Cells IL-2 rest 


0.0 


NCI-H292 I FN gamma 


0.0 


Two Way MLR 3 day 


0.0 


HPAEC none 


0.0 


Two Way MLR 5 day 


0.0 


HPAEC TNF alpha + IL- 
1 beta 


0.0 


Two Way MLR 7 day 


0.0 


Lung fibroblast none 


0.0 


PBMC rest 


0.0 


Lung fibroblast TNF 
alpha + IL-1 beta 


0.0 


PBMC PWM 


0.0 


Lung fibroblast IL-4 


0 0 


PR MP PHA ! 


0 0 


Lung fibroblast IL-9 




Ramos (B cell) none 


0.0 


Lung fibroblast IL-13 


0.0 


Ramos (B cell) 
ionomycin 


0.0 


Lung fibroblast IFN 
gamma 


0.0 


B lymphocytes PWM 


0.0 


Dermal fibroblast 
CCD 1070 rest 


0.0 


B lymphocytes CD40L 
and IL-4 


0.0 


Dermal fibroblast 
CCD1070 TNF alpha 


0.0 


EOL-1 dbcAMP 


0.0 


Derm 3.1 fibroblast 
CCD1070 IL-1 beta 


0.0 


EOL-1 dbcAMP 
PMA/ionomycin 


0.0 


Dermal fibroblast IFN 
gamma 


0.0 


Dendritic cells none 


0.0 


Dermal fibroblast IL-4 


0.0 


Dendritic cells LPS 


0.0 


Dermal Fibroblasts rest 


0.0 


Dendritic cells anti- 
CD40 


0.0 


Neutrophils TNFa+LPS 


0.0 


Monocytes rest 


0.0 


Neutrophils rest 


0.0 
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Monocytes LPS 


0.0 


Colon 


0.0 


Macrophages rest 


0.0 


Lung 


0.0 


Macrophages LPS 


0.0 


Thymus 


00 


HUVEC none 


0.0 


Kidney 


0.0 


HUVEC starved 


0.0 







CNS_neurodegeneration_vl.O Summary: Ag3544 Expression of the CG59319-02 gene 
is low/undetectable in all samples on this panel (CTs>35). (Data not shown.) 



General_screening_panel_vl.4 Summary: Ag3544 Expression of the CG59319-02 gene 
is restricted to a sample derived from the testis (CT=29.8). Thus, expression of this gene 
could be used to differentiate between this sample and other samples on this panel and as a 
marker of testicular tissue. Furthermore, therapeutic modulation of the expression or 
function of this gene may be useful in the treatment of male infertility or hypogonadism. 

Panel 4.1D Summary: Ag3544 Expression of the CG593 19-02 gene is restricted to 
samples derived from the basophil cell line KU-812 (CTs=32). Thus, expression of this 
gene could be used as a marker of this cell type. Furthermore, the specific pattern of 
expression of this gene suggests that therapeutic modulation of the expression or function 
of the protein encoded by this gene may block or inhibit inflammation or tissue damage 
due to basophil activation in response to asthma, allergies, hypersensitivity reactions, 
psoriasis, and viral infections. 

BD. CG59576-01: Olfactory Receptor 

Expression of gene CG59576-01 was assessed using the primer-probe set Ag3478, 
described in Table BDA. Results of the RTQ-PCR runs are shown in Table BDB. 

Table BDA . Probe Name Ag3478 



Primers 


Sequences 


Length 


Start 
Position 


SEQID 
NO: 


Forward 


5 ' -tggaagtgtagccctgatgtac-3 • 


:: 


708 


562 


Probe 


TET-5 1 -tgctcttctctgccaagtactccttt-3 ' - 
TAMRA 


26 


731 


563 


Reverse 


5 ' -aaacattaggctgatggttgtg-3 ' 


22 


765 


564 



Table BDB . Panel 4D 
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Tissue Name 


Rel. Exp.(%) 
Ag3478, Run 
166441540 


Tissue Name 


Rel. Exp.(%) 
Ag3478, Run 
166441540 


Secondary Thl act 


0.0 


HUVEC IL-lbeta 


0.0 


Secondary Th2 act 


0.0 


HUVEC IFN gamma 


0.0 


Secondary Trl act 


0.0 


HUVEC TNF alpha + 
IFN gamma 


0.0 


Secondary Thl rest 


0.0 


HUVEC TNF alpha + 
IL4 


0.0 


Secondary Th2 rest 


9.0 


HUVEC IL-ll 


0.0 


Secondary Trl rest 


0.0 


Lung Microvascular EC 
none 


0.0 


Primary Thl act 


0.0 


Lung Microvascular EC 
TNFalpha + IL-lbeta 


0.0 


Primary Th2 act 


0.0 


Microvascular Dermal 
EC none 


0.0 


Primary Trl act 


0.0 


Microsvasular Dermal 
EC TNFalpha + IL-lbeta 


0.0 


Primary Thl rest 


0.0 


Bronchial epithelium 
TNFalpha + IL1 beta 


0.0 


Primary Th2 rest 


0.0 


Small airway epithelium 


0.0 


Primary Trl rest 


0.0 


Small airway epithelium 
TNFalpha + IL-lbeta 


0.0 


CD45RA CD4 
lymphocyte act 


0.0 


Coronery artery SMC rest 


0.0 


CD45RO CD4 
lymphocyte act 


0.0 


Coronery artery SMC 
TNFalpha + IL-lbeta 


0.0 


CD8 lymphocyte act 


0.0 


Astrocytes rest 


0.0 


oeconoary tus 
lymphocyte rest 


0.0 


Astrocytes TNFalpha + 
IL-lbeta 


0.0 


Secondary CD8 
lymphocyte set 


0 0 


KU-812 (Basophil) rest 


0.0 


CD4 lymphocyte none 


0.0 


ivu-oiz (£>asopnn) 
PMA/ionomycin 


4.6 


2ryThl/Th2/Trl anti- 
CD95 CH11 


2.6 


CCD 11 06 

(Keratin ocytes) none 


0.0 


LAK cells rest 


0.0 


CCD 11 06 
(Keratin ocytes) 
TNFalpha + IL-lbeta 


0.0 


LAK cells IL-2 


0.0 


Liver cirrhosis 


100.0 


LAK cells IL-2+IL-12 


0.0 


Lupus kidney 


0.0 


LAK cells IL-2+IFN 
gamma 


0.0 


NCI-H292 none 


0.0 


LAK cells IL-2+ IL-18 


0.0 


NCI-H292 IL-4 


0.0 
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LAK cells 
PMA/ionomycin 


0.0 




0.0 


NK Cells IL-2 rest 


0.0 


NCI-H292 IL-13 


0.0 


Two Way MLR 3 day 


0.0 


NCI-H292 I FN gamma 


0.0 


Two Way MLR 5 day 


0.0 


HPAEC none 


0.0 


Two Way MLR 7 day 


0.0 


HPAEC TNF alpha + IL- 
1 beta 


0.0 


PBMC rest 


0.0 


Lung fibroblast none 


0.0 


PBMC PWM 


0.0 


Lung fibroblast TNF 
alpha + IL-1 beta 


0.0 


PBMC PHA-L 


0.0 


Lung fibroblast IL-4 


0.0 


Ramos (B cell) none 


0.0 


Lung fibroblast IL-9 


0.0 


Ramos (B cell) 
ionomycin 


0.0 


Lung fibroblast IL-13 


0.0 


B lymphocytes PWM 


0.0 


Lung fibroblast IFN 

gamma 


0.0 


B lymphocytes CD40L 
and IL-4 


0.0 


Dermal fibroblast 
v^v^l* i u / u resx 


0.0 


EOL-1 dbcAMP 


0.0 

" ■ ■■ ■ -■ 


Dermal fibroblast 
CCD 1070 TNF alpha 


0.0 


EOL-1 dbcAMP 
PMA/ionomycin 


0.0 


Dermal fibroblast 
CCD1070 IL-1 beta 


0.0 


Dendritic cells none 


0.0 


Dermal fibroblast IFN 
gamma 


0.0 


Dendritic cells LPS 


0.0 


Dermal fibroblast IL-4 


0.0 


Dendritic cells anti- 
CD40 


0.0 


IBD Colitis 2 


25.7 


Monocytes rest 


0.0 


IBD Crohn's 


24.3 


Monocytes LPS 


o'.o 


Colon 


5.9 


Macrophages rest 


0.0 ILung 


5.8 


Macrophages LPS 


0.0 


Thymus 


5.9 


HUVEC none 


0.0 


Kidney 


10.2 


HUVEC starved 


0.0 i 





CNS_neurodegeneration_vl.O Summary: Ag3478 Expression of the CG59576-01 gene 
is low/undetectable in all samples on this panel (CTs>35). (Data not shown.) 



General_screening_panel_vl.4 Summary: Ag3478 Expression of the CG59576-01 gene 
is low/undetectable in all samples on this panel (CTs>35). (Data not shown.) 

General_screening_panel_vl.5 Summary: Ag3478 Expression of the CG59576-01 gene 
is low/undetectable in all samples on this panel (CTs>35). (Data not shown.) 
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Panel 4D Summary: Ag3478 Expression of the CG59576-01 gene is restricted to a 
sample derived from liver cirrhosis (CT=32.3). Furthermore, expression of this gene is not 
detected! in normal liver in Panel 1.4, suggesting that its expression is unique to liver 
cirrhosis. This gene encodes a putative GPCR; therefore, antibodies or small molecule 
5 therapeutics could reduce or inhibit fibrosis that occurs in liver cirrhosis. In addition, 
antibodies to this putative GPCR could also be used for the diagnosis of liver cirrhosis. 

Panel 5 Islet Summary: Ag3478 Expression of the CG59576-01 gene is low/undetectable 
in all samples on this panel (CTs>35). (Data not shown.) 

BE. CG 59557-01: Olfactory Receptor 

1 0 Expression of gene CG59557-01 was assessed using the primer-probe set Ag3470, 

described in Table BEA. Results of the RTQ-PCR runs are shown in Table BEB. 



Table BEA . Probe Name Ag3470 



Primers 


Sequences 


Length 


Start 
Position 


SEQ ID 

NO: 


Forward 


5 ' -ccaaccttctcagtgaacagaa-3 ' 


22 


413 


565 


Probe 


TET-5 • -tctctttcataggttgcctcctgcaga-3 ' - 
TAMRA 


27 


440 


566 


Reverse 


5 ' -ccgagtgagtggaagaagtaca-3 1 


22 


467 


567 



Table BEB . Panel 4D 



Tissue Name 


Rel. Exp.(%) 
Ag3470, Run 
166417125 


Tissue Name 

. 


Rel. Exp.(%) 
Ag3470, Run 
166417125 


Secondarv [hi act 


0.0 iHUVEC IL-1 beta 


0.0 


Secondary Thl act 


3.2 .HUVECIFN gamma 


0.0 


Secondary Trl act 


00 HUVEC TNT alpha - 
JIFN gamma 


0.0 


Secondary Thl rest 


0.0 


HUVEC TNF alpha + 
IL4 


0.0 


Secondary Th2 rest 


0 0 


HUVEC IL-1 1 


0.0 


Secondary Trl rest 


0.0 


Lung Microvascular EC 
none 


0.0 


Primary Thl act 


0.0 


Lung Microvascular EC 
TNFalpha + IL-1 beta 


3.0 


Primary Th2 act 


0.0 


Microvascular Dermal 
EC none 


0.0 
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Primary Trl act 


0.0 


Microsvasular Dermal 
EC TNFalpha + IL-1 beta 


0.0 


Primary Thl rest 


2.8 


Bronchial epithelium 
TNFalpha + IL1 beta 


0.0 


Primary Th2 rest 


0.0 


Small airway epithelium 


0.0 


Primary Trl rest 


0.0 


Small airway epithelium 
TNFalpha + IL-1 beta 


0.0 


CD45RA CD4 
lymphocyte act 


0.0 


Coronery artery SMC rest 


0.0 


CD45RO CD4 
lymphocyte act 


0.0 


Coronery artery SMC 
TNFalpha + IL-1 beta 


0.0 


CD8 lymphocyte act 


0.0 


Astrocytes rest 


0.0 


oeconOciry luo 
lymphocyte rest 


0.0 


Astrocytes TNFalpha + 
IL-1 beta 


0.0 


Secondary CD8 
lymphocyte act 


0.0 


KU-812 (Basophil) rest 


0.0 


CD4 lymphocyte none 


2.9 


VTT on ST> • i\ 

KU-812 (Basophil) 
PMA/iononiycin 


0.0 


2ryThl/Th2/Trl anti- 
CD95 CH11 


0.0 


CCD1106 

(Keratinocytes) none 


0.0 


LAK cells rest 


0.0 


CCD1 106 
(Keratinocytes) 
TNFalpha + IL-1 beta 


0.0 


LAK cells IL-2 


0.0 


Liver cirrhosis 


100.0 


LAK cells IL-2+IL-12 


0.0 


Lupus kidney 


2.2 


LAK cells IL-2+IFN 
gamma 


0.0 


NCI-H292 none 


0.0 


LAK cells IL-2+IL-18 


3.2 


NCI-H292 IL-4 


0.0 


LAK cells 
PMA/ionomycin 


0.0 


NCI-H292 IL-9 


0.0 


NK Cells IL-2 rest 


0.0 


NCI-H292 IL-1 3 


0.0 


Two Way MLR 3 day 


0.0 


NCI-H292 I FN gamma 


0.0 


Two Way MLR 5 day 


0.0 


HPAEC none 


0.0 


Two Way MLR 7 day 


0.0 


HPAEC TNF alpha + IL- 
1 beta 


0.0 


PBMC rest 


5.8 


Lung fibroblast none 


0.0 


PBMC PWM 


2.7 


Lung fibroblast TNF 
alpha + IL-1 beta 


0.0 


PBMC PHA-L 


0 0 


Lung fibroblast IL-4 


0.0 


Rani os (B cell) none 


0.0 


Lung fibroblast IL-9 


0.0 


Ramos (B cell) 
ionomycin 


0.0 


Lung fibroblast IL-1 3 


0.0 


B lymphocytes PWM 


6.5 


Lung fibroblast IFN 
gamma 


0.0 
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JO jyiIipjIUCyLC5> \^LJH\J Lj 

and IL-4 


2.2 


Dermal fibroblast 
CCD 1070 rest 


0.0 


EOL-1 dbcAMP 


0.0 


Dermal fibroblast 
CCD 1070 TNF alpha 


0.0 


EOL-1 dbcAMP 
PMA/ionomycin 


0.0 


Dermal fibroblast 
CCD1070 IL-1 beta 


0.0 


Dendritic cells none 


3.2 


Dermal fibroblast IFN 
gamma 


0.0 


Dendrit ic cells LPS 


20.9 


Dermal fibroblast IL-4 


0.0 


Dendritic cells anti- 
CD40 


0.0 


IBD Colitis 2 


19.6 


Monocytes rest 


0.0 


IBD Crohn's 


4.9 


Monocytes LPS 


6.3 


Colon 


13.9 


Macrophages rest 


21.6 


Lung 


14.8 


Macrophages LPS 


0.0 


Th>mus 


2.2 


HUVEC none 


0.0 


Kidney 


0.0 


HUVEC starved 


0.0 







CNS_neurodegeneration_vl.O Summary: Ag3470 Expression of the CG59557-01 gene 
is low/undetectable in all samples on this panel (CTs>35). (Data not shown.) 



General_screening_panel_vl.4 Summary: Ag3470 Expression of the CG59557-01 gene 
is low/undetectable in all samples on this panel (CTs>35). (Data not shown.) 

5 Panel 4D Summary: Ag3470 Expression of the CG59557-01 gene is detected in a liver 
cirrhosis sample (CT = 32.2). Furthermore, expression of this gene is not detected in 
normal liver in Panel 1.4, suggesting that its expression is unique to liver cirrhosis. This 
gene encodes a putative GPCR; therefore, antibodies or small molecule therapeutics could 
reduce or inhibit fibrosis that occurs in liver cirrhosis. In addition, antibodies to this 
1 0 putative GPCR could also be used for the diagnosis of liver cirrhosis. 

BF. CG59555-01: Olfactory Receptor 

Expression of gene CG59555-01 was assessed using the primer-probe set Ag3467, 
described in Table BFA. Results of the RTQ-PCR runs are shown in Tables BFB, BFC and 
BFD. 

1 5 Table B FA. Probe Name Ag3467 



Primers 


Sequences 


Length 


Start 
Position 


SEQID 
NO: 
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Forward 


5 1 -ggcaaggaaagtcattcctaa-3 ' 


21 


953 


568 


Probe 


TET-5 1 -tggtgtgacatttgactctccctcct-3 ' - 
TAMRA 


26 


975 


569 


Reverse; 


5 ' -tggtaccaagattccaggagat-3 ' 


22 


1006 


570 



Table BFB . CNSjneurodegenerationvl.O 



G 



Tissue Name 


Rel. Exp.(%) Ag3467, 
Run 210376517 


Tissue Name 


Rel. Exp.(%) Ag3467, 
Run 210376517 


AD 1 Hippo 


4.6 


Control (Path) 3 
Temporal Ctx 


14.8 


AD 2 Hippo 


29.7 


Control (Path) 4 
Temporal Ctx 


15.2 


AD 3 Hippo 


10.7 


AD 1 Occipital 
Ctx 


14.5 


AD 4 Hippo 


28.9 


AD 2 Occipital 
Ctx (Missing) 


0.0 


AD 5 hippo 


21.6 


AD 3 Occipital 
Ctx 


14.5 


AD 6 Hippo 


100.0 


AD 4 Occipital 
Ctx 


14.8 


Control 2 Hippo 


8.8 


AD 5 Occipital 
Ctx 


13.7 


Control 4 Hippo 


35.6 


AD 6 Occipital 
Ctx 


10.3 


Control (Path) 3 
Hippo 


21.9 


Control 1 Occipital 
Ctx 


18.3 


AD 1 Temporal Ctx 


28.3 


Control 2 Occipital 
Ctx 


7.9 


AD 2 Temporal Ctx 


37.4 


Control 3 Occipital 


16.2 


AD 3 Temporal Ctx 


7.4 


Control 4 Occipital 
Ctx 


24.0 


AD 4 Temporal Ctx 


28.3 


Control (Path) 1 
Occipital Ctx 


28.3 


AD 5 Inf Temporal 
Ctx 


19.3 


Control (Path) 2 
Occipital Ctx 


10.1 


AD 5 SupTemporal 
Ctx 


33.4 


Control (Path) 3 
Occipital Ctx 


12.6 


AD 6 In f Temporal 
Ctx 


39.8 


Control (Path) 4 
Occipital Ctx 


14.3 


AD 6 Sup Temporal 
Ctx 


83.5 


Control 1 Parietal 
Ctx 


8.7 


Control 1 Temporal 
Ctx 


14.4 


Control 2 Parietal 
Ctx 


22.2 
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Control 2 Temporal 
Ctx 


13.6 


^Ullirui _> r ail e Lai 

Ctx 


9.8 


Control 3 Temporal 
Ctx 


11.8 


Control (Path) 1 
Parietal Ctx 


33.2 


Control 4 Temporal 
Ctx 


16.0 


Control (Path) 2 
Parietal Ctx 


12.4 


Control (Path) 1 
Temporal Ctx 


24.3 


Control (Path) 3 
Parietal Ctx 


19.6 


Control (Path) 2 
Temporal Ctx 


6.1 


Control (Path) 4 
Parietal Ctx 


12.9 



Table P>FC . General_screening_panel_vl.4 



Tissue Name 


Kel. Hxp.( /o ) 
Ag3467, Run 
217119371 


Tissue Name 


Rel. Exp.(%) 
Ag3467, Run 
217119371 


Adipose 


11.6 


Renal ca.TK-10 


14.6 


Melanoma* 
Hs688(A).T 


27.0 


Bladder 


14.1 


Melanoma* 
Hs688(B).T 


27.9 


Gastric ca. (liver met.) 
NCI-N87 


3.2 




112 


dactrin no V A Tfl TTT 

vjdstnc Cd. rv/\ 1 KJ III 




Melanoma* 

T OYTMVT 


0.0 


Colon ca. SW-948 


0.4 


Melanoma* SK- 
MEL-5 


0.0 


Colon ca. SW480 


12.7 


Squamous cell 
carcinoma SCC-4 


1.8 


Colon ca.* (SW480 
met) SW62ft 


10.4 


Testis Pool 


7.0 


Colon ca. HT29 


1.7 


Prostate ca.* (bone 
met) PC-3 


15.2 


Colon ca. HCT-116 


7.9 


Prostate Pool 


9.7 


Colon ca. CaCo-2 


5.3 


Placenta 


1.7 


Colon cancer tissue 


3.6 


Uterus Pool 


5.1 


Colon ca. SW1116 


0.4 


Ovarian ca. 
OVCAR-3 


4.1 


Colon ca. Colo-205 


0.2 


Ovarian ca. SK- 
OV-3 


16.2 


Colon ca. S W-48 


0.5 


Ovarian ca. 
OVCAR-4 




Colon Pool 


20.6 


Ovarian ca. 
OVCAR-5 


13.9 


Small Intestine Pool 


26.8 


Ovariari ca. 
IGROV-1 


0.0 


Stomach Pool 


19.5 


Ovarian ca. 


9.5 


Bone Marrow Pool 


16.7 
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OVCAR-8 








Ovary 


8.4 


Fetal Heart 


23.5 


Breast ca. MCF-7 


0.8 


Heart Pool 


11.2 


Breast ca. MDA- 
MB-231 


26.2 


Lymph Node Pool 


31.9 


Breast ca. BT 549 


0.0 


Fetal Skeletal Muscle 


7.1 


Breast ca. T47D 


18.2 


Skeletal Muscle Pool 


1.8 


Breast ca. MDA-N 




Spleen Pool 


22.4 


Breast Pool 


26.6 


Thymus Pool 


25.9 


Tracheal 


9.5 


CNS cancer 
(glio/astro) U87-MG 


0.4 


Lung 


12.4 


CNS cancer 
(glio/astro) U-118-MG 


0.2 


Fetal Lung 


100.0 


CNS cancer 
(neuro:met) SK-N-AS 


0.0 


Lung ca. NCI-N417 


0.0 


CNS cancer (astro) SF- 
539 


0.0 


Lung ca. LX-1 


11.6 


CNS cancer (astro) 
SNB-75 


1.5 


t w , n n ICNS cancer (glio) 
Lung ca.NCI-H 146 1 0.0 S NB-19 


1.1 


Lung ca. SHP-77 | 0.1 


CNS cancer (glio) SF- 
295 


20.0 


Lung ca. A549 j 3.0 


Brain (Amygdala) Pool 


1.9 


Lung ca. NCI-H526 


0.0 


lira in i\etebe!li)f!)) 


1 .0 


Lung ca. NCI-H23 


13.9 


Ri in (fetal) 


2.5 


Lung ca. NCI-H460 


5.4 


Brain (Hippocampus) 
Pool 


0.2 


Lung ca. HOP-62 


6.7 


CVrebial Corfex Pool 


1.3 


Lung ca. NCI-H522 


0.0 


Brain (Substantia 
nigra) Pool 


1.6 


Liver 


0.1 


Brain (Thalamus) Pool 


2 5 


Fetal Li ver 


6.6 


Brain (whole) 


11 


Liver ca. HepG2 


1.4 


Spinal Cord Pool 


4.5 


Kidney Pool 


34.4 


Adrenal Gland 


6.3 


Fetal Kidney | 76.3 


Pi t u i tan giand Pool 


4.5 


Renal ca. 786 0 28.1 


Salivary Gland 


1.8 


Renal ca. A498 


12.1 


Thyroid (female) 


13.4 


Renal ca. ACHN 


23.0 


Pancreatic ca. 
CAPAN2 


1.0 


Renal ca. UO-3 1 


25.0 


Pancreas Pool 


27.2 



Table BFD . Panel 4D 
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Tissue Name 


Rel. Exp.(%) 

AgJ4o /, Klin 
166417105 


Tissue Name 


Rel. Exp.(%) 
AgJ4o 7, Kun 
166417105 


Secondary Thl act 


2.4 


HUVEC IL-lbcta 


1.5 


Secondary Th2 act 


5.1 


iiUVJbL, irJN gamma 


18.2 


Secondary Trl act 


7.2 


HUVEC TNF alpha + 
IFN gamma 


6.2 


Secondary Thl rest 


18.2 


HUVEC TNF alpha 1 
IL4 


1.7 


Secondary Th2 rest 


14.6 


HUVEC IL-1 1 


1.8 


Secondary Trl rest 


22.1 


Lung Microvascular EC 
none 


0.6 


Primary Thl act 


1.3 


Lung Microvascular EC 
TNFalpha + IL-1 beta 


0.3 


Primary Th2 act 


9.0 


Microvascular Dermal 
EC none 


0.1 


Primary Trl act 


7.2 


Microsvasular Dermal 
EC TNFalpha + IL-1 beta 


0.5 


Primary Thl rest 


100.0 


Bronchial epithelium 
TNFalpha + IL1 beta 


1.5 


Primary Th2 rest 


38.7 


Small airway epithelium 
none 


1.6 


Primary Trl rest 


~„ . iSmall airway epithelium J , - 
|TNFalpha + IL-lbeta j 


CD45RA CD4 
lymphocyte act 


1.9 


Coronery artery SMC rest 


4.5 


CD45RO CD4 
lymphocyte act 


8.7 


Coronery artery SMC 
TNFalpha + IL-1 beta 


5.1 


CD8 lymphocyte act 


4.0 


Astrocytes rest 


0.4 


Secondary CD8 
lymphocyte rest 


7.5 


Astrocytes TNFalpha + 
IL-lbeta 


1.7 


Secondary CD8 
lymphocyte act 


4.7 


KU-812 (Basophil) rest 


0.7 


CD4 lymphocyte none 


9 3 lKU-812 (Basophil) 
|PMA/ionomycin 


3.2 


2ryThl/Th2/Trl_anti- 
CD95 CHI 1 


52.5 


CCD1106 

(Keratinocytes) none 


1.5 


LAK cells rest 


4.6 


CCD1106 
(Keratinocytes) 
TNFalpha + IL-lbeta 


16.4 


LAK cells IL-2 


15.1 


Liver cirrhosis 


14.6 


LAK cells IL-2+1L-12 


6.5 


Lupus kidney 


37.6 


LAK cells IL-2+IFN 
gamma 


10.9 


NCI-H292 none 


4.6 


LAK cells 1L-2+IL- 18 


6.9 


NCI-H292 IL-4 


5.2 
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LAK cells 
PMA/ionomycin 


1.1 


NCI-H292 IL-9 


5.6 


NK Cells IL-2 rest 


3.1 


NCI-H292 IL-13 


2.3 


Two Way MLR 3 day 


11.0 


NC1-H292 IFN gamma 


1.5 


Two Way MLR 5 day 


6.0 


HPAEC none 


3.6 


Two Way MLR 7 day 


6.7 


HPAEC TNF alpha + IL- 
1 beta 


10.5 


PBMC rest 


2.6 


Lung fibroblast none 


15.3 


PBMC PWM 


4.2 


juung iiDrouiasi liNr 
alpha + IL-1 beta 


5.9 


PR A/IP P [ I A T 




Lung fibroblast IL-4 


5.6 


Ran inc. (B ceh) none 


0.0 


Lung fibroblast IL-9 


5.6 


Ramos (B cell) 
ionomycin 


0.1 


Lung fibroblast IL-1 3 


5.1 


B lymphocytes PWM 


6.0 


Lung fibroblast IFN 
gamma 


7.9 


B lymphocytes CD40L 
and IL-4 


6.5 


Dermal fibroblast 
CCD1070 rest 


4.5 


EOL-1 dbcAMP 


0.0 


Dermal fibroblast 
CCD 1070 TNF alpha 


21.6 


fc.wJL-1 UDCAMr 

PMA/ionomycin 


0.0 


Dermal fibroblast 
CCD 1070 IL-1 beta 


2.0 


Dendritic cells none 


0.4 


Dermal fibroblast IFN 
gamma 


3.5 


Dendritic cells LPS 


0.1 


Dermal fibroblast IL-4 


6.9 


Dendritic cells anti- 
CD40 


0 1 


TDH Politic 0 


106 


Monocytes rest 


1.6 


IBD Crohn's 


1 8 


Monocles LPS 


1.7 


Colon 


34.9 


Macrophages rest 


8.4 


I una 


6.7 


Macrophages LPS 


1.2 


Thymus 


26.8 


HUVEC none 


2.6 


Kidney 


11.3 


HUVEC starved 


2.8 







CNS_neurodegeneration_vl.O Summary: Ag3467 The CG5 9555-01 gene encodes a 
putative GPCR. It is expressed at low to moderate levels in most of the samples used in this 
panel. This panel confirms the expression of CG59555-01 gene at low levels in the brains 
of an independent group of individuals. However, no differential expression of this gene 
was detected between Alzheimer's diseased postmortem brains and those of non-demented 
controls in this experiment. Please see Panel 1 .4 for a discussion of the potential utility of 
this gene in treatment of central nervous system disorders. 
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General_screening_panel_vl.4 Summary: Ag3467 The CG59555-01 gene encodes a 
putative GPCR. It is expressed at low to moderate levels in large number of the samples 
used in this panel. Highest expression of this gene is detected in fetal lung (CT=28). 
Interestingly, this gene is expressed at much higher levels in fetal (CT = 28) when 
5 compared to adult lung (CT = 31). Therefore, expression of this gene can be used to 
distinguish fetal lung from adult lung and from other samples used in this panel. In 
addition, this gene is also expressed at much higher levels in fetal fetal liver (CT=32) as 
compared to adult liver (CT=38). Thus, expression of this gene can be used to distinguish 
fetal liver from adult liver. 

10 Among tissues with metabolic or endocrine function, this gene is expressed at low 

to moderate levels in pancreas, adipose, adrenal gland, thyroid, pituitary gland, skeletal 
muscle, heart, liver and the gastrointestinal tract. Therefore, therapeutic modulation of the 
activity of this gene may prove useful in the treatment of endocrine/metabolically related 
diseases, such as obesity and diabetes. 

15 This gene is also expressed at low levels in all regions of the central nervous system 

examined, including amygdala, substantia nigra, thalamus, cerebellum, cerebral cortex, and 
spinal cord. Several neurotransmitter receptors are GPCRs, including the dopamine 
receptor family, the serotonin receptor family, the GABAB receptor, muscarinic 
acetylcholine receptors, and others; thus this GPCR may represent a novel neurotransmitter 

20 receptor. Targeting various neurotransmitter receptors (dopamine, serotonin) has proven to 
be an effective therapy in psychiatric illnesses such as schizophrenia, bipolar disorder, and 
depression. Furthermore, the cerebral cortex and hippocampus are regions of the brain that 
are known to be involved in Alzheimer's disease, seizure disorders, and in the normal 
process of memory formation. Therefore, this gene may play a role in central nervous 

25 system disorders such as Alzheimer's disease, Parkinson's disease, epilepsy, multiple 
sclerosis, schizophrenia and depression. 

Panel 4D Summary: Ag3467 The CG59555-01 gene encodes a putative GPCR. Highest 
expression of this gene is detected in resting primary Thl cells (CT=27). This gene is 
expressed at moderate levels in a wide range of cell types of significance in the immune 
30 response in health and disease. These cells include members of the T-cell, B-cell, 

endothelial cell, macrophage/monocyte, and peripheral blood mononuclear cell family, as 
well as epithelial and fibroblast cell types from lung and skin, and normal tissues 
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represented by colon, lung, thymus and kidney. This ubiquitous pattern of expression 
suggests that this gene product may be involved in homeostatic processes for these and 
other cell types and tissues. This pattern is in agreement with the expression profile in 
General _screeningjpanel_vl .4 and also suggests a role for the gene product in cell survival 
and proliferation. Therefore, modulation of the gene product with a functional therapeutic 
may lead to the alteration of functions associated with these cell types and lead to 
improvement of the symptoms of patients suffering from autoimmune and inflammatory 
diseases such as asthma, allergies, inflammatory bowel disease, lupus erythematosus, 
psoriasis, rheumatoid arthritis, and osteoarthritis. 

BG. CG59551-01: Olfactory Receptor 

Expression of gene CG59551-01 was assessed using the primer-probe set Ag3463, 
described in Table BGA. Results of the RTQ-PCR runs are shown in Tables BGB and 
BGC. 

Table BGA . Probe Name Ag3463 



Primers 


Sequences 


Length 


Start 
Position 


SEQID 
NO: 


Forward 


5 ' -ctatggtttccagatgtttcca-3 ' 


22 


78 


571 


Probe 


TET-5 • -tagatgttccagctgcccatctctga-3 ' - 
TAMRA 


26 


105 


572 


Reverse 


5 ' -attgtgagacacagctggattt-3 ' 


22 


132 


573 



Table BGB . General_screening_panel_vl .4 



Tissue Name 


Rel. Exp.(%) 
Ag3463, Run 
217067349 


Tissue Name 


Rel. Exp.(%) 
Ag3463,Run 
217067349 


Adipose 


0.0 


R< n 1 ca rK 10 


9.0 


Melanoma* 
Hs688(A).T 


0.0 


Bladder 


11.2 


Melanoma* 
Hs688(B).T 


0.0 


Gastric ca. (liver met.) 
NCI-N87 


13.0 


Melanoma* M14 


0.0 


Gastric ca. KATO HI 


0.0 


Melanoma* 
LOXIMVI 


0.0 


Colon ca. SW-948 


0.0 


Melanoma* SK- 
MEL-5 


43.8 


Colon ca. SW480 


11.6 


Squamous cell 


0.0 


Colon ca* (SW480 


0.0 
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carcinoma SCC-4 




met) SW620 




Testis Pool 


85.9 


Colon ca. 11T29 


0.0 


Prostate ca.* (bone 
met) PC-3 


0.0 


Colon ca. HCT-116 


10.7 


Prostate Pool 


7.2 


Colon ca. CaCo-2 


12.8 


Placenta 


12.8 




0.0 


Uterus Pool 


0.0 


Colon ca. SW1116 


0.0 


Ovarian ca. 
OVCAR-3 


0.0 


Colon ca. Colo-205 


0.0 


Ovarian ca. SK- 
OV-3 


100.0 


Colon ca. SW-48 


0.0 


Ovarian ca. 

VJ V l^/Vft-H 


11.9 


Colon Pool 


12.5 


Ovarian ca. 
OVCAR-5 


11.3 


Small Intestine Pool 


0.0 


Ovarian ca. 
IGROV-1 


26.1 


Stomach Pool 


0.0 


Ovarian ca. 
OVCAR-8 


0.0 


Bone Marrow Pool 


0.0 


Ovary 


0.0 


Fetal Heart 


0.0 


Breast ca. MCF-7 


0.0 


Heart Pool 


0.0 


Breast ca. MDA- 
MB-231 


0.0 


Lymph Node Pool 


23.2 


Breast ca. BT 549 


20 3 


Fetal Skeletal Muscle 


82.9 


Breast ca. T47D 


0.0 


Skeletal Muscle Pool 


0.0 






opieen rooi 




I3rc isl ! . I 


12.5 


Thymes P-.<o 


0.0 


Trachea. 


20.9 


CNS cancer 
(glio/astro) U87-MG 


0.0 


Lung 


0.0 


CNS cancer 
(glio/astro) U-118-MG 


24.0 


Fetal Lung 


23.7 


CNS cancer 
(neuro;met) SK-N-AS 


0.0 


Lung ca. NCI-N417 


0.0 


CNS cancer (astro) SF- 
539 


0.0 


Lung ca. LX-1 


—————— 

0.0 


SNB-75 ^ ) 


38.2 


Lung ca. NCI-H146 


0.0 


CNS cancer (glio) 
SNB-19 


0.0 


Lung ca. SHP-77 


8.8 


CNS cancer (glio) SF- 
295 


33.2 


Lung ca. A549 


0.0 


Brain (Amygdala) Pool 


576""" J " 


Lung ca. NCI-H526 


0.0 


Brain (cerebellum) 


0.0 


Lung ca. NCI-H23 


57 4 


Brain (fetal) 


48.6 
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Lung ca. NCI-H460 


31.2 


Brain (Hippocampus) 
Pool 


21.3 


Lung ca. HOP-62 


0.0 


Cerebral Cortex Pool 


35.6 


Lung ca. NCI-H522 


14.4 


Brain (Substantia 
nigra) Pool 


15 5 


Liver 


0.0 


Brain (Thalamus) Pool 


14.1 


Fetal Liver 


0.0 


Brain (whole) 


0.0 


Liver ca. HepG2 


5.0 


Spinal Cord Pool 


29.5 


Kidney Pool 


37.9 


Adrenal Gland 


14.8 


Fetal Kidney 


0.0 


Pituitary gland Pool 


0.0 


Renal ca. 786-0 


0.0 


Salivary Gland 


0.0 


Renal ca. A498 


0.0 


Thyroid (female) 


0.0 


Renal ca. ACHN 


0.0 


Pancreatic ca. 
CAPAN2 


0.0 


Renal ca. UO-3 1 


0.0 


Pancreas Pool 


12.2 



Table BGC . Panel 4. ID 



Tissue Name 


Rel. Exp.(%) 
Ag3463, Run 
169839351 


Tissue Name 


Rel. Exp.(%) 
Ag3463, Run 
169839351 


Secondary Thl act 


0.0 


HUVEC IL-lbeta 


0.0 


Secondary Th2 act 


0.0 


HUVEC IFN gamma 


0.0 


Secondary Trl act 


1.3 


HUVEC TNF alpha + 
IFN gamma 


0.0 


Secondary Thl rest 


2.6 


HUVEC TNF alpha + 
IL4 


0.0 


Secondary Th2 rest 


0.0 


HUVEC IL-11 


0.0 


Secondary Trl rest 


0.0 


Lung Microvascular EC 
none 


0.0 


Primary Thl act 


0.0 


Lung Microvascular EC 
TNFalpha + IL-lbeta 


0.0 


Primary Th2 act 


0.0 


Microvascular Dermal 
EC none 


0.0 


Primary Trl act 


0.0 


Microsvasular Dermal 
EC TNFalpha + IL-lbeta 


1.4 


Primary Thl rest 


0.0 


Bronchial epithelium 
TNFalpha + ILlbeta 


0.6 


Primary Th2 rest 


0.0 


Small airway epithelium 
none 


0.0 


Primary Trl rest 


0.0 


Small airway epithelium 
TNFalpha + IL-lbeta 


0.0 


CD45RA CD4 
lymphocyte act 


0.0 


Coronery artery SMC rest 


0.0 
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CD45ROCD4 1 0f> ICoronery artery SMC 
lymphocyte act ' {TNFalpha + IL-lbeta 


0.0 


CD8 lymphocyte act 


0.0 


Astrocytes rest 


0.0 


Secondary CD8 
lymphocyte iost 


0.0 


Astrocytes TNFalpha + 
[L-l beta 


0.0 


Secondary CD8 
lymphocyte act 


0.0 


KU-812 (Basophil) rest 


7.2 


CD4 lymphocyte none 


0.0 


KU-812 (Basophil) 
PMA/ionomycin 


100.0 


2ry Thl/Th2/Trl anti- 
CD95 CHI 1 


0.0 


CCD 1106 

^Keratinocytes) none 


0.0 


LAK cells rest 


0.0 


CCD 1106 
[Keratinocytes) 
TNFalpha + IL-lbeta 


1.1 


LAK cells IL-2 


0.0 


Liver cirrhosis 


1.2 


LAK cells IL-2+IL-12 


0.0 


iNUi-rizyz none 


0.0 


LAK cells IL-2+IFN 
gamma 


0.0 


NCI-H292IL-4 


0.0 


LAK cells IL-Z+ II,-! 8 


0.0 




0.0 


LAK cells 


0.0 


NCI-H292 IL-13 


0.0 


NK Cells IL-2 rest 


0.0 


NCI-H292 I FN gamma 


0.0 


Two Way MLR 3 day 


1.3 


HPAEC none 


1.2 


Two Way MLR 5 day 


0.0 


rlrf\CVs 1 IN I a 1 pi la ■ 1L 

1 beta 


0.0 


Two Way MLR 7 day 




Lung fibroblast none 


1.0 


PBMC rest 


0.0 


Lung fibroblast TNF 
alpha + IL-1 beta 


0.0 


PBMC PWM 


0.0 


Lung fibroblast IL-4 


0.0 


PBMC PHA-L 


0.0 


Lung fibroblast IL-9 


0.0 


Ramos (B cell) none 


1.3 


Lung fibroblast IL-13 


0.0 


Ramos (B cell) 
ionomycin 


0.0 


Lung fibroblast IFN 
gamma 


0.0 


B lymphocytes PWM 


0.0 


Dermal fibroblast 
CCD1 070 rest 


0.0 


B lymphocytes CD40L 
and IL-4 


1.2 


Derm 3.1 fibrobl&st 
CCD1070 TNF alpha 


0.0 


EOL-l dbcAMP 


0.0 


Dermal fibroblast 
CCD1070 IL-1 beta 


0.0 


EOL-l dbcAMP 
PMA/ionomycin 


0.0 


Dermal fibroblast IFN 
gamma 


0.0 


Dendritic cells none 


6.1 


Dermal fibroblast IL-4 


0.0 


Dendritic cells LPS 


0.0 


Dermal Fibroblasts rest 


0.0 


Dendritic cells anti- 
CD40 


1.2 


Neutrophils TNFa+LPS 


0.0 
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Monocytes rest 


0.0 


Neutrophils rest 


0.0 


Monocytes LPS 


0.0 


Colon 


0.5 


Macrophages rest 


4.1 


Lung 


0.0 


Macrophages LPS 


0.0 


Thymus 


0.0 


HUVEC none 


0.0 


Kidney 


2.3 


HUVEC starved 


0.0 







CNS_neurodegeneration_vl.O Summary: Ag3463 Expression of the CG59551-01 gene 
is low/undetectable in all the samples on this panel. (Data not shown.) 



Genera l_screeningjpanel_vl.4 Summary: Ag3463 The CG59551-01 gene encodes a 
putative GPCR. Highest expression of this gene is detected in an ovarian cancer cell line 
5 SK-OV-3 (CT=34). In addition, low expression of this gene is also observed in fetal 
skeletal muscle (CT= 34.4), one of the lung cancer cell line (CT= 34.9), and testis (CT= 
34.3). Thus, expression of this gene can be used to distinguish these sample from other 
samples used in this panel. In addition, therapeutic modulation of the activity of the GPCR 
encoded by this gene may be useful in the treatment of ovarian and lung cancer, fertility, 
1 0 hypogonadism, and muscle related diseases. 

Panel 4.1D Summary: Ag3463 The CG59551-01 gene encodes a putative GPCR. Highest 
expression of this gene is seen in KU-812 cells treated with PMA/ionomycin (CT=30.86). 
Thus, expression of this gene can be used to distinguish this sample from other samples 
used in this panel. In addition, expression of this gene is high in KU-812 (basophils) cells 
treated with PMA/ionomycin (CT=30.86) as compared to resting KU-812 cells 
(CT=34.66). Therefore, expression of this gene can be used to distinguish resting from 
PMA/ionomycin treated- basophils. It is known that GPCR-type receptors are important in 
multiple physiological responses mediated by basophils (ref. 1). Therefore, antibody or 
small molecule therapies designed with the protein encoded for by this gene could block or 
inhibit inflammation or tissue damage due to basophil activation in response to asthma, 
allergies, hypersensitivity reactions, psoriasis, and viral infections. 

References: 

1. Heinemann A., Hartnell A., Stubbs V.E., Murakami K., Soler D., LaRosa G., 
Askenase P.W., Williams T.J., Sabroe I. (2000) Basophil responses to chemokines are 
25 regulated by both sequential and cooperative receptor signaling. J. Immunol. 165: 7224- 

7233. 
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BH. CG59540-01: OLFACTORY RECEPTOR 

Expression of gene CG59540-01 was assessed using the primer-probe sets Ag3460 
and Agl519, described in Tables BHA and BHB. Results of the RTQ-PCR runs are shown 
in Tables BHC, BHD and BHE. 



Table BHA . Probe Name Ag3460 



Primers 


Sequences 


Length 


Start 
Position 


SEQID 
NO: 


Forward 


5 ' -tcagtgcagagatggagatctt-3 * 


22 


97 


574 


Probe 


TET-5 • -tgcatcttctccctgttatatctcttca-3 ' - 
TAMRA 


28 


126 


575 


Reverse 


5 1 -gacagatgagtcccatgttcat-3 • 


22 


171 


576 



Table BHB . Probe Name Agl 5 1 9 



Primers 


Sequences 


Length 


Start 
Position 


SEQID 
NO: 


Forwar i 


5 • -cctggccctcataaatctaatt-3 ' 


22 


503 


577 


Probe 


TET-5 ' -ctccttctaaggctgcccttctgtgg-3 * - 
TAMRA 


26 


525 


578 


Reverse 


5 ' -acagacagaatttcaccgaaga-3 ' 


22 


571 


579 



Table BHC . Panel 1.2 



Tissue Name 


Rel. Exp.(%) 
Agl519,Run 
142098791 


Tissue Name 


Rel. Exp.(%) 
Agl519, Run 
142098791 


Endothelial cells 


0.0 


Renal ca. 786-0 


32.1 


Heart (Fetal) 


1.3 


Renal ca. A498 


10.7 


Pancreas 


1.5 


Renal ca. RXF 393 


7.5 


Pancreatic ca. 
CAPAN 2 


3.6 


Renal ca. ACHN 


10.2 


Adrenal Gland 


4.7 


Renal ca. UO-31 


26.8 


Thyroid 


0.4 


Renal ca.TK-10 


14.0 


Salivary gland 


27.7 


Liver 


7.2 


Pituitary gland 


0.0 


I i\<:r (fetal) 


3.5 


Brain (fetal) 

„, .... 


0.0 


Liver ca. 

(hepatoblast) HepG2 


0.9 


Brain (whole) 


0.0 


Lung 


0.0 


Brain (amygdala) 


0.0 


Lung (fetal) 


1.3 


Brain (cerebellum) 


00 


Lung ca. (small cell) 


22.8 
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JLX-1 




Brain (hippocampus) 


n o jLung ca. (small cell) 
INCI-H69 


11.5 


Brain (thalamus) 


n ? jLung ca. (s.cell var.) 
jSHP-77 


0.0 


Cerebral Cortex 


n » jLung ca. (large 
lcell)NCI-H460 


2.3 


Spinal cord 


0.0 


Lung ca. (non-sm. 
cell) A549 


5.1 


glio/astro U87-MG 


0.4 


Lung ca. (non-s.cell) 
NCI-H23 


7.7 


glio/astroU-118-MG 


0.9 


Lung ca. (non-s.cell) 
HOP-62 


6.8 


astrocytoma SW1783 


0.0 


Lung ca. (non-s.cl) 
NCI-H522 


0.9 


neuro*; met SK-N- 
AS 


0.0 


Lung ca. (squam.) 
SW 900 


52.1 


astrocytoma SF-539 


0.0 


Lung ca. (squam.) 
NCI-H596 


4.1 


astrocytoma SNB-75 


0.0 


Mammary gland 


1 1.8 


glioma SNB-19 


4.0 


Breast ca.* (pl.ef) 
MCF-7 


11.3 


glioma U25 1 


0.0 


Breast ca.* (pl.ef) 
MDA-MB-231 


1.4 


glioma SF-295 


2.8 


Breast ca.* (pi. ef) 
T47D 


6.3 


Heart 


13.9 


Breast ca. BT-549 


0.0 


Skeletal Muscle 


0.2 


Breast ca. MDA-N 


12.5 


Bone marrow 


0.7 


Ovar> 


1.5 


Thymus; 


0.0 


Ovarian ca. 
OVCAR-3 


12.4 


Spleen 


0.7 


Ovarian ca. 
OVCAR-4 


23.8 


Lymph node 


0.0 


Ovarian ca. 
OVCAR-5 


38.2 


Colorectal Tissue 


4.5 


Ovarian ca. 
OVCAR-8 


35.6 


Stomach 


2.6 


Ovarian ca. IGROV- 
1 


2.4 


Small intestine 


~ r JOvarian ca. (ascites) 
[SK-OV-3 


11.2 


Colon ca. SW480 


3.1 litems 


1.7 


Colon ca.* SW620 
(SW480 met) 


12.3 [Placenta 


0.8 


Colon ca. HT29 


12.3 jProstate 


14.2 
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Colon ca.HCT-1 16 


14.3 


Prostate ca.* (bone 
met) PC-3 


12.6 


C< >1< n ca. ta( o-2 


11 .5 


Testis 


0.4 


Colon ca. Tissue 
(OD03866) 


5.7 


Melanoma 
Hs688(A).T 


6.5 


Colon ca. HCC-2998 


100.0 


Melanoma* (met) 
Hs688(B).T 


12.2 


Gastric ca.* (liver 
met) NCI-N87 


15.7 


Melanoma UACC- 
62 


0.0 


Bladder 


95.3 


Melanoma Ml 4 


10.3 


Trachea. 


1.0 


Melanoma LOX 
IMVI 


0.0 


Kidney 


55.9 


Melanoma* (met) 
SK-MEL-5 


0.0 


Kidney (fetal) 


7.7 






Table BHD. Panel 1.3D 


Tissue Name 


Rel. Exp.(%) 
Agl519, Run 
165529518 


Tissue Name 


Rel. Exp.(%) 
Agl519, Run 
165529518 


Liver adenocarcinoma 


0.0 


Kidne> (ietab 


0.0 


Pancreas 


38.7 


Renal ca. 786-0 


8.1 


Pancreatic ca. CAP AN 

2 


7.9 


Renal ca. A498 


0.0 


Adrenal gland 29.9 


Renal ca. RXF 393 


29.7 


Thyroid 


26.6 


Renal ca. ACHN 


13.4 


Salivary gland 


0.0 


Renal ca. UO-31 


0.0 


Pituitary gland 


17.2 


Renal ca.TK-10 


16.8 


Brain (fetal) 


0.0 


Liver 


0.0 


Brain (whole) 


0.0 


Liver (fetal) 


27.4 


Brain (amygdala) 


0.0 


Liver ca. 

(hepatoblast) HepG2 


0.0 


Brain (cerebellum) 


0.0 


Lung 


0.0 


Brain (hippocampus) 


0.0 


Lung (fetal) 


15.6 


Brain (substantia nigra) 


0.0 


Lung ca. (small cell) 
LX-1 


50.7 


Brain (thalamus) 


0.0 


Lung ca. (small cell) 
NCI-H69 


0.0 


Cerebral Cortex 


0.0 


Lung ca. (s.cell var.) 
SHP-77 


25.0 


Spinal cord 


0.0 


Lung ca. (large 
cell)NCI-H460 


26.1 


glio/astro U87-MG 


0.0 


Lung ca. (non-sm. 


0.0 
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cell) A549 




glio/astroU-118-MG 


0.0 


Lung ca. (non-s.cell) 
NCI-H23 




0.0 


astrocytoma SW1783 


0.0 


Lung ca. (non-s.cell) 
HOP-62 


27.4 


neuro*; met SK-N-AS 


0.0 


Lung ca. (non-s.cl) 
NCI-H522 


0.0 


astrocyloma SF-539 


0.0 


Lung ca. (squam.) 
SW 900 


18.0 


astrocyloma SNB-75 


0.0 


Lung ca. (squam.) 
NCI-H596 


0.0 


glioma SNB-19 


0.0 


Mammary gland 


27.0 


glioma U251 


18.8 


Breast ca.* (pl.ef) 
MCF-7 


27.0 


glioma SF-295 


0.0 


Breast ca.* (pl.ef) 
MDA-MB-231 


45.7 


Heart (fetal) 


16.4 


Breast ca.* (pl.ef) 
T47D 


13.7 


Heart 


0.0 


Breast ca. BT-549 


0 0 


Skeletal! muscle (fetal) 


0.0 


Breast ca. MDA-N 


13 8 


Skeleta' muscle 


18.8 


< >vary 


0.0 


Bone marrow 


0.0 


Ovarian ca. 
OVCAR-3 


0.0 


Thymus 


0.0 


Ovarian ca. 
OVCAR-4 


11.4 


Spleen 


0.0 


Ovarian ca. 
OVCAR-5 


2.6 


Lymph node 


34.4 


Ovarian ca. 
OVCAR-8 


12.3 


Colorectal 


100.0 


Ovarian ca. IGROV- 
1 


0.0 


Stomach 


0.0 


Ovarian ca.* 
(ascites) SK-OV-3 


33.7 


Small intestine 


0.0 


Uterus 


0.0 


Colon ca. SW480 


22.8 


Placenta 


17.6 


Colon ca.* 
SW620(SW480 met) 


10.0 


Prostate 


0.0 


Colon ca. HT29 


16.7 


Prostate ca.* (bone 
met)PC-3 


0.0 


Colon ca. HCT-116 


16 8 


Testis 


0.0 


Colon ca. CaCo-2 


15.6 


Melanoma 
Hs688(A).T 


15.1 


Colon ca. 
tissue(OD03866) 


28.9 


Melanoma* (met) 
Hs688(B).f 


9.0 


Colon ca. HCC-2998 


31.0 


Melanoma UACC- 


0.0 
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62 




Gastric ca.* (liver met) 
NCI-N87 


36.6 


Melanoma Ml 4 


26.2 


Bladder 


51.4 


Melanoma LOX 
IMVI 


0.0 


Trachea 


0.0 


Melanoma* (met) 
SK-MEL-5 


14.1 


Kidney 


56.3 


Adipose 


0.0 



Table B HE . Panel 2D 



Tissue Name 


Rel. Exp.(%) 
Agl519,Run 
145158010 


Tissue Name 


Rel. Exp.(%) 
Agl519, Run 
145158010 


Normal Colon 


81.8 


Kidney Margin 
8120608 


5.0 


CC WelltoModDiff 
(OD03866) 


6.0 


Kidney Cancer 
8120613 


0.0 


CC Margin (OD03866) 


7.3 


Kidney Margin 
8120614 


1.9 


CC Gr.2 rectosigmoid 
(ODQ3868) 


5.8 


Kidney Cancer 
9010320 


7.6 


CC Margin (OD03868) 


0.0 


Kidney Margin 
9010321 


5.8 


CCModDiff 
(ODO3920) 


18.6 


Normal Uterus 


4.2 


CC Margin (ODO3920) 


10.6 


Uterus Cancer 
064011 


47.3 


CC Gr.2 ascend colon 
(OD03921) 


8.2 


Normal Thyroid 


21.6 


CC Margin (OD03921) 


4.8 


Thyroid Cancer 
064010 


42.3 


CC from Partial 
Hepatectomy 
(ODO4309) Mets 


47.6 


Thyroid Cancer 
A302152 


20.9 


Liver M argin 
(ODO4309) 


10.4 


Thyroid Margin 
A302153 


59.5 


Colon mets to lung 
(OD04451-01) 


12.2 


Normal Breast 


71.2 


Lung Margin (OD0445 1 - 
02) 


6.5 


Breast Cancer 
(OD04566) 


15.7 


Normal Prostate 6546-1 


11.6 


Breast Cancer 
(OD04590-01) 


19.9 


Prostate Cancer 
(OD04410) 


31.6 


Breast Cancer Mets 
(OD04590-03) 


41.5 
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Prostate Margin 
(OD04410) 


25.5 


Breast Cancer 

Metastasis 

(OD04655-05) 


33.7 


Prostate; Cancer 
(OD04720-01) 


27.2 


Breast Cancer 
064006 


27.0 


Prostate: Margin 
(OD04720-02) 


31.4 


Breast Cancer 1024 


48.0 


Normal Lung 061010 


25.2 


Brenst Cfmccr 
9100266 


3.3 


Lung Met to Muscle 
(OD04286) 


6.2 


Breast Margin 
9100265 


7.8 


Muscle Margin 
(OD04286)~ 


00 


Breast Cancer 
A209073 


24.8 


Lung Malignant Cancer 
(OD03126) 


39.0 


Breast Margin 
A209073 


32.3 


Lung Margin (OD03126) 


12.0 


Normal Liver 


3.5 


Lung Cancer (OD04404) 


4.9 


Liver Cancer 064003 


56.6 


Lung Margin (OD04404) 


27.9 


Liver Cancer 1025 


7.2 


Lung Cancer (OD04565) 


11.9 


Liver Cancer 1 026 


1.8 


Lung Margin (OD04565) 


1.4 


Liver Cancer 6004-T 


6.0 


i_.ung v^ant-er \ i- 
01) 


52.5 


Liver Tissue 6004-N 


0.0 


T una Maroin fTYnf>A?^7- 

02) 


20.7 


Liver Cancer 6005 -T 


5.6 


(ODO4310) 


5.6 


Liver Tissue 6005-N 


0.0 


Liver N^Efirgin 
(ODO4310) 


2.2 


Normal Bladder 


24.0 


Melanoma Mets to Lung 
(OD04321) 


0.0 


Bladder Cancer 1023 


3.3 


Lung Margin (OD04321) 


24.7 


Bladder Cancer 
A302173 


5.7 


Normal Kidney 


100.0 


Bladder Cancer 
(OD047 18-01) 


2.9 


Kidney Ca, Nuclear 
grade 2 (OD04338) 


34.4 


Bladder Normal 
Adjacent (OD047 1 8- 
03) 


0.0 


Kidney Margin 
(OD04338) 


54.7 


Normal Ovary 


3.9 


Kidney Ca Nuclear 
grade 1/2 (OD04339) 


81.8 


Ovarian Cancer 
064008 


7.2 


Kidney Margin 
(OD04339) 


48.3 


Ovarian Cancer 
(OD04768-07) 


38.4 


Kidney Ca, Clear cell 
type (OD04340) 


11.0 


Ovary Margin 
(OD04768-08) 


5.1 
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Kidney Margin 
(OD04340) 


56.6 


Normal Stomach 


11.4 


Kidney Ca, Nuclear 
grade 3 (OD04348) 


3.4 


Gastric Cancer 
9060358 


6.5 


Kidney Margin 
(OD04348) 


43.2 


Stomach Margin 
9060359 


0.0 


Kidney Cancer 
(OD04622-01) 


11.5 


Gsstrlc Ccinccr 
9060395 


6.7 


Kidney Margin 
(OD04622-03) 


3.5 


Stomach Margin 
9060394 


4.5 


Kidney Cancer 
(OD04450-01) 


17.8 


Gastric Cancer 
9060397 


0.0 


Kidney Margin 
(OD04450-03) 


42.0 


Stomach Margin 
9060396 


6.7 


Kidney Cancer 8120607 


0.0 


Gastric Cancer 
064005 


16.6 



CNS_neurodegeneration_vl.O Summary: Ag3460 Expression of the CG59540-01 gene 
is low/undetectable (CT values > 35) across the samples in this panel. 



General_screening_panel_vl.4 Summary: Ag3460 Expression of the CG59540-01 gene 
is low/undetectable (CT values > 35) across the samples in this panel. 

5 Panel 1.2 Summary: Agl 51 9 The expression of the CG5 9540-01 gene appears to be 
highest in a sample derived from a colon cancer cell line (HCC-2998) (CT=28.2). In 
addition, there is substantial expression associated with normal kidney and bladder. Thus, 
the expression of this gene could be used to distinguish these tissues from other tissues in 
the panel. In addition there was noted expression clustered in ovarian, renal and colon 
1 0 cancer cell lines. Therefore, therapeutic modulation of this gene, through the use of small 
molecul e drugs, antibodies or protein therapeutics might be of use in the treatment of colon 
cancer, renal cancer or ovarian cancer. 

Among tissues with metabolic function, there is moderate expression in fetal and 
adult heart, adrenal, and pancreas. This expression suggests that therapeutic modulation of 
1 5 the expression or function of the protein encoded by this gene may be useful in the 
treatment of diseases that involve these tissues, including obesity and diabetes. 

In addition, there appears to be higher levels of expression in adult heart (CT=31) 
when compared to expression in fetal heart (CT=34.4). Thus, expression of this gene could 
be used to differentiate between adult and fetal heart tissue. Conversely, expression of this 
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gene is higher in fetal lung (CT=34.5) than in adult lung (CT=40). Thus, expression of this 
gene could also be used to differentiate between adult and fetal lung. 

Panel 1.3D Summary: Agl519 Significant expression the CG59540-01 gene is limited to 
a sample derived from colorectal tissue (CT=34.3). Thus, expression of this gene could be 
5 used to differentiate between this sample and other samples on this panel, and between 
colorectal tissue and other normal or malignant tissues. 

Panel 2D Summary: Agl519 The expression of the CG59540-01 gene in panel 2 appears 
to be highest in a samples derived from normal kidney tissue (CT=32). In addition there 
appears to be substantial difference in expression between normal kidney adjacent to 
10 cancer tissue and the cancer tissue itself. Thus, the expression of this gene could be used to 
distinguish normal kidney tissue from other samples in the panel. In addition, the 
expressiion of this gene could be used to distinguish normal kidney from malignant tissue. 
Moreover, therapeutic modulation of this gene, through the use of small molecule drugs, 
antibodiies or protein therapeutics might be of use in the treatment of kidney cancer. 

15 Panel 4D Summary: Ag3460 Expression of the CG59540-01 gene is low/undetectable 
(CT values > 35) across the samples in this panel. 

BI. CG59280-01 and CG59280-02: OLFACTORY RECEPTOR 

Expression of gene CG59280-01 and CG59280-02 was assessed using the primer- 
probe set Ag3527, described in Table BIA. Results of the RTQ-PCR runs are shown in 
20 Table BIB. Please note that CG59280-02 represents a full-length physical clone of the 
CG59280-01 gene, validating the prediction of the gene sequence. 



Table B IA. Probe Name Ag3527 



Primers 


Sequences 


Length 


Start 
Position 


SEQ IB- 
NO: 


Forward 


5 ' -atggccattgataggtacgtt-3 ' 


21 


361 


580 


Probe 


TET- 5 1 -catctgtaaccctctccgctacccaa-3 ' - 
TAMRA 


26 


384 


581 


Reverse 


5 1 -ccacagagagctgaacacaga-3 1 


21 


428 


582 



Table B IB. Panel 4D 



| Ti ssue Name j ReLExp ^%) P" Tissue Nam e { Rel. Exp.(%) | 
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Ag3527, Run 
166446354 




Ag3527, Run 
166446354 


Secondary Thl act 


0.0 


HUVEC IL-lbeta 


0 0 


Secondary Th2 act 


0.0 


HUVEC IFN gamma 


0.0 


Secondary Trl act 


0.0 


HUVEC TNF alpha + 
IFN gamma 


0.0 


Secondary Thl rest 


0.0 


HUVEC TNF alpha + 
IL4 


0.0 


Secondary Th2 rest 


0.0 


HUVEC IL-1 1 


0.0 


Secondary Trl rest 


2.0 


Lung Microvascular EC 
none 


0.0 


Primary Thl act 


0.0 


Lung Microvascular EC 
TNFalpha + IL-1 beta 


0.0 


Primary Th2 act 


0.0 


Microvascular Dermal 
EC none 


4.2 


Primary Trl act 


0.0 


Microsvasular Dermal 
EC TNFalpha + IL-1 beta 


0.0 


Primary Thl rest 


0.0 


Bronchial epithelium 
TNFalpha + IL1 beta 


4.2 


Primary Th2 rest 


0.0 


Small airway epithelium 


0.0 


Primary Trl rest 


0.0 


Small airway epithelium 
TNFalpha + IL-lbeta 


0.0 


CD45RA CD4 
lymphocyte act 


0.0 


Coronery artery SMC rest 


0.0 


CD45RO CD4 
lymphocyte act 


0.0 


Coronery artery SMC 
TNFalpha + IL-lbeta 


0.0 


CD8 lymphocyte act 


0.0 


Astrocytes rest 


0 0 


Secondary CD8 
lymphocyte rest 


0.0 


Astrocytes TNFalpha + 
IL-lbeta 


0.0 


Secondary CD8 
lymphocyte act 


0.0 


KU-812 (Basophil) rest 


0.0 


CD4 lymphocyte none 


4.4 


KU-olz (Basophil) 
PMA/ionomyci n 


0.0 


2ry Thl/Th2/Trl anti- 
CD95 CH11 


0.0 


CCD1106 

(Keratinocytes) none 


0.0 


LAK cells rest 


0.0 


CCD1 106 
(Keratinocytes) 
TNFalpha + IL-1 beta 


0.0 


LAK cells 1L-2 


0.0 


Liver cirrhosis 


100.0 


LAK cells IL-2+IL-12 


0.0 


Lupus kidney 


0.0 


LAK cells IL-2+IFN 
gamma 


5.1 


NCI-H292 none 


0.0 


LAK cells IL-2+ IL-18 


0.0 


NCI-H292 1L-4 


0.0 


LAK cells 


0.0 


NCI-H292 IL-9 


0.0 
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PMA/ionomycin J j 




NK Cells IL-2 rest 


0.0 


NCI-H292IL-13 


0.0 


Two Way MLR 3 day 


00 


NCI-H292 IFN gamma 


0.0 


Two Way MLR 5 day 


00 


HPAEC none 


4.5 


Two Way MLR 7 day 


0.0 


HPAEC TNF alpha + IL- 
1 beta 


0.0 


PBMC rest 


3.1 




0 0 


PBMC PWM 


0.0 


Lung fibroblast TNF 

dlr»Via 4- TT 1 K t <i 

dipna ■ iju-i ueia 


0.0 


PBMC PHA-L 


0.0 


Lung fibroblast IL-4 


0.0 


Ramos (B cell) none 


0.0 


Lung fibroblast IL-9 


0 0 


Ramos (B cell) 
ionomycin 


0.0 


Lung fibroblast IL-13 


0.0 


B lymphocytes PWM 


0.0 


Lung fibroblast IFN 
gamma 


0.0 


B lymphocytes CD40L 
and IL-4 


0.0 


Dermal fibroblast 
CCD 1070 rest 


0.0 


EOL-1 dbcAMP 


0.0 


Dermal fibroblast 
<■...':. uio/o lJNr aipna 


0.0 


1 AUr- A TV/fP 
JtA/L-I UDCAMr 

PMA/ionomycin 


0.0 


Dermal fibroblast 
CCD 1070 IL-1 beta 


0.0 


Dendritic cells none 


4.5 


Dermal fibroblast IFN 
gamma 


0.0 


Dendritic cells LPS 


5.3 


Dermal fibroblast IL-4 


0.0 


Dendritic cells anti- 
CD40 


9.9 


IBD Colitis 2 


10.8 


Monocytes rest 


0.0 


FbD Crohn's 


8.9 


Monocytes LPS 


27.0 


Colon 


0.0 


Macrophages re - 1 


9.5 


Lung 


4.6 


Macrophages LPS 


4.0 


Thymus 


0.0 


HUVEC none 


0.0 


Kidney 


0.0 


UU\ EC starved 


0.0 







CNS_neurodegeneration_vl.O Summary: Ag3527 Expression of the CG59280-01 gene 
is low/undetectable (CT values > 35) across the samples in this panel. (Data not shown.) 



General_screening_panel_vl.4 Summary: Ag3527 Expression of the CG59280-01 gene 
is low/undetectable (CT values > 35) across the samples in this panel. (Data not shown.) 
This gene encodes a G protein-coupled receptor (GPCR), a type of cell surface receptor 
involved in signal transduction. It is most similar to members of the odorant receptor 
subfamily of GPCRs. Based on analogy to other odorant receptor genes, we predict that 
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expression of this gene may be highest in nasal epithelium, a sample not represented on this 
panel. 

Panel 4D Summary: Ag3527 Highest expression of the CG59280-01 gene is seen in the 
liver cirrhosis sample(CT=31.81). Thus, expression of this gene could be used to 
5 differentiate between this sample from the other samples on this panel and as a marker to 
detect the presence of liver cirrhosis. Furthermore, expression of this gene is not detected in 
normal liver in Panel 1.4, suggesting that its expression is unique to liver cirrhosis. This 
gene encodes a putative GPCR; therefore, antibodies or small molecule therapeutics could 
reduce or inhibit fibrosis that occurs in liver cirrhosis. In addition, antibodies to this 
1 0 putative GPCR could also be used for the diagnosis of liver cirrhosis. 

BJ. CG 59568-01: GPCR 

Expression of gene CG595 68-01 was assessed using the primer-probe set 
Ag3474, described in Table BJA. Results of the RTQ-PCR runs are shown in Table 
BJB. 

1 5 Table BJA . Probe Name Ag3474 



Primers 


Sequences 


Length 


Start 
Position 


SEQID 
NO: 


Forward 


5 ' -ttacagcaatcaccatggtctt-3 ' 


22 


475 


760 


Probe 


TET-5 ' -accttctgtggaccctatgaaactga-3 ' - 

TAMRA 


26 


510 


761 


Reverse 


5 ' -gggtgaagtcacaaaagaagtg-3 ' 


22 


537 


762 



Table B JB. Panel 4D 



Tissue Name 


Rel. Exp.(%) 
Ag3474, Run 
166417193 


Tissue Name 


Rel. Exp.(%) 
Ag3474, Run 
166417193 


Secondary Thl act 


7.1 


HUVECIL-lbeta 


0.0 


Secondary Th2 act 


0.0 


HUVEC IFN gamma 


0.0 


Secondary Trl act 


4.2 


HUVEC TNF alpha + 
IFN gamma 


r "' T " ' 
0.0 


Secondary Thl rest 


0.0 


HUVEC TNF alpha + 
IL4 


0.0 


Se .v.ndary Th2 rest 


0.0 


HUVEC IL-1I 


0.0 


Secondary Trl rest 


0.0 


Lung Microvascular EC 
none 


0.0 
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Primary Thl act 


0.0 


Lung Tvlicrovctsculsir EC 
TNFalpha + IL-lbeta 


0.0 


Primary Th2 act 


2.2 


Microvascular Dermal 
EC none 


0.0 


Primary Trl act 


0.0 


Microsvasular Dermal 
EC TNFalpha + IL-lbeta 


0.0 


Primary Thl rest 


3.3 


Bronchial epithelium 
TNFalpha + IL1 beta 


0.0 


Primary Th2 rest 


5.9 


Small airway epithelium 


0.0 


Primary Trl rest 


0.0 


Small airway epithelium 
TNFalpha + IL-lbeta 


0.3 


CD45RA CD4 
lymphocyte act 


0.0 


Coronery artery SMC rest 


0.0 


CD45RO CD4 
lymphocyte act 


0.0 


Coronery artery SMC 
TNFalpha + IL-lbeta 


0.0 


CD8 lymphocyte act 


0.0 


Astrocytes rest 


0.0 


Secondary CD8 
lymphocyte rest 


1.7 


Astrocytes TNFalpha + 
IL-lbeta 


0.0 


Secondary CD8 
lymphocyte act 


0.0 


KU-812 (Basophil) rest 


0.0 


CD4 lymphocyte none 


0.0 


KU-olz (.Basophil) 
PMA/ionomycin 


3.1 


2ryThl/Th2/Trl anti- 
CD95 CHI 1 


0.0 


CCD 11 06 

(Keratinocytes) none 


0.0 


LAK cells rest 


0.0 


CCD 1106 
(Keratinocytes) 
TNFalpha + IL-lbeta 


0.5 


LAK cells IL-2 


0.0 


Liver cirrhosis 


100.0 


LAK cells IL-2+IL-12 


3.6 


Lupus kidney 


0.0 


LAK cells IL-2+IFN 
gamma 


3.9 


NC1-H292 none 


0.0 


LAK cells IL-2+IL-18 


2.4 


NCI-H292 IL-4 


0.0 


LAK cells 
PMA/ionomycin 


0.0 


NCI-H292 IL-9 


0.0 


NK Cell s IL-2 rest 


0.0 


NCI-H292 IL-13 


0.0 


Two Way MLR 3 day 


0.0 


NCI-H292 IFN gamma 


0.0 


Two Way MLR 5 day 


0.0 


HPAEC none 


0.0 


Two Way MLR 7 day 


0.0 


HPAEC TNF alpha + IL- 
1 beta 


0.0 


PBMC rest 


0.0 


Lung fibroblast none 


0.0 


PBMC PWM 


0.0 


Lung fibroblast TNF 
alpha + IL-1 beta 


6.2 


PBMC PHA-L 


0.0 


Lung fibroblast IL-4 


0.0 


Ramos (B cell) none 


0.0 


Lung fibroblast IL-9 


0.0 
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Ramos (B cell) 
ionomycin 


0.0 


Lung fibroblast IL-1 3 


0.3 


B lymphocytes PWM 


8.4 


Lung fibroblast IFN 
gamma 


0.0 


B lymphocytes CD40L 
and IL-4 


0.0 


Dermal fibroblast 
CCD 1070 rest 


0.0 


EOL-1 dbcAMP 


0.0 


Dermal fibroblast 
CCD1070 TNF alpha 


1.6 


EOL-1 dbcAMP 
PMA/ionomycin 


0.0 


Dermal fibroblast 
CCD 1070 IL-1 beta 


0.0 


Dendritic cells none 


0.0 


Dermal fibroblast IFN 
gamma 


0.0 


Dendritic cells LPS 


1.5 


Dermal fibroblast IL-4 


0.0 


Dendriti c cells anti- 
CD40 


0.0 


IBD Colitis 2 


11.9 


Monocytes rest 


0.0 


IBD Crohn's 


0.0 


Monocytes LPS 


5.8 


Colon 


I 5 


Macrophages rest 


3.4 


Lung 


0.0 


Macrophages LPS 


0.0 


Thymus 


3 8 


HUVEC none 


0.0 


Kidney 


6.7 


HUVEC starved 


0.0 







CNS_neurodegeneration_vl.O Summary: Ag3474 Expression of the CG59568-01 gene 
is low/undetectable (CT values > 35) across the samples in this panel. (Data not shown.) 



General_screening_panel_vl.4 Summary: Ag3474 Expression of the CG59568-01 gene 
is low/undetectable (CT values > 35) across the samples in this panel. (Data not shown.) 
5 This gene encodes a G protein-coupled receptor (GPCR), a type of dell surface receptor 
involved in signal transduction. It is most similar to members of the odorant receptor 
subfamily of GPCRs. Based on analogy to other odorant receptor genes, we predict that 
expression of this gene may be highest in nasal epithelium, a sample not represented on this 
panel. 

10 Panel 4D Summary: Ag3474 Highest expression of the CG59280-01 gene is seen in the 
liver cirrhosis sample(CT=31.37). Thus, expression of this gene could be used to 
differentiate between this sample from the other samples on this panel and as a marker to 
detect the presence of liver cirrhosis. Furthermore, expression of this gene is not detected in 
normal liver in Panel 1.4, suggesting that its expression is unique to liver cirrhosis. This 

1 5 gene encodes a putative GPCR; therefore, antibodies or small molecule therapeutics could 
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reduce or inhibit fibrosis that occurs in liver cirrhosis. In addition, antibodies to this 
putative GPCR could also be used for the diagnosis of liver cirrhosis. 



References: 

1. Mark M.D., Wittemann S., Herlitze S. (2000) G protein modulation of 
recombinant P/Q-type calcium channels by regulators of G protein signalling proteins. J. 
Physiol. 528 Pt 1:65-77. 

BK. CG59224-01 and CG59216-01: GPCR 

Expression of gene CG59224-01 and variant CG59216-01 was assessed using the 
primer-probe sets Ag3400 and Ag3405, described in Tables BKA and BKB. Results of the 
RTQ-PCR runs are shown in Table BKC. 



Table BKA . Probe Name Ag3400 



Primers 


Sequences 


Length 


Start 
Position 


SEQID 
NO: 


Forward 


5 ' -tctctgacctagggctgtctct-3 ' 


22 


225 


584 


Probe 


TET-5 ' -tcttccttacccatcactttgggact-3 ' - 
TAMRA 


26 


248 


585 


Reverse 


5 ' -catgaatttcatggacatcaaa-3 1 


22 


281 


586 



Table BKB . Probe Name Ag3405 



Primers 


Sequences 


Length 


Start 
Position 


SEQ ID 
NO: 


Forward 


5 ' -cacatctgtgctgtgcttatct-3 1 


22 


746 


587 


Probe 


TET-5 * -agtgctgccatgctccaccagttt-3 ' - 
TAMRA 


24 


785 


588 


Reverse 


5 * -acgtggatcataggagacacat-3 1 


22 


816 


589 



Table BKC . General_screening_panel_vl.4 



Tissue Name 


ReL Exp.(%) 
Ag3400, Run 
216822602 


ReL Exp.(%) 
Ag3405, Run 
216838741 


Tissue Name 


Rel. Exp.(%) 
Ag3400, Run 
216822602 


Rel. Exp.(%) 
Ag3405, Run 
216838741 


Adipose 


0.0 


0.0 


Renal ca. TK-10 


0.0 


0.0 


Melanoma* 
Hs688(A).T 


0.0 


0.0 


Bladder 


1.2 


0.0 


Melanoma* 


0.0 


0.0 


Gastric ca. (liver 


0.0 


0.0 
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Hs688(B).T 






met.) NCI-N87 






Melanoma* 
M14 


0.0 


0.0 


Gastric ca. 
KATO III 


0.0 


0.0 


Melanoma* 
LOXIMVI 


0.0 


0.0 


Colon ca. SW- 
948 


0.0 


0.0 


Melanoma* 
SK-MEL-5 


0.0 


0.0 


Colon ca. 
SW480 


0.0 


0.0 


Squamous 
cell 

carcinoma 
SCC-4 


0.0 


0.0 


Colon ca.* 
(SW480 met) 
SW620 


0.0 


0.0 


Testis Pool 


0.0 


0.0 


Colon ca. HT29 


0.0 


0.0 


Prostate ca.* 
(bone met) 
PC-3 


0.0 


0.0 


Colon ca. HCT- 
116 


0.0 


0.0 


Prostate Pool 


3.2 


1.8 


Colon ca. CaCo- 
2 


0.0 


0.0 


Placenta 


0.0 


0.0 


Colon cancer 
tissue 


0.0 


0.0 


Uterus Pool 


0.0 


0.0 


Colon ca. 
SW1116 


0.0 


0.0 


Ovarian ca. 
OVCAR-3 


0.0 


0.0 


Colon ca. Colo- 
205 


0.0 


0.0 


Ovarian ca. 
SK-OV-3 


1.0 


0.0 


Colon ca. SW-48 


0.0 


0.0 


Ovarian ca. 


0.0 


0.0 


Colon Pool 


0.0 


0.0 


Ovarian ca. 


0.0 


0.0 


Small Intestine 
Pool 


0.0 


0.0 


Ovarian ca. 
IGROV-1 


0.0 


0.0 


Stomach Pool 


0.0 


0.0 


Ovarian ca. 
OVCAR-8 


0.0 


0.0 


Bone Marrow 
Pool 


0.0 


0.0 


Ovary 


0.0 


0.0 


Fetal Heart 


0.0 


0.0 


Breast ca. 
MCF-7 


0.0 


0.0 


Heart Pool 


1.7 


1.3 


Breast ca. 

MDA-MB- 

231 


0.0 


0.0 


Lymph Node 
Pool 


0.0 


0.0 


Breast ca. BT 

549 


0.0 


0.0 


Fetal Skeletal 
Muscle 


0.0 


0.0 


Breast ca. 
T47D 


0.0 


0.0 


Skeletal Muscle 
Pool 


0.0 


0.0 


Breast ca. 
MDA-N 


0.0 


0.5 


Spleen Pool 


0.0 


0.0 


Breast Pool 


0.0 


0.5 


Thymus Pool 


o.o" 


0.5 



824 



Trachea 


0.0 


0.0 


CNS cancer 
(glio/astro) U87- 
MG 


0.0 


0.0 


Lung 


0.0 


0.0 


CNS cancer 
(glio/astro) U- 


0.0 


0.0 


Fetal Lung 


0 0 




CNS cancer 
(neuro;met) SK- 
N-AS 




0.0 


Lung ca.. 
N( I-N417 


0.0 


0.0 


CNS cancer 
(astro) SF-539 


0.0 


0.0 


Lung ca.. LX- 
1 


0.0 


0.0 


CNS cancer 
(astro) SNB-75 


0.0 


0.0 


Lung ca . 
NCI-H146 


2.5 


3.3 


CNS cancer 
(glio) SNB-19 


0.0 


0.0 


Lung C&. 
SHP-77 


100.0 


100.0 


CNS cancer 
(glio) SF-295 


0.0 


0.0 


Lung ca.. 
A549 


0.0 


0.0 


Brain 

(Amygdala) Pool 


0.0 


0.0 


Lung ca. 
INI i-H2Zo 


0.0 


0.0 


Brain 

(cerebellum) 


0.0 


0.0 


Lung ca. 


0.0 


0.0 


Brain (fetal) 


0.0 


0.0 


Lung ca. 
NCI-H460 


0.0 


0.0 


Brain 

(Hippocampus) 
Pool 


0.0 


0.0 


Lung ca. 
HOP-62 


0.0 


0.0 


Cerebral Cortex 
Pool 


0.0 


0.6 


Lung ca. 
NC1-H522 


0.0 


0.0 


Brain (Substantia 
nigra) Pool 


0.0 


0.0 


Liver 


0.0 


0.0 


(Thalamus) Pool 


0.0 


0.0 


Fetal Liver 


0.0 


0.0 


Brain (whole) 


0.0 


0.0 


Liver ca. 
HepG2 


0.0 


0.0 


Spinal Cord Pool 


0.0 


0.0 


Kidney Pool 


0.0 


0.4 


Adrenal Gland 


0.0 


0.0 


Fetal Kidney 


10.8 


2.6 


Pituitary gland 
Pool 


0.0 


0.0 


786-0 


0.0 


0.0 


Salivary Gland 


0.0 


0.0 


Renal ca. 
A498 


0.0 


0.0 


Thyroid (female) 


0.0 


0.0 


Renal ca. 
\CHN 


0.0 


0.0 


Pancreatic ca. 
CAPAN2 


0.0 


0.0 


Renal ca. 
UO-31 


0.0 


0.0 


Pancreas Pool 


0.0 


1.2 
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CNS_neurodegeneration_vl.O Summary: Ag3400/Ag3405 Expression of the CG59224- 
01 gene is low/undetectable (CT values > 35) across the samples in this panel. (Data not 
shown.) 

General_screening_panel_vl.4 Summary: Ag3400/Ag3405 Two experiments with two 
5 different probe and primer sets produce results that are in excellent agreement, with 
significant expression of the CG59224-01 gene exclusively in a lung cancer cell line 
sample (CTs = 30-33). Therefore, expression of this gene may be used to distinguish this 
sample from other samples on this panel and as a marker for lung cancer. Furthermore, 
therapeutic modulation of the activity of the GPCR encoded by this gene may be beneficial 
1 0 in the treatment of lung cancer. 

Panel 4D Summary: Ag3400/Ag3405 Expression of the CG59224-01 gene is 
low/undetectable (CT values > 35) across the samples in this panel. (Data not shown.) This 
gene encodes a G protein-coupled receptor (GPCR), a type of cell surface receptor 
involved in signal transduction. It is most similar to members of the odorant receptor 
1 5 subfamily of GPCRs. Based on analogy to other odorant receptor genes, we predict that 

expression of this gene may be highest in nasal epithelium, a sample not represented on this 
panel. 

BL. CG59214-01 and CG59214-01: GPCR 

Expression of gene CG59214-01 and CG59214-01 was assessed using the primer- 
20 probe sets Ag3398 and Ag3404, described in Tables BLA and BLB. Results of the RTQ- 
PCR runs are shown in Tables BLC and BLD. 

Table BLA . Probe Name Ag3398 



Primers 


Sequences 


Length 


Start 
Position 


SEQ 
IDNO: 


Forward 


5 ' -atacttgcatctcccacatctg-3 ' 


22 


724 


590 


Probe 


TET-5 ' -caccaatgattgggctatctatgatcca- 
3 ' -TAMRA 


28 


766 


591 


Reverse 


5 ' -tgaggaagcattctgtccatag-3 • 


22 


797 


592 


Table BLB. Probe Name Ae3404 


Primers 


Sequences 


Length 


Start 
Position 


SEQ ID 
NO: 
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Forward 


5 ' -atacttgcatctcccacatctg-3 • 


22 


724 


593 


Probe 


TET-5' -caccaatgattgggctatctatgatcca- 
3 ' -TAMRA 


28 


766 


594 


Reverse 


5 ' -tgaggaagcattctgtccatag-3 ' 


22 


797 


595 



Table BLC . General_screening_panel_vl.4 



Tissue Name 


Rel. Exp.(%) 
Ag3398, Run 
216822567 


Rel. Exp.(%) 
Ag3404, Run 
216838380 


Tissue Name 


Rel. Exp.(%) 
Ag3398, Run 
216822567 


Rel. Exp.(%) 
Ag3404, Run 
216838380 


Adipose 


0.0 


0 0 


Renal ca.TK-10 


0.0 


0.0 


Melanoma* 
Hs688(A).T 


0.0 


0.0 


Bladder 


0.0 


0.0 


Melanoma* 
Hs688(B).T 


0.0 


0.0 


Gastric ca. (liver 
met.) NCI-N87 


0.0 


0.0 


Melanoma* 
M14 


0 0 


0 0 


Gastric ca. 
KATO III 


0.0 


0.0 


Melanoma* 
LOXIMVI 


0 0 




Colon ca. SW- 
948 


0.0 


0.0 


Melanoma* 
SK-MEL-5 


0 0 
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Colon ca. 
SW480 




0.0 


Squamous 
cell 

carcinoma 
SCC-4 


0.0 


0.0 


Colon ca.* 
(SW480 met) 
SW620 


0.0 


0.0 


Testis Pool 


0.0 


0.0 


Colon ca. HT29 


0.0 


0.0 


Prostate ca.* 
(bone met) 
PC-3 


0.0 


0.0 


Colon ca. HCT- 
116 


0.0 


0.0 


Prostate Pool 


0.0 


0.0 


Colon ca. CaCo- 
2 


0.0 


0.0 


Placenta 


0.0 


0.0 


Colon cancer 
tissue 


0.0 


0.0 


Uterus Pool 


0.0 


0.0 


Colon ca. 
SW 1 116 


0.0 


0.0 


Ovarian ca. 
OVCAR-3 


0.0 


0.0 


Colon ca. Colo- 
205 


0.0 


0.0 


Ovarian ca. 
SK-OV-3 


0.0 


0.0 


Colon ca. SW-48 


0.0 


0.0 


Ovarian ca. 
OVCAR-4 


0.0 


0.0 


Colon Pool 


0.0 


0.0 


Ovarian ca. 
OVCAR-5 


0.0 


0.0 


Small Intestine 
Pool 


0.0 


8.9 


Ovarian ca. 
IGROV-1 


0.0 


0.0 


Stomach Pool 


0.0 


15.6 


Ovarian ca. 


0.0 


0.0 


Bone Marrow 


0.0 


0.0 
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OVCAR-8 




|Pool 








0.0 


7.5 IFetal Heart 


0.0 


0.0 


Breast ca. 


0.0 


0.0 JHeart Pool 


0.0 


0.0 


Breast ca. 
MJLJA-Md- 

231 


0.0 


0.0 


Lymph Node 
Pool 


0.0 


0.0 


Breast ca. BT 
549 


0.0 




Fetal Skeletal 
Muscle 


0.0 


0.0 


Breast ca. 
T47D 


0.0 


0.0 


Skeletal Muscle 
Pool 


0.0 


0.0 


Breast c a. 

MUA-N 


6.9 


0.0 


Spleen Pool 


0.0 


0.0 


Breast Pool 


0.0 


0.0 


Thymus Pool 


0.0 


0.0 


Trachea 


0.0 


0.0 


CNS cancer 
(glio/astro) U87- 
MG 


0.0 


0.0 


Lung 


0.0 


0.0 


CNS cancer 
(glio/astro) U- 
118-MG 


0.0 


12.5 


Fetal Lung 


0.0 


0.0 


CNS cancer 
(neuro;met) SK- 
N-AS 


0.0 


0.0 


Lung ca. 
NCI-N417 


0.0 


0.0 


CNS cancer 
(astro) SF-539 


0.0 


0.0 


Lung ca . LX- 
1 


0.0 


0.0 


CNS cancer 
(astro) SNB-75 


0.0 


0.0 


Lung ca . 
NCI-H146 


0.0 


0 0 


CNS cancer 
(glio)SNB-19 


0.0 


0.0 


Lung ca . 
SHP-77 


100.0 




CNS cancer 
(glio) SF-295 


0.0 


0.0 


Lung ca. 
A549 


0.0 


0 0 


Brain 

(Amygdala) Pool 


0.0 


0.0 


Lung ca. 
i-t-Dzo 


0.0 


0.0 


Brain 

(cerebellum) 


0.0 


0.0 


Lung ca. 
NV. i-HZi 


0.0 


0.0 


Brain (fetal) 


0.0 


0.0 


Lung ca. 
NCI-H460 


0 0 


0.0 


Brain 

(Hippocampus) 
Pool 


0.0 


0.0 


Lung ca. 
HOP-62 


0.0 


0.0 


Cerebral Cortex 
Pool 


0.0 


0.0 


Lung ca. 
N( I-H5 


0.0 


0.0 


Brain (Substantia 
nigra) Pool 


0.0 


0.0 


Liver 


0.0 


0.0 


Brain 

(Thalamus) Pool 


4.3 


0.0 
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Fetal Liver 


0 0 




Brain (whole) 






Liver ca. 
HepG2 


0.0 


0.0 


Spinal Cord Pool 


0.0 


0.0 


Kidney Pool 


0.0 


0.0 


Adrenal Gland 


0.0 


0.0 


Fetal Kidney 


11.1 


8.5 


Pituitary gland 
Pool 


0.0 


0.0 


Renal ca. 
786-0 


0.0 


0.0 


Salivary Gland 


0.0 


0.0 


Renal ca. 
A408 


0.0 


0.0 


Thyroid (female) 


0.0 


0.0 


Renal ca. 
ACHN 


0.0 


0.0 


Pancreatic ca. 
CAPAN2 


0.0 


0.0 


Renal ca. 
UO-31 


0.0 


0.0 


Pancreas Pool 


0.0 


0.0 



Table BLD . Panel 4D 



Tissue Name 


Rel. Exp.(%) 
Ag3404, Run 
165825947 


Tissue Name 


Rel. Exp.(%) 
Ag3404, Run 
165825947 


Secondaiy t h ; act 


0.0 


HUVEC IL-lbeta 


0 


Secondary Th2 act 


0.0 


HUVEC IFN gamma 


0.0 


Secondary Trl act 


0.0 


HUVEC TNF alpha + 
IFN ;amma 


0.0 


Secondary Thl rest 


0.0 


HUVEC TNF alpha + 
IL4 


0.0 


Seed clary Th2 rest 


0.0 


HUVEC IL-11 


0.0 


Secondary Trl rest 


0.0 


Lung Microvascular EC 
none 


0.0 


Primary Thl act 


0.0 


Lung Microvascular EC 
TNFalpha + IL-lbeta 


0.0 


Primary Th2 act 


0.0 


Microvascular Dermal 
EC none 


0.0 


Primary Trl act 


0.0 


Microsvasular Dermal 
EC TNFalpha + IL-lbeta 


0.0 


Primary Thl rest 


0.0 


Bronchial epithelium 
TNFalpha + IL1 beta 


0.0 


Primary Th2 rest 


0.0 


Small airway epithelium 
none 


0.0 


Primary Trl rest 


0.0 


Small airway epithelium 
TNFalpha + IL-lbeta 


0.0 


CD45RA CD4 
lymphocyte act 


0.0 


Coronery artery SMC rest 


0.0 


CD45RO CD4 
lymphocyte act 


0.0 


Coronery artery SMC 
TNFalpha + IL-lbeta 


0.0 
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CD8 lymphocyte act 


0.0 


Astrocytes rest 


0.0 


Secondary CD8 
lymphocyte rest 


0.0 


Astrocytes TNFalpha + 
IL-1 beta 


0.0 


Secondary CD8 
lymphocyte act 


0.0 


KU-812 (Basophil) rest 


0.0 


CD4 lymphocyte none 


0.0 


KU-812 (Basophil) 
PMA/ionomycin 


5.8 


2ry Thl/Th2/Trl_anti- 

PDOS PH1 1 


8.9 


CCD1106 

(Keratinocytes) none 


0.0 


LAK cells rest 


0.0 


CCD1 106 
(Keratinocytes) 
TNFalpha + IL-1 beta 


0.0 


LAK cells 1L-2 


0.0 


Livei cirrhosis 


100.0 


LAK cells IL-2+IL-12 


0.0 


Lupus kidney 


0.0 


LAK cells IL-2+IFN 
gamma 


0.0 


NCI-H292 none 


0.0 


LAK cells IL-2+ IL-18 


0.0 


NCI-H292 IL-4 


0.0 


LAK cells 
PMA/ionomycin 


0.0 [NCI-H292 IL-9 


0.0 


Nk Cells IL-2 rest 


0.0 JNCI-H292 IL-13 


0.0 


Two Way MLR 3 day 


0.0 NC I-H292 TFN gamma 


0.0 


Two Way MLR 5 day 


0.0 1HPAEC none 


0.0 


Two Way MLR 7 day 


0.0 


HPAEC TNF alpha + IL- 
1 beta 


0.0 


PRMC rest 


0.0 


Lung fibroblast none 


0.0 


PBMC PWM 


0.0 


Lung fibroblast TNF 
alpha + IL-1 beta 


0.0 




0.0 


Lung fibroblast IL-4 




Ramos (B cell) none 


0.0 


Lung fibroblast IL-9 


0.0 


Ramos (B cell) 
ionomycin 


0.0 


Lung fibroblast IL-13 


0.0 


B lymphocytes PWM 


0.0 


Lung fibroblast IFN 
gamma 


0.0 


B lymphocytes CD40L 
and IL-4 


„ n jDermal fibroblast 
jCCD 1070 rest 


0.0 


EOL-1 dbcAMP 


ft „ JDermal fibroblast 

C CD 1070 TNF alpha 


0.0 


EOL-1 dbcAMP 
PMA/ionomycin 


„ „ jDermal fibroblast 
(CCD 1070 IL-1 beta 


0.0 


Dendritic cells none 


^ q jDermal fibroblast I FN 

jgamma 


0.0 


Dendritic cells LPS 0.0 Dermal fibroblast II. -4 3.9 


Dendritic cells anti- 
CD40 


0.0 jlBD Colitis 2 j 7.2 


Monocytes rest 


O.o Mil J Crohn's 4.0 



830 



Monocytes LPS 


0.0 


Colon 


5.5 


Macrophages rest 


0.0 


Lung 


3.1 


Macrophages LPS 


0.0 


Thymus 


0.0 


HUVEC none 


0.0 


Kidney 


0.0 


HUVEC starved 


0.0 







CNS_neurodegeneration_vl.0 Summary: Ag3398/Ag3404 Expression of the CG59222- 
01 gene is low/undetectable (CT values > 35) across the samples in this panel. (Data not 
shown.) 



General_screening_panel_vl.4 Summary: Ag3398/Ag3404 Two experiments with two 
different probe and primer sets produce results that are in excellent agreement, with 
significant expression of the CG59222-01 gene exclusively in a lung cancer cell line 
sample (CT = 33.8). Therefore, expression of this gene may be used to this sample from 
other samples on this panel and as a marker for lung cancer. Furthermore, therapeutic 
modulation of the activity of the GPCR encoded by this gene may be beneficial in the 
treatment of lung cancer. 

Panel 4D Summary: Ag3404 Highest expression of the CG59222-01 gene is seen in the 
liver ciirhosis sample (CT=32.65). Thus, expression of this gene could be used to 
differentiate between this sample from the other samples on this panel and as a marker to 
detect tltie presence of liver cirrhosis. Furthermore, expression of this gene is not detected in 
normal liver in Panel 1 .4, suggesting that its expression is unique to liver cirrhosis. This 
gene encodes a putative GPCR; therefore, antibodies or small molecule therapeutics could 
reduce or inhibit fibrosis that occurs in liver cirrhosis. In addition, antibodies to this 
putative GPCR could also be used for the diagnosis of liver cirrhosis. Ag3398 Expression 
of CG59222-01 gene is low/undetectable (CTs > 35) across all of the samples on this panel 
(Data not shown). 

BM. CG59220-01: GPCR 

Expression of gene CG59220-01 was assessed using the primer-probe set Ag3402, 
described in Table BMA. Results of the RTQ-PCR runs are shown in Tables BMB, BMC 
and BMD. 

Table BMA . Probe Name Ag3402 
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Primers 


Sequences 


Length 


Start 
Position 


SEQID 

NO: 


Forward 


5 * -ctccacacacccatgtacttct-3 ' 


22 


160 


596 


Probe 


TET-5 ' -cttggatctctgccattcctctgtca-3 ' - 
TAMRA 


26 


201 


597 


Reverse- 


5 ' -aggaggttctccaacagcttag-3 ' 


22 


233 


598 



Table BMB . CNS_neurodegeneration_vl.O 



Tissiue Name 


Rel. Exp.(%) Ag3402, 
Run 210349784 


Tissue Name 


Run 210349784 


AD 1 Hippo 


9.0 


Control (Path) 3 
Temporal Ctx 


18.6 


AD 2 Hippo 


46.3 


Control (Path) 4 
Temporal Ctx 


56.6 


AD 3 H ippo 


8.8 


AD 1 Occipital Ctx 


21.6 


AD 4 Hippo 


24.1 


AD 2 Occipital Ctx 
(IVlissing) 


0.0 


AD 5 Hippo 


52.0 


AD 3 Occipital Ctx 


15.0 


AD o Hippo 


39.2 


AD 4 Occipital Ctx 


48.0 


Control 2 Hippo 


41.8 


AD 5 Occipital Ctx 


56.3 


Control -1 f [ippo 


18.2 


AD 6 Occipital Ctx 


31.4 


Control (Path) 3 
Hippo 


13.7 


Control 1 Occipital 
Ctx 


14.4 


AD 1 Temporal 
Ctx 


27.4 


Control 2 Occipital 
Ctx 


55.5 


AD 2 Temporal 
Ctx 


57.0 


Control 3 Occipital 
Ctx 


42.3 


AD 3 Temporal 
Ctx 


10.2 


Control 4 Occipital 
Ctx 


21.2 


AD 4 Temporal 
Ctx 


54.3 


Control (Path) 1 
Occipital Ctx 


100.0 


AD 5 Inf Temporal 
Ctx 


42.9 


Control (Path) 2 
Occipital Ctx 


27.7 


AD 5 S up 
Temporal Ctx 


28.7 


< out! ■! (Path) 3 
Occipital Ctx 


9.0 


AD 6 Inf Temporal 
Ctx 


41.8 


Control (Path) 4 
Occipital Ctx 


34.2 


AD 6 Sup 
Temporal Ctx 


51.4 


Control 1 Parietal 
Ctx 


23.5 


Control 1 
Tempor al Ctx 


18.2 


Control 2 Parietal 
Ctx 


28.5 


Control 2 
Temporal Ctx 


43.2 


Control 3 Parietal 
Ctx 


29.5 


Control 3 


32.8 


Control (Path) 1 


99.3 
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1 emporal Ctx 




Parietal Ctx 




Control 3 
Temporal Ctx 


20.0 


Control (Path) 2 
Parietal Ctx 


46.7 


Control (Path) 1 
Temporal Ctx 


87.1 


Control (Path) 3 
Parietal Ctx 


12.0 


Control (Path) 2 
Temporal Ctx 


60.3 


Control (Path) 4 
Parietal Ctx 


82.4 



Table BMC . General_screening_panel_vl.4 



Tissue Name 


Rel. Exp.(%) 
Ag3402,Run 
216823314 


Tissue Name 


Rel. Exp.(%) 
Ag3402, Run 
216823314 


Adipose 


14.7 


Renal ca. TK-10 


2.5 


Melanoma* 

T J,,/" OO/ A \ T 

1 IS(>f>6(A). 1 


10.1 


Bladder 


28.7 


Melanoma* 
riSooo(Jt> ). I 


1.4 


Gastric ca. (liver met.) 
NCI-N87 


17.9 


Melanoma* Ml 4 


0.9 


Gastric ca. KATO 111 


0.7 


Melanoma* 
LOXIMVI 


0.8 


Colon ca. SW-948 


0.0 


Melanoma* SK- 
MEL-5 


2.1 


Colon ca. SW480 


1.7 


Squamous cell 
carcinoma SCC-4 


2.9 


Colon ca * fSW~S80 
met) S W620 


1.8 


Testis Pool 


25.9 


Colon ca. HT29 


1.3 


Prostate ca.* (bone 
met) PC-3 


1.2 


Colonel HCT-116 


0.0 


Pro s ttt Po( 


16.0 


Colon ca. CaCo-2 


0.7 


Placenta 


1.3 


Colon cancer tissue 


7.4 


1 iterus \>ol 


15.6 


Colon ca.SWl 116 


0.0 


Ovarian ca. 
OVCAR-3 


3.1 


Colon ca. Colo-205 


0.0 


Ovarian ca. SK- 
OV-3 


6.0 


Colon ca. SW-48 


0.0 


Ovarian ca. 
OVCAR-4 


0.6 


Colon Pool 


49.7 


Ovarian ca. 
OVCAR-5 


2.9 


Small Intestine Pool 


53.6 


Ovarian ca. 
IGROV-1 


1.6 


Stomach Pool 


15.8 


Ovarian ca. 
OVCAR-8 


1.6 


Bone Marrow Pool 


14.5 


Ovary 


48.3 


Fetal Heart 


9.1 


Breasi ,a. MCF-7 


0.6 


Heart Pool 


39.5 
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Breast ca. MDA- 
MB-231 


0.0 


Lymph Node Pool 


39.2 


Breast ca. BT 549 


2.4 


Fetal Skeletal Muscle 


5.8 


Breast ca. T47D 




Skeletal Muscle Pool 


47 6 


Breast ca. MDA-N 


0.4 


Spleen Pool 


22.2 


Breast Pool 


31.6 


Thymus Pool 


12.9 


Trachea 


5.9 


CNS cancer 
(glio/astro) U87-MG 


0.8 


Lung 


16.6 


CNS cancer 
(glio/astro) U-118-MG 


3.4 


Fetal Lung 


30.8 


CNS cancer 
(neuro;met) SK-N-AS 


1.5 


Lung ca. NCI-N417 


0.0 


CNS cancer (astro) SF- 
539 


0.0 


Lung ca . LX-1 


9.7 


CNS cancer (astro) 
SNB-75 


1.1 


Lung ca. NCI-H146 


0.0 


CNS cancer (glio) 
SNB-19 


1.1 


Lung ca. SHP-77 


0.6 


CNS cancer (glio) SF- 
295 


4.1 


Lung ca. A 549 


0.6 


Brain (Amygdala) Pool 


28.1 


Lung ca. NCI-H526 


0.0 


Brain (cerebellum) 


35 1 


Lung ca. NCI-H23 


5.9 


Brain (fetal) 


12.3 


Lung ca . NCI-H460 


0.9 


Brain (Hippocampus) 
Pool 


39.8 


Lung ca. HOP-62 


1.6 


Cerebral Cortex Pool 


88.9 


T una "Nr'T-HS?'? 


7 1 


Brain (Substantia 
nigrj) Pool 


43.2 


Liver 


0.0 


Brain (Thalamus) Pool 


74.2 


Fetal Liver 
Liver ca. HepG2 


2.5 


Brain (whole) 


26.1 


0.7 


Spinal Cord Poo! 


100.0 


Kidney Pool 


51.8 


\d • if Gland 


24.1 


Fetal Kidney 


16.7 


Pituitary gland Pool 


8.6 


Renal ca. 786-0 


0.8 


Salivary Gland 


3.2 


Renal ca. A498 


0.0 


Thyroid (female) 


8.9 


Renal ca. ACHN 


1.4 


Pancreatic ca. 
CAPAN2 


0.8 


Renal ca. UO-31 


1.5 


Pancreas Pool 


24.8 



Table BMP . Panel 4D 





Rel. Exp.(%) 




Rel. Exp.(%) 


Tissue Name 


Ag3402, Run 


Tissue Name 


Ag3402, Run 




165825209 




165825209 
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Secondary Thl act 


0.0 


HUVEC IL-1 beta 


3.5 


Secondary Th2 act 


0.0 


HUVEC I FN gamma 


6.5 


Secondary Trl act 


3.3 


HUVEC TNF alpha + 
IFN gamma 


15.4 


Secondary Thl rest 


0.0 


HUVEC TNF alpha + 
IL4 


2.9 


Secondary Th2 rest 


5.2 


HUVEC IL-1 1 


9.0 


Secondary Trl rest 


0.0 


Lung Microvascular EC 
none 


2.7 


Primary Thl act 


5.4 


Lung Microvascular EC 
TNFalpha + IL-1 beta 


0.0 


Primary Th2 act 


0.0 


Microvascular Dermal 
EC none 


3.0 


Primary Trl act 


0.0 


Microsvasular Dermal 
EC TNFalpha + IL-1 beta 


10.7 


Primary Thl rest 


3.3 


Bronchial epithelium 
TNFalpha + IL1 beta 


18.0 


Primary Th2 rest 


5.1 


Small airway epithelium 


0.0 


Primary Trl rest 


0.0 


Small airway epithelium 

TMTTcilnh'* -4- TT 1 Wo 
lINraipila i 1L,- IDcla 


79.0 


CD45RA CD4 


2.7 


Coronery artery SMC rest 


0.0 


CD45RO CD4 
lymphocyte act 


0.0 


Coronery artery SMC 

TXTPalr-iha 4- TT _1 hipta 
1 LSr aipila i 1JU-1 DtUd 


2.5 


CD8 lymphocyte act 


0.0 


Astrocytes rest 


11.3 


Secondary CD8 
lymphocyte rest 


3.0 


Astrocytes TNFalpha + 
IL-1 beta 


75.3 


Secondary CD8 
lymphocyte act 


0.0 


KU-812 (Basophil) rest 


2.6 


CD4 lymphocyte none 


0.0 


- 

KU-812 (Basophil) 
PMA/ionomycin 


6.8 


2ryThl/Th2/Trl anti- 
CD95 CH11 


1.6 


CCD1106 

(Keratinocytes) none 


0.0 


LAK cells rest 


0.0 


CCD 1 106 
(Keratinocytes) 
TNFalpha + IL-1 beta 


27.9 


LAK cells 1L-2 


5 1 


Liver cirrhosis 


90.8 


LAK cells IL-2+IL-12 


3.0 


Lupus kidney 


51.8 


LAK cells IL-2+IFN 
gamma 


4.5 


NCI-H292 none 


28.1 


LAK cells IL-2+ IL-1 8 


23.2 


NCI-H292 IL-4 


10.6 


LAK cells 
PMA/ionomycin 


0.0 


NCI-H292 IL-9 


23.5 


NK Cel ls IL-2 rest 


6.2 


NCI-H292 IL-1 3 


4.8 
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Two Way MLR 3 day 


12.1 


NCI-H292 I FN gamma 


10.3 


Two Way MLR 5 day 


3.Y 


HPAEC none 


5.8 


Two Way MLR 7 day 


5.8 


HPAEC TNF alpha + IL- 
1 beta 


0.0 


PBMC rest 


0.0 


Lung fibroblast none 


3.7 


PBMC PWM 


0.0 


Lung fibroblast TNF 
alpha + IL-1 beta 


0.0 


PBMC PHA-L 


0.0 

_ _ 


Lung fibroblast IL— 4 


0 0 


Ramos (B cell) none 




Lung fibroblast IL-9 


6.4 


Ramos (B cell) 
ionomycin 


0.0 


Lung fibroblast IL-1 3 


3.2 


B lymphocytes PWM 


2.6 


Lung fibroblast IFN 
gamma 


10.5 


B lymphocytes CD40L 
and IL-4 


1.8 


Dermal fibroblast 
CCD 1070 rest 


0.0 


EOL-1 dbcAMP 


0.0 


Dermal fibroblast 


13.7 


PMA/icnomycin 


0.0 


Dermal fibroblast 
CCD 1070 IL-1 beta 


0.0 


Dendritic cells none 


6.5 


Dermal fibroblast IFN 
gamma 


0.0 


Dendritic cells LPS 


4.7 


Dermal fibroblast IL-4 


6.3 I 


Dendritic cells anti- 
CD40 


0.0 


IBD Colitis 2 


27.0 


Monocytes rest 


24.8 


IBD Crohn's 


9.3 


Monocj'tes LPS 


3.1 


Colon 


100.0 


Macrophages rest 


4.0 


Lung 


24.7 


Macrophages LPS 


0.0 


Thymus 


59.5 


HUVEC none 


6.5 


Kidney 


27.0 


HUVEC starved 


41.2 







CNSneurodegenerationvl.O Summary: Ag3402 The CG59220-01 gene is expressed 
at low levels throughout the CNS, including in amygdala, substantia nigra, thalamus, 
cerebellum, cerebral cortex, and spinal cord. The GPCR family of receptors contains a 
large number of neurotransmitter receptors, including the dopamine, serotonin, a and b- 
5 adrenergic, acetylcholine muscarinic, histamine, peptide, and metabotropic glutamate 

receptors. GPCRs are excellent drug targets in various neurologic and psychiatric diseases. 
All antipsychotics have been shown to act at the dopamine D2 receptor; similarly novel 
antipsychotics also act at the serotonergic receptor, and often the muscarinic and adrenergic 
receptors as well. While the majority of antidepressants can be classified as selective 
10 serotonin reuptake inhibitors, blockade of the 5-HT1A and a2 adrenergic receptors 
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increases the effects of these drugs. The GPCRs are also of use as drug targets in the 
treatment of stroke. Blockade of the glutamate receptors may decrease the neuronal death 
resulting from excitotoxicity; further more the purinergic receptors have also been 
implicated as drug targets in the treatment of cerebral ischemia. The b-adrenergic receptors 
5 have been implicated in the treatment of ADHD with Ritalin, while the a-adrenergic 
receptors have been implicated in memory. Therefore this gene may be of use as a small 
molecule target for the treatment of any of the described diseases. 

Generall_screening_panel_vl.4 Summary: Ag3402 The CG59220-01 gene represents a 
novel G-protein coupled receptor (GPCR) with highest expression in spinal cord sample 
10 (CT=31 .12) and moderate expression in other samples from brain. Please see Panel 

CNS_ne urodegeneration_vl .0 for discussion of utility of this gene in the central nervous 
system. 

Low levels of expression of the CG59220-01 gene are also observed in areas 
outside of the central nervous system such as the, adipose tissue, fetal and adult heart, 
15 skeletal muscle, adrenal gland, pituitary gland, and thyroid suggesting the possibility of a 
wider role in intercellular signaling. Therapeutic modulation of the expression or function 
of this gene may therefore be useful in the treatment of metabolic disorders, including 
obesity and diabetes. 

Panel 4D Summary: Ag3402 The CG59220-01 gene represents a novel G-protein 
20 coupled receptor (GPCR) with highest expression in colon (CT=33.12). Thus expression of 
this gene can be used to distinguish these samples from other samples used in this panel. In 
addition, expression of this gene is low/undetectable (CT values > 35) in samples derived 
from IBD colitis and IBS Crohn's. Therefore, expression of this gene can be used to 
distinguish normal colon sample from the IBD colitis and IBD Crohn's sample used in this 
25 panel. 

BN. CG 59218-01: GPCR 

Expression of gene CG5921 8-01 was assessed using the primer-probe set Ag3401, 
described in Table BNA. Results of the RTQ-PCR runs are shown in Tables BNB. 

Table BNA . Probe Name Ag3401 
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Primers 


Sequences 


Length 


Start 
Position 


SEQH) 
NO: 


Forward 


5 ' -gctggctactaggtttctcctt-3 ' 


22 


447 


599 


Probe 


TET-5 ' -atcatcatgcctgtcatcctgaccag-3 ' - 
TAMRA 


26 


470 


600 


Reverse 


5 ' -ttgatgtgggtatcacagaatg-3 ' 


22 


504 


601 



Table BNB . Panel 4D 



Tissue Name 


Rel. Exp.(%) 
Ag3401, Run 
165825154 


Tissue Name 


Rel. Exp.(%) 
Ag3401, Run 
165825154 


Secondary Thl act 


2.4 


HUVEC IL-lbeta 


0.0 


Secondary Th2 act 


3.4 


HUVEC IFN gamma 


0.0 


Secondary Trl act 


8.4 


HUVEC TNF alpha + 
IFN gamma 


0.0 


Secondary Thl rest 


1 (HUVEC TNF alpha + 
IIL4 


0.0 


Secondary Th2 rest 


0.0 jHUVEC IL-1 1 


0.0 


Secondary Trl rest 


0 q iLung Microvascular EC 

{none 


0.0 


Primary Thl act 


0.0 


Lung Microvascular EC 
TNFalpha + IL-lbeta 


0.0 


Primary Th2 act 


3.1 


Microvascular Dermal 
EC none 


0.0 


Primary Trl act 


0.0 


Microsvasular Dermal 
EC TNFalpha + IL-lbeta 


0.0 


Primary Thl rest 


8.1 


Bronchial epithelium 
TNFalpha + ILlbeta 


0.0 


Primary Th2 rest 


0.0 


Small airway epithelium 
none 


0.0 


Primary Trl rest 


3.3 


Small airway epithelium 
TNFalpha + IL-lbeta 


0.0 


CD45RA CD4 
lymphocyte act 


0.0 


Coronery artery SMC rest 


0.0 


CD45RO CD4 
lymphocyte act 


0.0 


Coronery artery SMC 
TNFalpha + IL-lbeta 


0.0 


CDS lymphocyte act 


8.1 


Astrocytes rest 


5.7 


Secondary CD8 
lymphocyte rest 


0.0 


Astrocytes TNFalpha + 
IL-lbeta 


0.0 


Secondary CDS 
lymphocyte act 


1.0 


KU-812 (Basophil) rest 


0.0 


CD4 lymphocyte none 


0.0 


KU-812 (Basophil) 
PMA/ionomycin 


0.0 


2ry Thl /Th2/Trl anti- 


5.7 


CCD 1106 


0.0 
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CD95 CHI 1 




(Keratinocytes) none 




LAK ce lls rest 


0.0 


CCD 1106 
(Keratinocytes) 
TNFalpha + IL-lbeta 


0.0 


LAK cells II. -2 


3.1 


Liver cirrhosis 


100.0 


LAK cells IL-2+IL-12 


3.2 


Lupus kidney 


0.0 


LAK cells IL-2+IFN 
gamma 


8.0 


NCI-H292 none 


0.0 


LAK cells IL-2+ IL-18 


4.3 


^JCI-H292 IL-4 


0.0 


LAK cells 
PMA/ionomycin 


2.1 


NCI-H292 IL-9 


0.0 


NK Cells IL-2 rest 


0 0 


NCI-H292 IL-13 


0.0 


Two Way MLR 3 day 


0.0 


NCI-H292 I FN gamma 


0.0 


Two Way MLR 5 day 


0.0 


HPAEC none 


0.0 


Two Way MLR 7 day 


0.0 


HPAEC TNF alpha + IL- 
1 beta 


0.0 


PBMC rest 


0.0 


Lung fibroblast none 


0.0 


PBMC PWM 


3.3 


I iinrr fiKrr\Klcjct TMP 

i^ung iiurouiasi iiNr 
alpha + IL-1 beta 


0.0 


1 L>1VII_. Ill f\ l.j 


0 0 


Lung fibroblast IL-4 




Ramos (B cell) none 


0.0 


Lung fibroblast IL-9 


0.0 


Ramos (B cell) 
ionomycin 


0.0 


Lung fibroblast IL-13 


0.0 


B lymphocytes PWM 


3.4 


Lung fibroblast IFN 
gamma 


0.0 


a lympnocytes CD40L 
and IL-4 


0.0 


Dermal fibroblast 
CCD 1070 rest 


2.5 


EOL-1 dbcAMP 


0.0 


Dermal fibroblast 
CCD 1070 TNF alpha 


14.8 


EOL-1 dbcAMP 
PMA/ionomycin 


0.0 


Dermal fibroblast 
CCD1 070 IL-1 beta 


0.0 


Dendritic cells none 


6.0 


Dermal fibroblast IFN 
gamma 


0.0 


Dendritic cells LPS 


0.0 


Dermal fibroblast IL-4 


0.0 


Dendritic cells anti- 
CD40 


0.0 


IBD Colitis 2 


17.1 


Monocytes rest 


0.0 


IBD Crohn's 


0.0 


Monocytes LPS 


6.1 


Colon 


0.0 


Macrophages rest 


0.0 


Lung 


0.0 


> I;" i ophages LPS 


0.0 


Thymus 


0.0 


Hi VIC none 


0.0 


Kidney 


3.2 


HUVEC starved 


0.0 
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CNS_neurodegeneration_vl.O Summary: Ag3401 Expression of the CG59218-01 gene 
is low/umdetectable (CTs > 35) across all of the samples on this panel (data not shown). 

General_screening_panel_vl.4 Summary: Ag3401 Expression of the CG59218-01 gene 
is low/undetectable (CTs > 35) across all of the samples on this panel (data not shown). 
5 This gene product is most similar to members of the odorant receptor subfamily of GPCRs. 
Based on analogy to other odorant receptor genes, we predict that expression of this gene 
may be highest in nasal epithelium, a sample not represented on this panel. 

Panel 4D Summary: Ag3401 Highest expression of the CG592 18-01 gene is seen in the 
liver cirrhosis sample(CT=33.03). Thus, expression of this gene could be used to 
differentiate between this sample from the other samples on this panel and as a marker to 
detect the presence of liver cirrhosis. Furthermore, expression of this gene is not detected in 
normal liver in Panel 1 .4, suggesting that its expression is unique to liver cirrhosis. This 
gene encodes a putative GPCR; therefore, antibodies or small molecule therapeutics could 
reduce or inhibit fibrosis that occurs in liver cirrhosis. In addition, antibodies to this 
putative GPCR could also be used for the diagnosis of liver cirrhosis. 

BO. CG59211-01: GPCR 

Expression of gene CG5921 1-01 was assessed using the primer-probe set Ag3397, 
described in Table BOA. Results of the RTQ-PCR runs are shown in Table BOB. 



Table B OA . Probe Name Ag3397 



Primers 


Sequences 


Length 


Start 
Position 


SEQID 

NO: 


Forward 


5 ' - tcacaggcatcctagacttgac- 3 ' 


22 


645 


602 


Probe 


TET-5' -tcatgtcctacatgttgatactgaaagca-3 ' - 
TAMRA 


29 


675 


603 


Reverse 


5 ' -tttcttgatgctatgctcaaca-3 1 


22 


705 


604 



20 Table BOB . General screen ing_panel_v 1 .4 



Tissue Name 


Rel. Exp.(%) 
Ag3397, Run 
216822307 


Tissue Name 


Rel. Exp.(%) 
Ag3397, Run 
216822307 


Adipose 


0.0 


Renal ca.TK-10 


0.0 


Melanoma* 
Hs688(A). 1' 


0.0 


Bladder 


0.0 
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Melanoma* 
Hs688(B).T 


0.0 


Gastric ca. (liver met.) 
NCI-N87 


0.0 


Melanoma* Ml 4 


0.0 


Gastric ca. KATO III 


0.0 


Melanoma* 
LOXIMVI 


0.0 

_ __ 


Colon ca. SW-948 


0.0 


Melanoma* SK- 
MEL-5 


0.0 


Colon ca. SW480 


0.0 


Squamous cell 
carcinoma SCC-4 


0.0 


Colon ca.* (SW480 
met) SW620 


0.7 


Testis Pool 


0.0 


Colon ca. HT29 


On 


Prostate ca.* (bone 
met) PC-3 


0.0 


Colon ca. HCT-116 


0.0 


Prostate Pool 


0.0 


Colon ca. CaCo-2 


0.0 


Placenta 


0.0 


Colon cancer tissue 


0.0 


Uterus Pool 


0.0 


Colon ca. SWT 116 


0.0 


Ovarian ca. 
OVCAR-3 


0.0 


Colon ca. Colo-205 


0.0 


Ovarian ca. SK- 
OV-3 


0.0 


Colon ca. SW-48 


0.0 


Ovarian ca. 
OVCAR-4 


0.0 


Colon Pool 


0.6 


Ovarian ca. 
OVCAR-5 


0.0 


Small Intestine Pool 


0.0 


Ovarian ca. 
IGROV-1 


0.0 


Stomach Pool 


0.0 


Ovarian ca. 
OVCAR-8 


Bone Marrow Pool 


0.0 


Ovary 0.0 


Fetal Heart 


0.5 


Breast ca. MCF-7 


0.0 


Heart Pool 


0.5 


Breast ca. MDA- 
MB-23! 


0.0 


Lymph Node Pool 


0.0 


Breast ca. BT 549 


0.0 


Fetal Skeletal Muscle 


0.0 


Breast ca. T47D 


0.0 


Skeletal Muscle Pool 


0.0 


Breast ca. MDA-N 


0.0 


Spleen Pool 


0.0 


Breast Pool 


0.6 


Thymus Pool 


0.0 


Trachea 


0.0 


CNS cancer 
(glio/astro) U87-MG 


0.0 


Lung 


0.0 


CNS Ctinccr 
(glio/astro) U-118-MG 


0.0 


Fetal Lung 


0.0 


CNS cancer 
(neuro;met) SK-N-AS 


0.8 


Lung ca. NCI-N417 


0.0 


CNS cancer (astro) SF- 
539 


0.0 


Lungca. LX-1 


0.0 


CNS cancer (astro) 
SNB-75 


0.0 
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Lung ca. NCI-H146 


2.6 


CNS cancer (glio) 
SNB-19 


0.0 


Lung ca. SHP-77 


100.0 


CNS cancer (glio) SF- 
295 


0.0 


Lung ca. A549 


0.0 


Brain (Amygdala) Pool 


0.0 


Lung ca. NCI-H526 


0 0 


Brain (cerebellum) 


0.0 


Lung ca.NCI-H23 


0.0 


Brain (fetal) 


(Ml 


Lung ca. NCI-H460 


0.0 


Brain (Hippocampus) 
Pool 


0.0 


Lung ca. HOP-62 


0.0 


Cerebral Cortex Pool 


0.0 


Lung Lei.. iN^i-jn jzz 


' 


Brain (Substantia 
nigra) Pool 


0.0 


Liver 


0.0 


Brain (Thalamus) Pool 


0.0 


Fetal Liver 


0.0 


Brain (whole) 


()') 


Liver ca. HepG2 


0.0 


Spinal Cord Pool 


0.0 


kidhev Po( 1 


0.9 


;dre ml Gland 


0.0 


Fetal Kidney 


3.3 


Pituitary gland Po,^ 


0.0 


Renal ca. 786-0 


0.0 


Salivary Gland 


0.0 


Renal ca. A498 


0.0 


rhyroid (female) 


0.0 


Renal ca. ACHN 


0.0 


Pancreatic ca. 

CAPAN2 


0.0 


Renal ca. UO-31 


0.0 


Pancreas Pool 


0.0 



CNS_neurodegeneration_vl.O Summary: Ag3397 Expression of the CG5921 1-01 gene 
is low/undetectable (CT values > 35) across the samples in this panel. (Data not shown.) 
This gene encodes a G protein-coupled receptor (GPCR), a type of cell surface receptor 
involved in signal transduction. It is most similar to members of the odorant receptor 
5 subfamily of GPCRs. Based on analogy to other odorant receptor genes, we predict that 
expression of this gene may be highest in nasal epithelium, a sample not represented on this 
panel. 

General_screening_panel_vl.4 Summary: Ag3397 Significant expression of the 
CG59211-01 gene is seen exclusively in one of the lung cancer sample (CT = 32.29). 

1 0 Therefore, expression of this gene may be used to distinguish this sample from other 
samples on this panel and as a marker for lung cancer. There is an increasing awareness 
that some GPCRs can regulate proliferative signaling pathways and that chronic 
stimulation or mutational activation of receptors can lead to oncogenic transformation. 
Activating mutations in GPCRs are associated with several types of human tumors and 

15 some receptors exhibit potent oncogenic activity due to agonist overexpression (Whitehead 
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et al., 2001). Therefore, therapeutic modulation of the activity of the GPCR encoded by this 
gene may be beneficial in the treatment of lung cancer. 

References: 

1 . Whitehead IP, Zohn IE, Der CJ. (2001) Rho GTPase-dependent transformation 
by G protein-coupled receptors. Oncogene 2001 Mar 26;20(1 3): 1547-55 

Panel 4D Summary: Ag3397 Expression of the CG59211-01 gene is low/undetectable 
(CT values > 35) across the samples in this panel. (Data not shown.) 

BP. CG59276-01: Dihydroorotate dehydrogenase 

Expression of gene CG59276-01 was assessed using the primer-probe set Ag3524, 
described in Table BPA. Results of the RTQ-PCR runs are shown in Tables BPB, BPC, 
BPD, BPE and BPF. 



Table EPA . Probe Name Ag3524 



Primers 


Sequences 


Length 


Start 
Position 


SEQID 
NO: 


Forward 


5 ' -ttcaggcactgttctttgactt-3 ' 


22 


1439 


605 


Probe 


TET- 5 ' -aacagattttgcaacactttccaagg-3 1 - 
TAMRA 


26 


1472 


606 


Reverse 


5 1 -tgagggagtggtaacactgtgt-3 ' 


22 


1498 


607 



Table BPB . CNS neurodegeneration vl.O 



Tissue Name 


Rel. Exp.(%) Ag3524, 
Run 206915926 


Tissue Name 


Rel. Exp.(%) Ag3524, 
Run 206915926 


AD 1 Hippo 


26.8 


Control (Path) 3 
Temporal Ctx 


18.9 


AD 2 Hippo 


30.8 


Control (Path) 4 
Temporal Ctx 


51.4 


AD 3 Hippo 


25.3 


AD 1 Occipital Ctx 


37.1 


AD 4 Hippo 


38.2 


AD 2 Occipital Ctx 
(Missing) 


0.0 


AD 5 Hippo 


77.4 


AD 3 Occipital Ctx 


16.5 


AD 6 Hippo 


83.5 


AD 4 Occipital Ctx 


40.9 


Control 2 Hippo 


43.8 


AD 5 Occipital Ctx 


33.9 


Control 4 Hippo 


20.6 


AD 6 Occipital Ctx 


17.0 


( onli -1 (Path) 3 


17.6 


Control 1 Occipital 


6.3 
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Hippo 




Ctx 




AD 1 Temporal 
Ctx 


37.1 


Control 2 Occipital 
Ctx 


74.7 


aij z i emporai 
Ctx 


31.2 


Control 3 Occipital 
Ctx 


22.1 


AD 3 Temporal 
Ctx 


6.5 


Control 4 Occipital 
Ctx 


25.7 


AD 4 Temporal 
Ctx 


79.6 


Control (Path) 1 
Occipital Ctx 


89.5 


AD 5 Inf Temporal 
Ctx 


97.9 


Control (Path) 2 
Occipital Ctx 


18.4 


AD 5 Sup 
Temporal Ctx 


56.3 


Control (Path) 3 
Occipital Ctx 


11.2 


AD 6 Inf Temporal 
Ctx 


100.0 


Control (Path) 4 
Occipital Ctx 


24.8 


AD 6 Sup 
Temporal Ctx 


66.9 


Control 1 Parietal 
Ctx 


26.8 


Control 1 
Temporal Ctx 


10.6 


Control 2 Parietal 
Ctx 


68.3 


Control 2 
Temporal Ctx 


13.4 


Control 3 Parietal 
Ctx 


28.1 


Control 3 
Temporal Ctx 


25.9 


Control (Path) 1 
Parietal Ctx 


58.6 


Control 3 
Temporal Ctx 


36.6 


Control (Path) 2 
Parietal Ctx 


51.4 


Control (Path) 1 
Temporal Ctx 


80.7 


Control (Path) 3 
Parietal Ctx 


6.9 


Control (Path) 2 
Temporal Ctx 


76.8 


Control (Path) 4 
Parietal Ctx 


54.7 



Table BPC . Generalscreeningjpanel_vl.4 



Tissue Name 


Rel. Exp.(%) 
Ag3524, Run 
213390931 


Tissue Name 


Rel. Exp.(%) 
Ag3524, Run 
213390931 


Adipose 


6.2 


Renal ca. TK-10 


9.3 


Melanoma* 
Hs688(A).T 


3.1 


Bladder 


22.1 


Melanoma* 
Hs688(B).T 


9.2 


Gastric ca. (liver met.) 
NCI-N87 


24.8 


Melanoma* Ml 4 


0.9 


Gastric ca. KATO III 


0.0 


Melanoma* 
LOXIMVI 


0.4 


Colon ca. SW-948 


1.2 


Melanoma* SK- 
MFI -5 


0.7 


Colon ca. SW480 


4.2 
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Squamous cell 
carcinoma SCC-4 


0.0 


Colon ca.* (SW480 
met) SW620 


10.5 


Testis Pool 


11.3 


Colon ca. 11T29 


1.1 


Prostate ca.* (bone 
met) PC-3 


3.1 


Colon ca. HCT-116 


5.3 


Prostate Pool 


8.7 


Colon ca. CaCo-2 


8 5 


Placenta 


1.3 


Colon cancer tissue 


7.2 


Uterus Pool 


2.9 


Colon ca.SWl 116 


1.4 


Ovarian ca. 
OVCAR-3 


6.9 


Colon ca. Colo-205 


1.6 


Ovarian ca. SK- 
0\-i 


11.7 


Colon ca. SW-48 


1.7 


Ovarian ca. 
OVCAR-4 


0.2 


Colon Pool 


12.6 


Ovsricint cb. 
OVCAR-5 


8.5 


Small Intestine Pool 


20.9 


IGROV-1 


2.5 


Stomach Pool 


13.2 


Ovarian ca. 
OVCAR-8 


4.1 


Bone Marrow Pool 


8.7 


Ovar> 


15.3 


Fetal Heart 


9.5 


Breast ca. MCF-7 


3.4 


Heart Pool 


11.3 


Breast ca. MDA- 
MB-23 1 


3.1 


Lymph Node Pool 


27.2 


Breast ca. BT 549 


23.5 


Fetal Skeletal Muscle 


9.1 


Breast ca. T47D 


19.5 


Skeletal Muscle Pool 


6.7 


Breast ca. MDA-N 


0.3 




8.9 


Breast Pool 


25.2 


Thymus Pool 


18.2 


Trachea 


12.6 


CNS cancer 
(glio/astro) U87-MG 


8.7 


Lung 


13.3 


CNS cancer 
(glio/astro) U-118-MG 


7.5 


Fetal Lung 


41.8 


CNS cancer 
(neuro;met) SK-N-AS 


21.3 


Lung ca. NCI-N417 


0.0 


CIMb cancer (astro) br- 
539 


2.4 


Lung ca. LX-1 


■ 

32.1 


CNS cancer (astro) 
SNB-75 


97.9 


Lung ca. NCI-H146 


1.4 


CNS cancer (glio) 
SNB-19 


3.0 


Lung ca. SHP-77 


2.3 


CNS cancer (glio) SF- 
295 


26.1 


Lung ca. A549 


5.4 


Brain (Amygdala) Pool 


4.1 


Lung ca. NCI-H526 


0.4 


Brain (cerebellum) 


33.0 
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Lung ca. NCI-H23 


100.0 


Brain (fetal) 


25.5 


Lung ca. NCI-H460 


7.9 


Brain (Hippocampus) 
Pool 


7.5 


Lung ca. HOP-62 


4.2 


Cerebral Cortex Pool 


7.7 


Lung ca. NCI-H522 


1.4 


Brain (Substantia 
nigra) Pool 


5.8 


Liver 


1 1 


Brain (Thalamus) Pool 


11.7 


Fetal Liver 


9.* 


Brain (whole) 


9.5 


Liver ca. HepG2 


5.2 


Spinal Cord Pool 


10.5 


Kidney Pool 


40 3 


Adrenal Gland 


9.1 


Fetal Kidney 


40 1 


Pituitary gland Pool 


2.0 


Renal ca. 786-0 


12.9 


Salivary Gland 


8 3 


Renal ca. A498 


3.4 


Thyroid (female) 


2.5 


Renal ca. ACHN 


5.4 


Pancreatic ca. 
CAPAN2 


9.3 


Renal ca. UO-31 


7.2 


Pancreas Pool 


20.9 



Table B PD. Panel 2D 



Tissue Name 


Rel. Exp.(%) 
Ag3524, Run 
169590472 


Tissue Name 


Rel. Exp.(%) 
Ag3524, Run 
169590472 


Normal Colon 


28.1 


Kidney Margin 
8120608 


1.8 


CC We 1 to Mod Diff 
(OD03866) 


2.4 


Kidney Cancer 
8120613 


2.7 


CC Margin (OD03866) 


1.4 


Kidney Margin 
8120614 


8.0 


CC Gr.2 rectosigmoid 
(OD03868) 


3.5 


Kidney Cancer 
9010320 


6.0 


CC Margin (OD03868) 


1.1 


Kidney Margin 
9010321 


11.7 


CC Mod Diff 
(ODO3920) 


9.9 


Normal Uterus 


4.8 


CC Margin (ODO3920) 


6.0 


Uterus Cancer 
064011 


2.3 


CC Gr.2 ascend colon 
(OD03921) 


12.3 


Normal Thyroid 


6.6 


CC Margin (OD03921) 


3.1 


Thyroid Cancer 
064010 


0.5 


CC from Partial 
Hepatectomy 
(ODO4309) Mets 


12.8 


Thyroid Cancer 
A302152 


2.1 


Liver Margin 
(ODO4309) 


34.6 


Thyroid Margin 
A3021 53 


9.2 
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Colon rnets to lung 
(OD04451-01) 


12.2 


Normal Breast 


13.6 


Lung Margin (OD04451- 
02) 


2.9 


Breast Cancer 
(OD04566) 


6.4 


Normal Prostate 6546-1 


9.9 


Breast Cancer 
(OD04590-01) 


9.8 


Prostate; Cancer 
(ODU4410) 


16.6 


Breast Cancer Mets 
(OD045 90-03) 


14.0 


Prostate Margin 
(OD04410) 


15.4 


Breast Cancer 

Metastasis 

(OD04655-05) 


5.3 


Prostate Cancer 
(OD04720-01) 


20.6 


Breast Cancer 
064006 


11.2 


Prostate; Margin 
(OD04720-02) 


16.6 


Breast Cancer 1024 


31.6 


Normal Lung 061010 


27.2 


Breast Cancer 
y lUUZoo 


8.0 


Lun'o Met to Muscle 
(OD04286) 


3.5 


Breast Margin 
9100265 


4.6 


Muscle Margin 
(OD04286) 


6.7 


Breast Cancer 
A209073 


4.1 


Lung Malignant Cancer 
(OD03126) 


3.7 


Breast Margin 
A209073 


8.4 


Lung Margin (OD03126) 


10.8 


Normal Liver 


100.0 


Lung Cancer (OD04404) 


4.1 


Liver Cancer 064003 


44.4 


Lung Margin (OD04404) 


4.3 


Liver Cancer 1025 


47.0 


Lung Cancer (OD04565) 


0 9 


L i ver Cancer 1 026 




Lung Margin (OD04565) 


X.5 


Liver Cancer 6004-T 


59.5 


Lung Cancer (OD04237- 
01) 


13.2 


Liver Tissue 6004-N 


6.6 


Lung Margin (OD04237- 
02) 


4.5 


Liver Cancer 6005-T 


1.5 


Ocular Mel Met to Liver 
(ODO4310) 


6.7 


Liver Tissue 6005-N 


0.5 


Liver Margin 
(ODO4310) 


6.7 


Normal Bladder 


21.5 


Melanoma Mets to Lung 

((~xr\(\A'j') i \ 
\kj\J\)h5Z 1 ) 


3.9 


Bladder Cancer 1023 


0.9 


Lung Margin (OD04321) 


4.7 


Bladder Cancer 
A302173 


5.9 


Normal Kidney 


35.8 


Bladder Cancer 
(OD04718-01) 


5.5 


Kidney Ca, Nuclear 
grade 2 (OD04338) 


18.0 


Bladder Normal 
Adjacent (OD04718- 
03) 


6.5 
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Kidney A*l3.rgin 
(OD04338) 


14.1 


Normal Ovary 


4.5 


Kidney Co. Nuclesr 
grade 1/2 (OD04339) 


13.7 


Ovarian Cancer 
064008 


3.7 


Kidney Margin 
(OD04339) 


9.9 


Ovarian Cancer 
(OD04768-07) 


11.0 


Kidney Ca, Clear cell 
type (OD04340) 


10.8 


Ovary Margin 
(OD04768-08) 


1.1 


Kidney Margin 
(OD04340) 


12.7 


Normal Stomach 


7.6 


Kidney Ca, Nuclear 
grade 3 (OD04348) 


0.5 


Gastric Cancer 
9060358 


3.3 


Kidney Margin 
(OD04348) 


10.6 


Stomach Margin 
9060359 


4.0 


Kidney Cancer 
(OD04622-0I) 


0.9 


Gastric Cancer 
9060395 


8.2 


Kidney Ivlsrgin 
(OD04622-03) 


1.6 


Stomach Margin 
9060394 


5.5 


Kidney Cancer 
(OD04450-01) 


4.2 


Gastric Cancer 
9060397 


3.8 


Kidney Margin 
(OD04450-03) 


5.4 


Stomach Margin 
9060396 


0.9 


Kidney Cancer 8120607 


1.0 


Gastric Cancer 
064005 


14.5 



Table B PE. Panel 4D 



Tissue Name 


Rel. Exp.(%) 
Ag3524, Run 
166445583 


Tissue Name 


Rel. Exp.(%) 
Ag3524, Run 
166445583 


Secondary Thl act 


13.9 


HUVEC IL-lbcta 


2.4 


Secondary Th2 act 


17.1 


HUVEC IFN gamma 


16.4 


Secondary Trl act 


15.4 


HUVEC TNF alpha + 
IFN gamma 


4.8 


Secondary Thl rest 


36.6 


HUVEC TNF alpha + 
1L4 


3.8 


Secondary Th2 rest 


19 I 


HUVEC IL-11 


6.1 


Secondary Trl rest 


28.7 


Lung Microvascular EC 
none 


9.2 


Primary Thl act 


10.2 


Lung Microvascular EC 
TNFalpha + IL-lbeta 


5.0 


Primary Th2 act 


18.2 


Microvascular Dermal 
EC none 


14.5 


Primary Trl act 


28.5 


Microsvasular Dermal 
EC TNFalpha + IL-1 beta 


10.4 
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Primary Thl rest 


100.0 


Bronchial epithelium 
TNFalpha + ILlbeta 


6.1 


Primary Th2 rest 


42.3 


Small airway epithelium 


3.3 


Primary Trl rest 


35.8 


Small airway epithelium 
TNFalpha + IL-lbeta 


29.3 


CD45RA CD4 
lymphocyte act 


14.6 


Coronery artery SMC rest 


5.6 


CD45RO CD4 
lymphocyte act 


28.7 


Coronery artery SMC 
TNFalpha + IL-lbeta 


3.4 


CD8 lymphocyte act 


17.7 


Astrocytes rest 


12.1 


Secondary CD8 
lymphocyte rest 


23.7 


Astrocytes TNFalpha + 
IL-lbetn 


11.5 


Secondary CD8 
lymphocyte act 


12.7 


KU-812 (Basophil) rest 


23.2 


CD4 lymphocyte none 


22.2 


KU-812 (Basophil) 
PIVI A/ i on om y c i n 


37.1 


2ry Thl/Th2/Trl_anti- 
CD95 CHI 1 


39.5 


CCD 1106 

(Keratinocytes) none 


5.0 


LAK cells rest 


17.9 


(Keratinocytes) 
TNFalpha + IL-lbeta 


41.5 


LAK cells IL-2 


41.2 


Liver cirrhosis 


37.9 


LAK cells IL-2+IL-12 


31.0 


Lupus kidney 


21.8 


LAK cells IL-2+IFN 
gamma 


47.0 


NCI-H292 none 


25.0 


LAK cells IL-2+ IL-18 


44.8 


NCI-H292 II -4 


21.3 


LAK cells 
PMA/ionomycin 


5.3 


NCI-H292 IL-9 


21.6 


NK Cells IL-2 rest 


17.8 


NCI-H292 IL-13 


<>2 


Two Way MLR 3 day 


47.6 


NCI-H292 IFN gamma 


10.6 


Two Way MLR 5 day 


13.0 


HPAEC none 


16.5 


Two Way MLR 7 day 


18.8 


HPAEC TNF alpha + IL- 
1 beta 


4.0 


PBMC rest 


7.6 


Lung fibroblast none 


30.4 


PBMC PWM 


24.7 


Lung fibroblast TNF 
alpha + IL-1 beta 


24.3 


PBMC PHA-L 


4.9 


Lung fibroblast IL-4 


23.8 


Ramos (B cell) none 


19.5 


Lung fibroblast IL-9 


146 


Ramos (B cell) 
ionomycin 


10.7 


Lung fibroblast IL-13 


17.1 


B lymphocytes PWM 


22.8 


Lung fibroblast IFN 
gamma 


24.1 


B lymphocytes CD40L 
and IL-4 


40.1 


Dermal fibroblast 
CCD 1070 rest 


20.3 
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EOL-1 dbcAMP 


15.9 


Dermal fibroblast 
CCD1070TNF alpha 


22.4 


EOL-1 dbcAMP 
PMA/ionomycin 


11.7 


Dermal fibroblast 
CCD 1070 IL-1 beta 


0.6 


Dendritic cells none 


22 2 


Dermal fibroblast IFN 
gamma 


10.6 


Dendritic cells LPS 


14.5 


Dermal fibroblast IL-4 


11.8 


Dendritic cells anti- 
CD40 


23.5 


IBD Colitis 2 


7.3 


Monocjrtes rest 


37.9 


IBD Crohn's 


8.0 


Monocles LPS 


15.8 


Colon 


54.0 


Macrophages rest 


10.2 


Lung 


9.7 


Macrophages LPS 


5.0 


Thymus 


39.5 


HUVEC none 


17.3 


Kidney 


63.7 


HUVEC starved 


11.8 







Table B PF. Panel 5 Islet 



Tissue Name 


Rel. Exp.(%) 
Ag3524, Run 
242386392 


Tissue Name 


Rel. Exp.(%) 
Ag3524, Run 
242386392 


97457_Patient- 
02go_adipose 


66.0 


94709_Donor 2 AM - A adipose 


9.0 


97476_Patient- 

07 k_sl eleta] iniikle 


10.7 


94710_Donor 2 AM - B_adipose 


9.8 


97477_Patient- 
i'7i i uterus 


9.4 


9471 l_Donor 2 AM - C_adipose 


8.1 


97478_Patient- 
07pl_placenta 


7.8 


94712_Donor 2 AD - A_adipose 


17.8 


99167 Bayer Patient 
1 


10.2 


94713_Donor 2 AD - B_adipose 


27.0 


97482_Patient- 
OSsj^utsrus 


0.0 


94714_Donor 2 AD - C adipose 


43.5 


97483_Patient- 
08pljplacenta 


0.0 


94742_Donor3U- 
A_Mesenchymal Stein Cells 


0.0 


"74oo Patient- 
09sk_4 eL'u! muscle 


0.0 


94743_Donor 3 U - 
B_Mesench\ iiku Stern Cells 


0.0 


97487_Patient- 
09ut_uterus 


0.0 


94730_Donor 3 AM - A_adipose 


16.8 


97488 Patient- 
09pl__p! icenta 


17.0 


94731_Donor 3 AM - B_adipose 


7.8 


97492JPatient- 
1 Out uterus 


28.1 


94732_Donor 3 AM - C_adipose 


18.0 


97493JPatient- 
1 Opl jplacenta 


25.3 


94733_Donor 3 AD - A adipose 


0.0 
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97495 Patient- 
1 lgoadipose 


61.6 


94734_Donor 3 AD - B_adipose 


0.0 




Q74Qf> Patipnt- 
y /*t"u jr aLiciii" 

1 1 sk skeletal muscle 


28.5 


94735_Donor 3 AD - C_adipose 


18.9 




llututerus 


16.2 


77 1 3 8_Li ver_HepG2untreated 


15.8 




97498 Patipnt- 

1 lpl_placenta 


5.0 


73556_Heart_Cardiac stromal 
cells (primary) 


9.7 




y/ji/u 1 aiieni- 
12go adipose 


100.0 


81735_Small Intestine 


62.9 




97501_Patient- 

1 2sR sk.clcts.1 muscle 


42.6 


72409_Kidney_Proximal 
Convoluted Tubule 


24.0 




97502_Patient- 
12ut uterus 


41.8 


82685_Small 
intestine Duodenum 


32.8 




97503_Patient- 
12pl_placenta 


0.0 


9065 OAdrenalAdrenocortical 
adenoma 


0.0 




94721_Donor2U- 










A_Mesenchymal 
Stem Cells 


0.0 


724 1 0_Kidney_HRCE 


39.8 




94722_Donor2U- 










B_MesenchymaI 
Stem Cells 


0.0 


7241 l_Kidney_HRE 


21.8 




94723_Donor2U- 
C_Mesenchymal 
Stem Cells 


0.0 


73 1 39_Uterus_Uterine smooth 
muscle cells 


8.1 



CNS_neurodegeneration_vl.0 Summary: Ag3524 No differential expression of the 
CG59276-01 gene is detected between Alzheimer's diseased postmortem brains and those 
of non-demented controls in this experiment. However, as observed in panel 1.4 this gene 
is expressed at low levels throughout the CNS, including in amygdala, substantia nigra, 
5 thalamus, cerebellum, cerebral cortex, and spinal cord. Therefore, this gene may play a role 
in central nervous system disorders such as Parkinson's disease, epilepsy, multiple 
sclerosis, schizophrenia and depression. 

General_screening_panel_vl.4 Summary: Ag3524 Expression of the CG59276-01 gene 
is highest in a sample derived from a brain and lung cancer cell lines (CTs = 29). Thus, the 
10 expression of this gene could be used to distinguish these samples from the other samples 
in the panel. The CG59276-01 gene encodes a dihydroorotate dehydrogenase (DHODH) 
homolog. DHODH is an enzyme involved in the pathway for pyrimidine production. Drugs 
known to inhibit DHODH activity, such as brequinar sodium (Dup-785), have been shown 
to have anti-tumor activities (ref. 1). Therefore, therapeutic modulation of the activity of 
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this gene encoded by this gene may be beneficial in the treatment of CNS and lung cancer. 
In addition, low to moderate expression of this gene is seen in all of the samples on this 
panel. Therefore, this gene may be playing an important role in cellular function. 

This gene is expressed at low to moderate levels in a number of tissues with 
metabolic or endocrine function, including adipose, adrenal gland, gastrointestinal tract, 
pancreas, and skeletal muscle. Therefore, therapeutic modulation of the activity of this 
gene may prove useful in the treatment of endocrine/metabolically related diseases, such as 
obesity and diabetes. 

Recently, it has been demonstrated that down regulation of DHODH mRNA using 
RNA interference (RNAi) may inhibit growth of Plasmodium falciparum (ref 2). 

References: 

1. BraakhuisBJ, van Dongen GA, Peters GJ, van Walsum M, Snow GB (1990) 
Antitumior activity of brequinar sodium (Dup-785) against human head and neck squamous 
cell carcinoma xenografts. Cancer Lett 49(2): 133-7. 

2. McRobert L, McConkey GA.(2002) RNA interference (RNAi) inhibits growth 
of Plasmodium falciparum. Mol Biochem Parasitol 1 19(2):273-8 

Panel 2D Summary: Ag3524 The expression of this gene appears to be highest in a 
sample derived from a normal liver tissue (CT=30.3). In addition, there appears to be 
substantial expression in other samples derived from liver cancers and breast cancers. Thus, 
the expression of this gene could be used to distinguish normal liver tissue from other 
samples in the panel. Moreover, therapeutic modulation of this gene, through the use of 
small molecule drugs, protein therapeutics or antibodies could be of benefit in the treatment 
of liver or breast cancer. 

Panel 4D Summary: Ag3524 Highest expression of the CG59276-01 gene is detected in 
resting primary Thl cells (CT=30.03). In addition, the expression of this gene is 
significantly reduced in activated primary Thl cells, suggesting a regulatory role for this 
gene in T-cell activation. The CG59276-01 encodes a dihydroorotate dehydrogenase, an 
enzyme involved in the pathway for pyrimidine production. Recently, an inhibitor of this 
enzyme, leflunomide has been shown to be an effective treatment for rheumatoid arthritis 



852 



(ref 1). Therefore, therapeutics designed with the protein encoded for by this transcript 
could be important in regulating T cell function and treating T cell mediated diseases such 
as asthma, rheumatoid arthritis, psoriasis, IBD, and systemic lupus erythematosus. 

Overall, this gene is expressed at low to moderate levels in a wide range of cell 
types of significance in the immune response in health and disease. These cells include 
members of the T-cell, B-cell, endothelial cell, macrophage/monocyte, and peripheral 
blood mononuclear cell family, as well as epithelial and fibroblast cell types from lung and 
skin, and normal tissues represented by colon, lung, thymus and kidney. This ubiquitous 
pattern of expression suggests that this gene product may be involved in homeostatic 
processes for these and other cell types and tissues. This pattern is in agreement with the 
expression profile in General_screening_panel_vl .4 and also suggests a role for the gene 
product in cell survival and proliferation. 

Therefore, modulation of the gene product with a functional therapeutic may lead to 
the alteration of functions associated with these cell types and lead to improvement of the 
symptoms of patients suffering from autoimmune and inflammatory diseases such as 
asthma, allergies, inflammatory bowel disease, lupus erythematosus, psoriasis, rheumatoid 
arthritis, and osteoarthritis. 

Reference: 

1 . Schattenkirchner M. (2000) The use of leflunomide in the treatment of 
rheumatoid arthritis: an experimental and clinical review. Immunopharmacology 47(2- 
3):291-8 

Panel 5 Islet Summary: Ag3524 This gene has a low level of expression in adipose tissue 
(CTs=33-35). Thus, this gene product may be a small molecule drug for the treatment of 
obesity and obesity-related diseases, including Type 2 diabetes. 

BQ. CG59268-01: KIAA2372 

Expression of gene CG59268-01 was assessed using the primer-probe set Ag3523, 
described in Table BQA. Results of the RTQ-PCR runs are shown in Tables BQB and 
BQC. 



Table BOA . Probe Name Ag3523 
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Primers 


Sequences 


Length 


Start 
Position 


SEQID 
NO: 


Forward 


5 1 -tactcttttggcttgatggaaa-3 ' 


22 


556 


608 


Probe 


TET-5 1 -ccaact tctacgaccaggcagaaaa- 3 ' - 
TAMRA 


25 


578 


609 


Reverse 


5 ' -gacaaagagttggtgcctcttt-3 ' 


22 


610 


610 



Table BOB . General_screening_panel_vl.4 



Tissue Name 


Rel. Exp.(%) 
AgJSZo, Kill) 
216874716 


Tissue Name 


Rel. Exp.(%) 
Ag3523, Run 
216874716 


Adipose 


47.6 


Renal ca. TK-10 


62.9 


Melanoma* 
Hs688(A).T 


4.0 


Bladder 


25.2 


Melanoma* 
Hs688(B).T 


0.0 


Gastric cel (liver met.) 
NCI-N87 


40.1 


Melanoma* Ml 4 


15.4 


Gastric ca. KATO III 


oo 


Melanoma* 
LOXIMVI 


0.8 


Colon ca. SW-948 


0.0 


Melanoma* SK- 
MEL-5 


39.5 


Colon ca. SW480 


9.7 


Squamous cell 
carcinoma SCC-4 


0.0 


Colon ca.* (S W480 
met) SW620 


10.2 


Testis Pool 


35.8 


Colon ca. HT29 


27.2 


Prostate ca.* (bone 
met) PC-3 


8.5 


Colon ca. HCT-116 


52.5 


Prostate Pool 


0.0 


Colon ca. CaCo-2 


31.2 


Placenta 


0.0 


Colon cancer tissue 


14.1 


Uterus Pool 


0.0 


Colon ca. SW1116 


0.0 


Ovarian ca. 
OVCAR-3 


0.0 


Colon ca. CoIo-205 


12.2 


Ovarian ca. SK- 
OV-3 


16.3 


Colon ca. SW-48 


0.0 


Ovarian ca. 
OVCAR-4 


10.3 


Colon Pool 


0.0 


Ovarian ca. 
OVCAR-5 


47.6 


Small Intestine Pool 


3.9 


Ovarian ca. 
IGROV-1 


0.0 


Stomach Pool 


4.3 


Ovarian ca. 
OVCAR-8 


1.5 


Bone Marrow Pool 


0.0 


Ovary 


0.0 


Fetal Heart 


4.2 


Breast ca. MCF-7 


6.0 


Heart Pool 


4.2 


Breast ca. MDA- 


8.8 


Lymph Node Pool 


0.0 
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MB-23 1 








Breast ca. BT 549 


33.4 


Fetal Skeletal Muscle 


0.0 


Breast ca. T47D 


62.9 


Skeletal Muscle Pool 


5.1 


Breast ca. MDA-N 


8.8 


Spleen Pool 


0.0 


Breast Pool 


0.0 


Thymus Pool 


0.0 


Trachea 


0.0 


CNS cancer 
(glio/astro) U87-MG 


9.0 


Lung 


0.0 


CNS cancer 
(glio/astro) U-118-MG 


33.2 


Fetal Lung 


15.1 


CNS cancer 
(neuro;met) SK-N-AS 


23.5 


Lung ca. NCI-N417 


3.5 


CNS cancer (astro) SF- 
539 


11.0 


Lung ca. LX-1 


3.7 


CNS Ccinccr (sstro) 
SNB-75 


13.8 


Lung ca. NCI-H146 


0.0 


CNS cancer (glio) 
SNB-19 


20.7 


Lung ca. SHP-77 


11.0 


CNS cancer (glio) SF- 
295 


30.1 


Lung ca. A549 


6.3 


Brain (Amygdala) Pool 


0.0 


Lung ca. NCI-H526 


0.0 


Brain (cerebellum) 


0.0 


Lung ca. NCI-H23 


3.8 


Brain (fetal) 


4.5 


Lung ca. NCI-H460 


13.5 


Brain (Hippocampus) 
Pool 


4.6 


Lung ca. HOP-62 


4.0 


Cerebral Cortex Pool 


0.0 


Lung ca. NCI-H522 


9.2 


Brain (Substantia 
nigra) Pool 


4 4 


Liver 


4.5 


Biain (lit il «mu >) 1* » >1 


0.0 


Fetal Liver 


0.0 


Brain (whole) 


0.0 


Liver ca. HepG2 


100.0 


Spinal Cord Pool 


0.0 


Kidney Pool 


6.5 


Adrenal Gland 


8.3 


Fetal Kiidney 


17.9 


Pituitary gland Pool 


6.6 


Renal ca. 786-0 


38.7 


Salivary Gland 


16.3 


Renal ca. A498 


3.2 


Thyroid (female) 


0.0 


Renal ca. ACHN 


9.0 


Pancreatic ca. 
CAPAN2 


16.5 


Renal ca. UO-31 


0.0 


Pancreas Pool 


11.3 



Table BOC . Panel 4D 



Tissue Name 


Rel. Exp.(%) 
Ag3523,Run 
166407138 


Tissue Name 


Rel. Exp.(%) 
Ag3523, Run 
166407138 


Secondary Thl act 


11.3 


HUVEC IL-lbeta 


1.5 
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Secondary Th2 act 


2.7 


HUVEC IFN gamma 


0.0 


Secondary Trl act 


9.5 


HUVECTNF alpha + 
IFN gamma 


0.0 


Secondary Thl rest 


0.0 


HUVEC TNF alpha + 
IL4 


0.0 


Secondary Th2 rest 


1.3 


HUVEC IL-11 


5.1 


Secondary Trl rest 


1.7 


Lung Microvascular EC 
none 


0.0 


Primary Thl act 


13.1 


Lung Microvascular EC 
TNFalpha + IL-lbeta 


0.0 


Primary Th2 act 


10.8 


Microvascular Dermal 
EC none 


1.0 


Primary Trl act 


7.5 


Microsvasular Dermal 
EC TNFalpha + IL-lbeta 


1.6 


Primary Thl rest 


1.4 


Bronchial epithelium 
TNFalpha + IL1 beta 


0.0 


Primary Th2 rest 


0.0 


Small airway epithelium 


1.0 


Primary Trl rest 


2.6 


Small airway epithelium 

TNFalnha 4- IT -Ihptn 

1 iNFaipild <^ 1L- 1 UCld 


4.4 


CD45RA CD4 
lymphocyte act 


0.0 


Coronery artery SMC rest 


0.0 


CD45RO CD4 
lymphocyte act 


8.0 


Coronery artery SMC 
TNFalpha + IL-lbeta 


0.0 


CD8 lymphocyte act 


0.0 


Astrocytes rest 


0.0 


Secondary CD8 
lymphocyte rest 


1.5 


Astrocytes TNFalpha + 
IL-lbeta 


0.0 


Secondary CD8 
lymphocyte act 


3.1 


KU-812 (Basophil) rest 


0.0 


CD4 lymphocyte none 


1.0 


KU-812 (Basophil) 
PMA/ionomycin 


8.1 


2ryThl/Th2/Trl anti- 
CD95 CH11 


3.7 


CCD1106 

(Keratinocytes) none 


4.0 


LAK cells rest 


7.2 


CCD 1106 

(Keratinocytes) 
TNFalpha + IL-lbeta 


2.2 


LAK cells IL-2 


0.0 


Liver cirrhosis 


15.3 


LAK cells IL-2+IL-12 


8.9 


Lupus kidney 


5.0 


LAK cells IL-2+1FN 
gamma 


12.2 


NCI-H292 none 


1.0 


LAK cells IL-2+IL-18 


4.7 )NCI-H292 IL-4 


0.6 


LAK cells 
PMA/ionomycin 


1.4 


NCI-H292 IL-9 


1.6 


NK Cells IL-2 rest 


7.9 


NC1-H292 IL-13 


1.5 


Two Way MLR 3 day 


0.0 


NCI-H292 IFN gamma 


1 2 
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Two Way MLR 5 day 


0.0 


HPAEC none 


0.0 


Two Way MLR 7 day 


0.0 


HPAEC TNF alpha + IL- 
1 beta 


0.0 


PBMC rest 


5 5 


Lung fibroblast none 


0.0 


PBMC PWM 


2.6 


Lung fibroblast TNF 
alpha + IL-1 beta 


0.0 


PBMC PHA-L 


0.0 


Lung fibroblast IL-4 


3.9 


Ramos (B cell) none 


0.0 


Lung fibroblast IL-9 


0.0 


Kamos (r> cell) 
ionomycin 


0.0 


Lung fibroblast IL-1 3 


0.0 


B lymphocytes PWM 


8.2 


Lung fibroblast IFN 
gamma 


0.0 


B lymphocytes CD40L 
and IL-4 


1.5 


Dermal fibroblast 
iv t\j rest 


0.0 


EOL-1 dbcAMP 


4.5 


Dermal fibroblast 
Lvuju/u i iNr aipna 


13.9 


FOT -1 HhrAMP 
PMA/ionomycin 


10.8 


Dermal fibroblast 
CCD 1070 IL-1 beta 


0.4 


Dendritic cells none 


0.0 


Dermal fibroblast IFN 
gamma 


0.0 


Dendritic cells LPS 


0.0 


Dermal fibroblast IL-4 


0.0 


Dendritic cells anti- 
CD40 


0.0 


IBD Colitis 2 

I 


0.0 


Monocytes rest 


11.5 


IBD Crohn's 


66.0 


Monoc>1^ LPS 


6.4 


Colon 


100 0 


Macrophages rest 


4.5 


Lung 


0.0 


Macrophages LPS 


1.4 


Thymus 


38.4 


HUVEC none 


1.9 


Kidney 


0.0 


HUVEC starved 


1.6 





CNS_neurodegeneration_vl.O Summary: Ag3523 Expression of this gene is 
low/undetectable (CTs > 35) across all of the samples on this panel (data not shown). 



General_screening_panel_vl.4 Summary: Ag3523 Expression of the CG59268-01 gene 
is highest in sample derived from liver cancer cell line (CT=32.55). Therefore, expression 
of this gene may be used to distinguish liver cancers from the other samples on this panel. 
In addition, low levels of expression of this gene are also observed in one of the ovarian 
cancer, 2 of the breast cancer, 2 of the renal cancer, bladder, gastric cancer, 3 of the colon 
cancer, and 4 of the CNS cancer samples. Therefore, therapeutic modulation of the activity 
of this gene product may be beneficial in the treatment of these cancers. 
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Among the tissues with metabolic or endocrine function, this gene is expressed at 
low levels in adipose tissue sample. Adipose tissue has several crucial roles including (i) 
mobilization from stores of fatty acids as an energy source, (ii) catabolism of lipoproteins 
such as very-low-density lipoprotein and (iii) synthesis and release of hormonal signals 
5 such as leptin and interleukin-6 (Coppack et al., 2001). Therefore, therapeutic modulation 
of the activity of this gene may prove useful in the treatment of endocrine/metabolically 
related diseases, such as obesity, hyperlipidemia, and insulin resistance. 

References: 

1. Coppack SW, Patel JN, Lawrence VJ. (2001) Nutritional regulation of lipid 
10 metabolism in human adipose tissue. Exp Clin Endocrinol Diabetes ;109(Suppl 2):S202- 
S214 

Panel 4D Summary: Ag3523 Expression of the CG59268-01 gene is highest in sample 
derived from colon (CT=31.56). Therefore, expression of this gene may be used to 
distinguish colon sample from the other samples on this panel. In addition, significant 
15 expressiion of this gene is also observed in IBD Crohn's sample (CT=32. 16). Thus, 

expressiion of this gene in colon and Crohn's sample can be used to distinguish these two 
samples from IBD Colitis 2 sample. In addition, therapeutic modulation of the activity of 
this gene product may be beneficial in the treatment of IBD Crohn's disease. 

BR. CG59549-01: H326 like 

20 Expression of gene CG59549-01 was assessed using the primer-probe set Ag3464, 

described in Table BRA. Results of the RTQ-PCR runs are shown in Tables BRB and 
BRC. 

Table BRA . Probe Name Ag3464 



Primers 


Sequences 


Length 


Start 
Position 


SEQ n> 
NO: 


Forward 


5 ' -gtgcgtcacctgttacagaga-3 ' 


21 


1678 


611 


Probe 


TET-5 ' -ctcatcaacccggctggagagatcat-3 ' - 
TAMRA 


26 


1700 


612 


Reverse 


5 1 -ctcttcttcatctgggaactca-3 ' 


22 


1731 


613 



Table BRB . General_screeningjpanel_vl.4 
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Tiss ue Name 


Kel. Jl>xp.(7o) 
Ag3464, Run 
217067408 


Tissue Name 


Rel. Exp.(%) 
Ag3464, Run 
217067408 


Adipose 


0.0 


Renal ca. TK-10 


0.0 


Melanoma* 
Hs688(A).T 


0.0 


Bladder 


0.0 


Melanoma* 
Hs688(B).T 


0.0 


Gastric ca. (liver met.) 
NCI-N87 


1.6 


Melanoma* M14 


1.0 


Gastric ca. KATO III 


2.1 


Melanoma* 
LOXIMVI 


0.0 


Colon ca. SW-948 


0.0 


Melanoma* SK- 
MEL-5 


23.8 


Colon ca. SW480 


2.6 


Squamous cell 
carcinoma SCC-4 


0.0 


Colon ca.* (SW480 
met) SW620 


0.0 


Testis Pool 


22.8 


Colon ca. HT29 


0.0 


Prostate ca.* (bone 
met) PC-3 


0.0 


Colon ca.HCT-1 16 


0.0 


Prostate Pool 


0.0 


Colon ca. CaCo-2 


0.0 




0 0 


Colon cancer tissue 


0.3 


i reus Pool 


0.0 


Colon ca. SW1116 


0.0 


Ovarian ca. 
OVCAR-3 


0.0 


Colon ca. Colo-205 


0.0 


Ovarian ca. SK- 
OV-3 


1.0 


Colon ca. SW-48 


0.0 


Ovarian ca. 
OVCAR-4 


0.0 


Colon Pool 


3.3 


Ov£iri3.n C3.. 
OVCAR-5 


0.0 


Small Intestine Pool 


0.9 


Ovarian ca. 
IGROV-1 


0.0 


Stomach Pool 


0.0 


Ovarian ca. 
OVCAR-8 


4.8 


Bone Marrow Pool 


0.0 


Ovary 


0.0 


Fetal Heart 


0.0 


Breast ca. MCF-7 


0.0 


Heart Pool 


0.0 


Breast ca. MDA- 
MB-231 


0.0 


Lymph Node Pool 


0.0 


Breast ca. BT 549 


0.0 


Fetal Skeletal Muscle 


0.0 


Breast ca. T47D 


4.9 


Skeletal Muscle Pool 


0.0 


Breast ca. MDA-N 


1.1 


Spleen Pool 


0.0 


Breast Pool 


2.2 


Thymus Pool 


0.7 


Trachea 


0.0 


CNS cancer 
(glio/astro) U87-MG 


0.0 


Lung 


0.0 


CNS cancer 
(glio/astro) U-118-MG 


20.4 
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Fetal Lung 


0.0 


^ino cdncer 
(neuro;met) SK-N-AS 


0.0 


Lung ca. NCI-N417 


0.0 


\_-iN5> cancer (astro j or- 
539 


0.0 


Lung ca. LX-1 


0.0 


CNS cancer (astro) 
SNB-75 


0.0 


Lung ca.NCI-H146 


2.0 


CNS cancer (glio) 
SNB-19 


0.0 


Lung ca. SHP-77 


0.7 


CNS cancer (glio) SF- 
295 


100.0 


Lung ca. A549 


0.6 


Brain (Amygdala) Pool 


1.1 


Lungca. NCI-H526 


0.0 


Brain (cerebellum) 


0.0 


Lung ca. NCI-H23 


0.0 


Brain (fetal) 


0.0 


Lung ca. NCI-H460 


0.0 


Brain (Hippocampus) 
Pool 


0.0 


Lung ca. HOP-62 


2.6 


Cerebral Cortex Pool 


0.0 


Lung ca. NCI-H522 


0.0 


Brain (Substantia 
nigra) Pool 


0.8 


Liver 


0.0 


Brain (Thalamus) Pool 


1.1 


Fetal Liver 


0.0 


Brain (whole) 


0.0 


Liver ca. HepG2 


0.0 


Spinal Cord Pool 


0.8 


Kidney Pool 


0.0 


Adrenal Gland 


2.0 


Fetal Kidney 




3.2 


Pituitary gland Pool 


0.0 


Renal ca. 786-0 




0.0 


Salivary Gland 


0.0 


Renal ca. A498 


0.0 


Thyroid (female) 


0.0 


Renal ca. ACHN 


0.0 


Pancreatic ca. 
CAPAN2 


0.0 


Renal ca. UO-31 


0.0 


Pancreas Pool 


1.8 


Table BRC. Panel 4D 


Tissue Name 


Rel. Exp.(%) 
Ag3464, Run 
166417099 


Tissue Name 


Rel. Exp.(%) 
Ag3464, Run 
166417099 


Secondary Thl act 


0.0 


HUVEC IL-lbeta 


0.0 


Secondary Th2 act 


0.0 


HUVEC IFN gamma 


0.0 


Secondary Trl act 


2.5 


HUVEC TNF alpha + 
IFN gamma 


0.0 


Secondary Thl rest 


0.0 


HUVEC TNF alpha + 
IL4 


0.0 


Secondary Th2 rest 


0.0 


HUVEC IL-1 1 


0.0 


Secondary Trl rest 


0.0 


Lung Microvascular EC 
none 


0.0 


Primary Thl act 


0.0 


Lung Microvascular EC 


0.0 
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TNFalpha + IL-lbeta 




Primary Th2 act 


0.0 


Microvascular Dermal 
EC none 


0.0 


Primary Trl act 


0.0 


Microsvasular Dermal 
EC TNFalpha + IL-lbeta 


0.0 


Primary Thl rest 


0.0 


Bronchial epithelium 
TNFalpha + ILlbeta 


0.0 


Primary Th2 rest 


0.0 


Small airway epithelium 


0.0 


Primary Trl rest 


0.0 


Small airway epithelium 
TNFalpha + IL-lbeta 


0.0 


CD45RA CD4 
lymphocyte act 


0.0 


Coronery artery SMC rest 


0.0 


CD45RO CD4 
lymphocyte act 


0.0 


Coronery artery SMC 
TNFalpha + IL-lbeta 


0.0 


CD8 lymphocyte act 


0.0 


Astrocytes rest 


00 


Secondary CD8 
lymphocyte rest 


0.0 


Astrocytes TNFalpha + 
IL-lbeta 


0.0 


Secondary CD8 
lymphocyte act 


0.0 


KU-812 (Basophil) rest 


0.0 


CD4 lymphocyte none 


0.0 


KU-812 (Basophil) 


12.0 


2ry Thl/Th2/Trl anti- 
CD95 CH11 


5.9 


CCD 1106 


0.0 


LAK cells rest 


0.0 


CCD1 106 
(Keratinocytes) 
TNFalpha + IL-lbeta 


0.0 


LAK cells IL-2 


0.0 


Liver cirrhosis 


100.0 


LAK cells IL-2+IL-12 


0.0 


Lupus kidney 


0.0 


LAK cells IL-2+IFN 
gamma 


0.0 


NCI-H292 none 


0.0 


LAK cells IL-2+IL- 18 


0.0 


NCI-H292 11-1 


0.0 


LAK cells 
PMA/ionomycin 


0.0 


NCI-H292 IL-9 


10.0 


NK Cells IL-2 rest 


0.0 NCI-H292IL-13 


0.0 


Two Way MLR 3 day 


0.0 


NCI-H292 I FN gamma 


3.2 


Two Way MLR 5 day 


0.0 


HPAEC none 


0.0 


Two Way MLR 7 day 


0.0 


HPAEC TNF alpha + IL- 
1 beta 


0.0 


PBMC rest 


0.0 


Lung fibroblast none 


7.0 


PBMC PWM 


0.0 


Lung fibroblast TNF 
alpha + IL-1 held 


0.0 


PBMC PHA-L 


0.0 


Lung fibroblast IL-4 


0.0 


Ramos (B cell) none 


0.0 


Lung fibroblast IL-9 


0.0 


Ramos (B cell) 


0.0 


Lung fibroblast IL- 13 


0.0 
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ionomycin 








B lymphocytes PWM 


0.0 


Lung fibroblast IFN 
gamma 


0.0 


B lymphocytes CD40L 
and IL-4 


0.0 


Dermal fibroblast 

PPD1070 rpct 


0.0 


EOL-1 dbcAMP 


0.0 


Dermal fibroblast 

UUlylU/U I J.NJT aipild 


0.0 


FOT -1 rlhrAMP 

PMA/ionomycin 


0.0 


Dermal fibroblast 
CCD 1070 IL-1 beta 


0.0 


Dendritic cells none 


0.0 


Dermal fibroblast IFN 
gamma 


0.0 


Dendritic cells LPS 


0.0 


Dermal fibroblast IL-4 


0.0 


Dendritic cells anti- 
CD40 


0.0 


IBD Colitis 2 


0 0 


Monocytes rest 


0.0 


IBD Crohn's 


6.3 


Monocytes LPS 


0.0 


Colon 


27.4 


Macrophages rest 


0.0 jLung 


3.8 


Macrophages LPS 


0.0 | Thymus 


0.0 


HUVEC none 


0.0 


Kidney 


0.0 


HUVEC starved 


0.0 







CNS_neurodegeneration_vl.O Summary: Ag3464 Expression of the CG59549-01 gene 
is low/undetectable (CTs > 35) across all of the samples on this panel (data not shown). 



General_screening_panel_vl.4 Summary: Ag3464 Expression of the CG59549-01 gene 
is highest in a CNS cancer (glio) SF-295 sample (CT = 31.15). Thus, the expression of this 
5 gene could be used to distinguish this sample from the other samples in the panel. In 
addition, low to moderate expression of this gene is detected in a melanoma and a CNS 
cancer sample. Therefore, therapeutic modulation of this gene or its protein product may be 
beneficial in the treatment of melanoma and CNS cancer. 

Panel 4D Summary: Ag3464 Low but significant expression of the CG59549-01 gene is 
1 0 detected exclusively in liver cirrhosis sample (CT=33.4). Therefore, expression of this gene 
may be used to distinguish liver cirrhosis from the other samples on this panel. 
Furthermore, expression of this gene is not detected in normal liver in Panel 1 .3D, 
suggesting that its expression is unique to liver cirrhosis. Therefore, antibodies or small 
molecule therapeutics could reduce or inhibit fibrosis that occurs in liver cirrhosis. In 
1 5 addition, antibodies to this gene product could also be used for the diagnosis of liver 
cirrhosis. 
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BS. CG59641-01: ACETYL-COA CARBOXYLASE 2 

Expression of gene CG59641 -01 was assessed using the primer-probe set Ag3502, 
described in Table BSA. Results of the RTQ-PCR runs are shown in Table BSB. 



Table BSA . Probe Name Ag3502 



Primers 


Sequences 


Length 


Start 
Position 


SEQID 
NO: 


Forward 


5 ' -ctccctacgtcaccaaggat-3 ' 


20 


5090 


614 


Probe 


TET-5 ■ -aagcgattccaggcccagaccct-3 • - 
TAMRA 


23 


5122 


615 


Reverse 


5 ' -ccgggaagtcatagatgtaggt-3 ' 


22 


5152 


616 



Table E.SB . General screeningjpanel_vl.4 



Tissue Name 


Rel. Exp.(%) 
Ag3502, Run 
217131537 


Tissue Name 


Rel. Exp.(%) 
Ag3502, Run 
217131537 


Adipose 


100.0 


Renal ca. TK-10 


6.6 


Melanoma* 
Hs688(A).T 


1.5 


Bladder 


14.2 


Melanoma* 
Hs688(B).T 


1.4 


Gastric ca. (liver met.) 
NCI-N87 


11.6 


Melanoma* Ml 4 


7.2 


Gastric ca. KATO III 


; 2 


Melanoma* 
LOXIMVI 


0.0 


Colon ca. SW-948 


0.5 


Melanoma* SK- 
MEL-5 


14.8 


Colon ca. SW480 


7.8 


Squamous cell 
carcinoma SCC-4 


0.2 


Colon ca * (SW480 
met) S W620 


5.8 


Testis Pool 


10.0 


Colon ca. HT29 


1.0 


Prostate ca.* (bone 
met) PC-3 


19.9 


Colon ca. HCT-116 


5.8 


Prostate Pool 


12.9 


Colon ca. CaCo-2 


8.2 


Placenta 


1.9 


Colon cancer tissue 


7.6 


' terus \>ol 


9.2 


Colon ca. SW1116 


2.0 


Ovarian ca. 
OVCAR-3 


6.6 


Colon ca. Colo-205 


9.0 


Ovarian ca. SK- 
OV-3 


12.2 


Colon ca. S W-48 


1.7 


Ovarian ca. 
OVCAR-4 


1.9 


Colon Pool 


20.7 


Ovarian ca. 


8.5 


Small Intestine Pool 


27.9 
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OVCAR-5 








Ovarian ca. 
IGROV-1 


1.2 


Stomach Pool 


28.7 


Ovarian ca. 
OVCAR-8 


1.7 


Bone Marrow Pool 


8.0 


Ovary 


12.1 


Fetal Heart 


J0.6 


Breast ca. MCF-7 


63.7 


Heart Pool 


16.7 


Breast ca. MDA- 
MB-231 


4.5 


Lymph Node Pool 


18.8 


Breast ca. BT 549 


1.8 


Fetal Skeletal Muscle 


5.4 


Breast ca. T47D 


10.8 


Skeletal Muscle Pool 


66.0 


Breast ca. MDA-N 


2.3 


OpiCCll rOOI 




Breast Pool 


39.0 


Thymus Pool 


16.4 


Trachea 


15.3 


CNS cancer 
(glio/astro) U87-MG 


5.8 


Lung 


5.6 


CNS cancer 
(glio/astro) U-118-MG 


9.1 


Fetal Lung 


13.6 


CNS cancer 
(neuro;met) SK-N-AS 


5.0 


Lung ca. NCI-N417 


2.3 


CNS cancer (astro) SF- 
539 


3.1 


Lung ca. LX-1 


6.4 


CNS cancer (astro) 
SNB-75 


3.7 


Lung ca. NCI-H146 


2.2 


CNS cancer (glio) 
SNB-19 


1.3 


Lungca.SHP-77 [ 2.8 jCNS cancer (glio) SF- 


8.1 


Lung ca. A549 1 0 0 Brain (Amygdala) Pool 


6.2 


Lung ca. NCI-H526 


0.9 


Brain (cerebellum) 


13.0 


Lung ca. NCI-H23 


26.6 


Brain (fetal) 


4.2 


Lung ca. NCI-H460 


3.9 


Brain (Hippocampus) 
Poo! 


7.5 


Lung ca. HOP-62 


1.4 


Cerebral Cortex Pool 


8.4 


Lung ca. NCI-H522 


13.5 


Brain (Substantia 
nigra) Pool 


? " 


Liver 


23.2 


Brain (Thalamus) Poo! 


10.2 


Fetal Liver 


11.7 


Brain (whole) 


8.8 


Liver ca. HepG2 


6.6 


Spinal Cord Pool 


9.1 


Kidney Pool 


38.7 


Adrenal Gland 


50.0 


Fetal Kidney 


10.4 


Pituitar} gland Pool 


7.4 


Renal ca. 786-0 


1.5 


Salivary Gland 


11.3 


Renal ca. A498 


1.4 


rhyroid (female) 


5.1 


Renal ca. ACHN 


3.0 


Pancreatic ca. 
CAPAN2 


1.7 
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[Renal ca. UO-31 | 2.5 }Pancrea s Pool | 17.0 | 

Genera I_screening_panel_vl. 4 Summary: Ag3502 The CG59641-01 encodes an acetyl- 
CoA carboxylase 2 (ACC2) protein. Expression of this gene is highest in adipose tissue 
(CT=25.5). High levels of expression of this gene are also detected in other tissues with 
metabolic or endocrine function such as pancreas, adrenal gland, gastrointestinal tract, 
heart, skeletal muscle, and thyroid. Acetyl-coenzyme A (acetyl-CoA) carboxylase (ACC) 
catalyzes the synthesis of malonyl-CoA, a metabolite that plays a pivotal role in the 
synthesis and oxidation of fatty. Hence, ACC links fatty acid and carbohydrate metabolism 
through the shared intermediate acetyl-CoA, the product of pyruvate dehydrogenase. It has 
been shown recently that mutations in ACC2 gene lead to loss of body fat in a normal 
caloric intake in mouse (Abu-Elheiga et al., 2001). Therefore, therapeutic modulation of 
the activity of this gene may prove useful in the treatment of endocrine/metabolically 
related diseases, such as obesity and diabetes. 

Low to moderate expression of this gene is also detected in most of the samples 
used in this panel suggesting the possibility of a wider role in intercellular signaling for this 
molecule. 

Among tissues that originate in the central nervous system, this gene is expressed in 
all regions represented on this panel. Therefore, therapeutic modulation of the expression 
or function of this gene may be useful in the treatment of neurologic disorders, such as 
Alzheimer's disease, Parkinson's disease, schizophrenia, multiple sclerosis, stroke and 
epilepsy. 

In addition, significantly higher levels of expression are seen in a breast cancer cell 
line. Thus, expression of this gene could be used to differentiate between this sample and 
other samples on this panel and as a marker to detect the presence of breast cancer. 
Furthermore, therapeutic modulation of the expression or function of this gene may be 
effective in the treatment of breast cancer. 

Reference: 

1 . Abu-Elheiga L, Matzuk MM, Abo-Hashema KA, Wakil SJ. (2001) Continuous 
fatty acid oxidation and reduced fat storage in mice lacking acetyl-CoA carboxylase 2. 
Science 2001 Mar 30;291(5513):2613-6 
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BT. CG59630-01: Midnolin 



Expression of gene CG59630-01 was assessed using the primer-probe set Ag3425, 
described in Table BTA. Results of the RTQ-PCR runs are shown in Tables BTB, BTC and 
BTD. 



Table BTA . Probe Name Ag3425 



Primers 


Sequences 


Length 


Start Position 


SEQBD 
NO: 


Forward 


5 ' -aagctgaccttggtacccac-3 1 


20 


295 


617 


Probe 


TET-5 ' -ctcatgtctcaggcctcaaggcc-3 1 - 
TAMRA 


23 


328 


618 


Reverse 


5 ' -ctctcgagagcttgcatcac-3 1 


20 


361 


619 



Table B TB. CNS_neurodegeneration_vl.0 



Tissue Name 


Rel. Exp.(%) Ag3425, 
Run 210350911 


Tissue Name 


ReL Exp.(%) Ag3425, 
Run 210350911 


AD 1 Hippo 


18.6 


Control (Path) 3 
Temporal Ctx 


14.0 


AD 2 Hippo 


44.1 


Control (Path) 4 
Temporal Ctx 


25.0 


AD 3 Hippo 


11.3 


AD 1 Occipital Ctx 


13 6 


AD 4 Hippo 


19.3 


AD 2 Occipital Ctx 
(Missing) 


0.0 


AD 5 Hippo 


53.2 


AD 3 Occipital Ctx 


11.5 


\D 6 Hippo 


100.0 


AD 4 Occipital Ctx 


24.7 


Control 1 Hippo 47.0 


AD 5 Occipital Ctx 


53.2 


< a mtrol \ Hipp 38.2 


AD 6 Occipital Ctx 


80.7 


Control (Path) 3 
Hippo 


10.5 


Control 1 Occipital 
Ctx 


19.9 


AD 1 Temporal 
Ctx 


17.3 


Control 2 Occipital 
Ctx 


49.3 


AD 2 Temporal 
Ctx 


33.7 


Control 3 Occipital 
Ctx 


31.2 


AD 3 Temporal 
Ctx 


10.4 


Control 4 Occipital 
Ctx 


13.9 


AD 4 Temporal 
Ctx 


19.1 


Control (Path) 1 
Occipital Ctx 


81.2 


AD 5 Inf Temporal 
Ctx 


49.3 


Control (Path) 2 
Occipital Ctx 


11.0 


AD 5 Sup 
Temporal Ctx 


45.4 


Control (Path) 3 
Occipital Ctx 


8.4 
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AD 6 Inf Temporal 
Ctx 


97.3 


x^A/IlUUI yJT axil) t- 

Occipital Ctx 


19.6 


AD 6 Sup 
Temporal Ctx 


85.9 


Control 1 Parietal 
Ctx 


18.8 


Control 1 
Temporal Ctx 


20.3 


f^rmtrrJ 0 Parietal 

Ctx 


29.9 


Temporal Ctx 


59.9 


v^unuuJ d r an eta i 
Ctx 


32.3 


Control 3 
Temporal Ctx 


32.3 


Control (Path) 1 
Parietal Ctx 


91.4 


Control 3 
Temporal Ctx 


17.4 


Control (Path) 2 
Parietal Ctx 


32.3 


Control (Path) 1 
Temporal Ctx 


70.7 


Control (Path) 3 
Parietal Ctx 


7.7 


Control (Path) 2 
Temporal Ctx 


27.5 


Control (Path) 4 
Parietal Ctx 


45.1 



Table BTC . General_screening_panel_vl.4 



Tissue Name 


Rel. Exp.(%) 
Ag3425, Run 
217049295 


Tissue Name 


Rel. Exp.(%) 
Ag3425,Run 
217049295 


Adipose 


17.8 


Renal ca.TK- 10 


26.4 


Melanoima* 
Hs688(A).T 


15.5 


Bladder 


16.0 


Melanoima* 
Hs688(B).T 


19.5 


Gastric ca. (liver met.) 
NCI-N87 


26.6 


Melanoma* Ml 4 


10.5 


Gastric ca. KATO III 


23.0 


Melanoima* 
LOXIMVI 


15.7 


Colon ca. SW-948 


11.6 


Melanoma* SK- 
MEL-5 


8.0 


Colon ca. SW480 


27.5 


Squamous cell 
carcinoma SCC-4 


27.2 


Colon ca.* (SW480 
met) SW620 


15.3 


Testis Pool 


5.6 


Colon ca. HT29 




Prostate ca.* (bone 
met) PC-3 


18.7 


Colon ca. HCT-116 


40.6 


Prostate Pool 


3.3 


Colon ca. CaCo-2 


41.8 


Placenta 


16.8 


Colon cancer tissue 


19.6 


Uterus Pool 


2.6 


Colon ca. SW1116 


9.5 


Ovarian ca. 
OVCAR-3 


27.7 


Colon ca. Colo-205 


41.5 


Ovarian ca. SK- 
OV-3 


58.2 


Colon ca. SW-48 


6.8 


i >v u ian ca. 


4.0 


Colon Pool 


5.9 
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OVCAR-4 








Ovarian ca. 
OVCAR-5 


27.2 


Small Intestine Pool 


6.6 


Ovarian ca. 
IGROV-1 


30.1 


Stomach Pool 


42.9 


Ovarian ca. 
OVCAR-8 




Bone Marrow Pool 


2.7 


Ovary 


7.6 


Fetal Heart 


«> 0 


Breast ca. MCF-7 


37.1 


Heart Pool 


5.7 


Breast ca. MDA- 
MB-231 


15.3 


i^ympn JNode rool 


6.5 


Breast ca. BT 549 


51.1 


Fetal Skeletal Muscle 


6.6 


Breast ca. T47D 


100.0 


Skeletal Muscle Pool 


17.4 


Breast ca. MDA-N 


8.3 


Spleen Pool 


11.2 


Breast Pool 


41 8 


Thymus Pool 


8.0 


Trachea 


15.9 


CNS cancer 
(glio/astro) U87-MG 


46.3 


Lung 


2.6 


CNS cancer 
(glio/astro)U-118-MG 


20.7 


Fetal Lung 


55.5 


CNS cancer 
(neuro;met) SK-N-AS 


29.1 


Lung ca. NCI-N417 


43.5 


CNS cancer (astro) SF- 
539 


13.8 


Lung ca. LX-1 


? „ s CNS cancer (astro) 
jSNB-75 


36.9 


Lung ca. NCI-H146 


5.8 


CNS cancer (glio) 
SNB-19 


29.1 


Lung ca. bHJr- / / 


11.7 


CNS cancer (glio) SF- 
295 


73.7 


Lung ca. A549 


21.5 


Brain (Amygdala) Pool 


5.3 


Lung ca. NCI-H526 


15.7 


Brain (cerebellum) 


14.6 


Lung ca. NCI-H23 


12.9 


Brain (fetal) 


20.6 


T nno osi "MPT TJdfift 
JL.UIlg Cel. rN^l-llHOU 


49.3 


Brain (Hippocampus) 
Pool 


4.6 


Lung ca. HOP-62 


10.1 


Cerebral Cortex Pool 


4.9 


Lung ca. NCI-H522 


16.2 


Brain (Substantia 
nigra) Pool 


10.4 


Liver 


1.3 


Brain (Thalamus) Pool 


6.1 


Fetal Liver 


9.3 


Brain (whole) 


45.1 


Liver ca. HepG2 


29.7 


Spinal Cord Pool 


5.8 


Kidney Pool 


8.0 jAdrenal Gland 


13.6 


Fetal Kidney 


1 1 .3 (Pituitary gland Pool 


6.8 


Renal ca. 786-0 


23.0 .Salivary Gland 


4.6 


Renal ca. A498 


18.2 (Thyroid (female) 


13.9 
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Renal ca.ACHN | 13.6 


Pancreatic ca. r | 
CAPAN2 j " J 


Renal ca.UO-31 1 28.5 


Pancreas Pool 14.U 


Table BTD. Panel 4. ID 


Tissue Name 


Rel. Exp.(%) 1 

Ag3425,Run j Tissue Name 
169839020 1 


Rel. Exp.(%) 
Ag3425, Run 
169839020 


Secondary Phi act 


20.6 


HI'VLC IL-lbeta 


46.7 


Secondary Th2 act 


30.6 


HUVEC 1FN gamma j 


23.3 


[Secondary Trl act j 


JHUVEC TNF alpha + i 
llFN gamma 


13.3 1 


(Secondary Thl rest 


1IUVRC TNF alpha f- 
iS - 11.4 ! 


22.8 


j Sccond&ry Tli2 rest 


23.8 IIUVFC IL-1 1 


22.5 


Secondary Trl rest 


, ^ , I.ung Microvascular EC 1 
12.6 'none 


37.9 


(Primary Thl act 

L___ 


•Lun - \ln.n>\ascular EC 
lTNFalpha + IL-1 beta 


22.2 

— —■' - - 


Primary Th2 act 


Microvascular Dermal 
1 C none 


24.1 


Primary Trl act 


IMicrosvasular Dermal 
20.0 EC TNFa | pha _ [L-lbeta 


21.2 


Primary Thl rest 


(Bronchial epithelium 
18.6 iTNFalpha + ILlbeta 


26.6 


Primary Th2 rest 


19 5 jSmall airway epithelium 
inone 


23.5 


Primary Trl rest 


Small airway epithelium 
— 4 rNFalpha - IL-lbeta 


27.2 


C | > \ Cl)\ 

h mphc cyte ad 


1 7.7 ICoronery artery SMC rest 


22.1 


CD45RDCD4 
lymphocyte act 


ICoronery artery SMC 
U - h TNFalpha + IL-lbeta 


20.2 


CDS hmphocyte act 


14.1 Astroc\tes rest 




Secondary CD8 
lymphocyte rest 


1 Astrocytes TNFalpha + 
238 flL-lbeta 


1— — — 
17.8 


Secondary CD8 
lymphocyte act 


12.2 jKU-8 1 2 (Basophil) rest 


16.5 


CD4 lymphocyte none 


Kl -812 (Basophil) 
TMA/ionomycin 


36.6 


2ryThimi2mi anti- 
CD95 CHI 1 


I" - T7T" "CCD 11 06 

i(Keratinocytes) none 


j 40.1 


|LAK cells rest 


CCD1106 
i(Keratinocytes) 


41.8 
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TNI alpha IL Ibctd 




J \k '.ellsIL-2 


24.5 


Liver cirrhosis 


20.0 


LAK cells IL-2+IL-12 


21.6 ]NCI-H292 none 


41.2 


L \k cells II -2 [FN 
gamma 


23.2 


NCI-H292 IL-4 


58.6 


LAK cells IL-2+ IL-18 


22.2 


NCI-H292 IL-9 


61.6 


LAK cells 

PM A/io nomy cin 


97.3 


NCI-H292 IL-13 


47.6 


NK Cells IL-2 rest 


28.7 


NCI-H292 IFN gamma 


54.0 


Two Way MLR 3 day 


32.5 |HPAEC none 


16 (i 


Two Way MLR 5 day 

L_ 


3Q , 1 ll'AEC TNF alpha + IL- 
jl beta 


20.7 


Two Way MLR 7 day 


1 8.7 Lung fibroblast none 


46 0 


PBMC rest 


27.9 


Lung fibroblast TNF 
alpha + II -1 beta 


20.2 


PBMC PWM 


25.3 


Lunii ribiobla<;t IL-4 


47.0 


PBMC PHA-L 


22.7 


Lung fibroblast IL-9 


52.1 


Ramos < B i ell » none 




1 wvj !ibn>blcisl U -13 


"isT 


Ramos (B cell) 
ionomycin 


23 5 (Lung fibroblast IFN 
gamma 


50.0 


B lymphocytes PWM 


1 „ „ iDermal fibroblast 
CCD 1070 rest 


28.3 


B lymphocytes CD40L | Dermal fibroblast 
and IL-4 CCD 1 070 TNF alpha 


37.1 


crvf , a\/td 1 ->a a (Dermal fibroblast 
E01 <lbcAMP 344 1CCD1070IL-1 beta 


13.3 


EOL-1 dbcAMP 
PMA/ionomycin 


y IDermal fibroblast IFN 
gamma 


28.5 


Dendi it c cells none 


32.3 Dermal fibroblast 11-4 


26.2 


Dendrite cells LPS 


2 1 .5 'Dermal Fibroblasts rest 


26.6 


Dendritic cells anti- 
CD40 


32.S Neutrophils rNFa+LPS 


33.7 


Monocytes rest 


54.3 (Neutrophils rest 


100.0 


Monocj tes L PS 


93.3 


Colon 


18.4 


Macrophages rest 


23.8 


Lung 


28.7 


Macrophages LPS 




Thymus 


28.1 


HUVEC none 


24.7 


Kidney 


14.0 


HUVEC starved 


38.4 1 





CNS_neurodegeneration_vl.0 Summary: Ag3425 This panel confirms the expression of 
this gene at low levels in the brain in an independent group of individuals. However, no 
differen tial expression of this gene was detected between Alzheimer's diseased postmortem 
brains and those of non-demented controls in this experiment. Please see Panel 1 .4 for a 
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discussion of the potential utility of this gene in treatment of central nervous system 
disorders. 



General_screening_panel_vl.4 Summary: Ag3425 The CG59630-01 gene is a 
homologue of mouse midnoline (midbrain nucleolar protein). Its expression is moderate to 
5 high across all of the samples on this panel, with highest expression in a breast cancer cell 
line (CT=25.3). The widespread expression suggests that this gene may play an important 
role in cellular function. In mouse, the expression of this gene is developmentally 
regulated: it is strongly expressed at the mesencephalon (midbrain) of the embryo and is 
involved in regulation of genes related to neurogenesis in the nucleolus (Tsukahara et al., 
10 2000). Based on the gene's expression in all CNS regions examined, this gene may 
therefore play a role in central nervous system disorders such as Alzheimer's disease, 
Parkinson's disease, epilepsy, multiple sclerosis, schizophrenia and depression. 

Among tissues with metabolic function, this gene is expressed at moderate to low 
levels in pituitary, adipose, adrenal gland, pancreas, thyroid, and adult and fetal skeletal 
15 muscle, heart, and liver. This widespread expression among these tissues suggests that this 
gene product may play a role in normal neuroendocrine and metabolic and that disregulated 
expression of this gene may contribute to neuroendocrine disorders or metabolic diseases, 
such as obesity and diabetes. 

Reference: 

20 1 . Tsukahara M, Suemori H, Noguchi S, Ji ZS, Tsunoo H. (2000) Novel nucleolar 

protein, midnolin, is expressed in the mesencephalon during mouse development. Gene 
2000 Aug 22;254(l-2):45-55 

Panel 4.1D Summary: Ag3425 The CG59630-01 gene is a homologue of mouse 
midnoline (midbrain nucleolar protein). Its expression is moderate to high across all of the 

25 samples on this panel, with highest expression in resting neutrophils (CT=29. 1). In 

addition, this gene is expressed at high to moderate levels in a wide range of cell types of 
significance in the immune response in health and disease. These cells include members of 
the T-cell, B-cell, endothelial cell, macrophage/monocyte, and peripheral blood 
mononuclear cell family, as well as epithelial and fibroblast cell types from lung and skin, 

30 and normal tissues represented by colon, lung, thymus and kidney. This ubiquitous pattern 
of expression suggests that this gene product may be involved in homeostatic processes for 
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these and other cell types and tissues. This pattern is in agreement with the expression 
profile in General_screening_panel_vl.4 and also suggests a role for the gene product in 
cell survival and proliferation. Therefore, modulation of the gene product with a functional 
therapeutic may lead to the alteration of functions associated with these cell types and lead 
5 to improvement of the symptoms of patients suffering from autoimmune and inflammatory 
diseases such as asthma, allergies, inflammatory bowel disease, lupus erythematosus, 
psoriasis, rheumatoid arthritis, and osteoarthritis. 

BU. CG59561-01: CYTOSOLIC ACYL COENZYME A THIOESTER 
HYDROLASE 

10 Expression of gene CG59561-01 was assessed using the primer-probe set Ag3424, 

described in Table BUA. Results of the RTQ-PCR runs are shown in Tables BUB, BUC 
and BUD. 



Table BUA . Probe Name Ag3424 



r— 
jPrimers 


Sequences 


Length 


Start 
Position 


ISEQID 1 
NO: 


[Forward 


5 1 -aagctgaccaataaggccac-3 ' 


~ 20 ; 


345" 


620 


iProbe 


TET-5 ' -gtggacaaggtcctcgaagagcctc-3 ' - 
TAMRA 


25 


396 


621 


(Reverse 


5 1 -ctgccggaaatacacaacag-3 1 


20 


421 


622 | 



Table BUB . CNS neurodegeneration vl.O 



Tissue Name 


Rel. Exp.(%) Ag3424, 
Run 210350585 


Tissue Name 


Rel. Exp.(%) Ag3424, 
Run 210350585 


AD 1 Hippo 


8.8 


Control (Path) 3 
Temporal Ctx 


1.5 


AD 2 Hippo 




16.7 


ControKPath) 4 
Temporal Ctx 


25.5 


AD 3 Hippo 


2.7 


AD 1 Occipital 
i i\ 


3.7 


AD 4 Hippo 


4.7 


\l > 2 < >ccipital 
Ctx (Missing) 


0.0 


AD 5 hippo 


100.0 


AD 3 Occipital 
Ctx 


2.0 


AD 6 Hippo 


34.6 


AD 4 Occipital 
C l\ 


8.5 


Control 2 Hippo 


44.8 


AD 5 Occipital 
Ctx 


11.7 
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Control 4 Hippo 


2.2 |AD 6 Occipital j 9Q g 


Control (Path) 3 
Hippo 


2 o jControl 1 Occipital ; ^ ^ 


AD 1 Temporal Ctx 


iControl 2 Occipital i .. . . 
2A Jctx j 84J 


AD 2 Temporal Ctx 


16.6 


Control 3 Occipital! . . 

ctx ! ^ 


AD 3 Temporal Ctx 


2.1 


Control 4 Occipital 
Ctx 


1.3 


AD 4 Temporal Ctx 


8.6 


Control (Path) 1 
Occipital Ctx 


96.6 


AD 5 Inf Temporal 
Ctx 


62.9 


Control (Path) 2 
Occipital Ctx 


5.7 

- 


AD 5 SupTemporal 
Ctx 


Control (Path) 3 
Occipital Ctx 


0.5 


AD 6 Inf Temporal 

Ctx 


26.6 


Control (Path) 4 
Occipital Ctx 


8.8 


AD 6 Sup Temporal 
Ctx 


22.1 


Control 1 Parietal 
Ctx 


2.0 


Control 1 Temporal 

Ctx 


1.2 


Control 2 Parietal 
Ctx 


22.1 


Control 2 Temporal 
Ctx 


65.1 


Control 3 Parietal 
Ctx 


19.9 


Control 3 Temporal 

Ctx 


8.7 


Control (Path) 1 
Parietal Ctx 


94.0 


Control 4 Temporal 1 - . 

Ctx ZA 


Control (Path) 2 
Parietal Ctx 


14.0 


Control (Path) 1 fiJ 1 
Temporal Ctx j 


Control (Path) 3 
Parietal Ctx 


0.9 


Control (Path) 2 
Temporal Ctx 


28.3 


Control (Path ) 4 
Parietal Ctx 


39.0 



Table BUC . Panel 4D 



Tissue Name 


Rel. E.vp.(%) 

Ag3424, Run ) Tissue Name 
166385382 


Rel. Exp.(%) 
Ag3424, Run 
166385382 


^Secondary Thl act 


62.9 UUVEC lL-lbeta 


4.0 


Secondary Th2 act 


37.9 jHUVEC IFN gamma 


10.7 


Secondary Trl act 


68.8 


HUVEC TNF alpha + 
IFN gamma 


18.6 


Secondary Thl rest 


4.2 


HUVEC TNF alpha + 
IL4 


15.5 


Secondary Th2 rest 


12.1 


HUVEC IL- 11 


10.2 
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Secondary Trl rest 


4.4 


Lung Microvascular EC 7 
none 1 


Primary Thl act 


66.9 


Lung Microvascular EC 
TNFalpha + IL-lbeta 


25.7 


Primary Th2 act 


61.1 


Microvascular Dermal 
EC none 


36.3 


Primary Trl act 


43.2 


Microsvasular Dermal 
EC TNFalpha + IL-lbeta 


20.3 


Primary Thl rest 


30.1 


Bronchial epithelium 
TNFalpha + ILlbeta 


25.5 


Primary Th2 rest 


17.4 


Small airway epithelium i 
none 


19.5 


Primary Trl rest 


15.9 


Small airway epithelium 
TNFalpha + IL-lbeta ] 


52.1 


CD45RA CD4 
lymphocyte act 


14.8 


Coronery artery SMC rest 


14.8 


CD45RO CD4 
lymphocyte act 


37.6 


Coronery artery SMC 
TNFalpha + IL-1 beta 


8.3 


CD8 lymphocyte act 


43.8 


Astrocytes rest 


13.9 


Secondairy CD8 
lymphocyte rest 


49.7 


Astrocytes TNFalpha + 
IL-lbeta 


13.3 


Secondary CD8 
lymphocyte act 


24.1 


KU-812 (Basophil) rest 


23.8 


CD4 lymphocyte none 


Q9 |KU-8 12 (Basophil) 
jPMA/ionomyc i n 


51.4 


2ry Thl/Th2/Trl_anti- 
CD95 CHI 1 


9Q 1CCD1106 

(Keratinocyies) none 


62.4 


LAK cells rest 


CCD1 106 
33.0 j(Keratinocytes) 

I NI" alpha i IL-lbeta 


37.1 


[LAK cells IL-2 


27.7 Liver cirrhosis 


3.5 


L \K cells I! -2 i-IL-1 2 


25.5 


Lupus kidney 


1.0 


)LAK cells IL-2+IFN 
gamma 


35.6 


NCI-H292 none 


16.2 


LAK cells IL-2+ IL-18 


21.9 NCI-H292 IL-4 


23.5 ! 


LAK cells 
PMA/ionomycin 


3.6 JNCI-H292 IL-9 


29.5 


NK Cells IL-2 rest 


15.1 NCI-H292 IL-13 


13.4 


Two Way MLR 3 day 


9.4 NCI-H292 IFN gamma 


20 6 


rwo \Va\ MLR 5 da\ 


17.4 


HPAEC none 


17.4 


Two Way MLR 7 day 


13.1 


HPAEC TNF alpha + IL- 
1 beta 


20.9 


PBMC rest 


0.8 


I uii'j tsbi.iMast none 


37.9 


PBMC PWM 


88.9 

„ . 


Lung fibroblast TNF 
alpha + IL-1 beta 


34.9 
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PBMC 1*1-1 A-L 


51 5 L tin? iihi'-ibiast IL-4 ; 


69.7 


Ramos (B cell) none 


17.2 : Lung fibroblast IL-9 


49.7 


Ramos (B cell) 
ionomycin 


31.9 Lung fibroblast IL-1 3 


45.4 


B lymphocytes PWM 


y~ g l.ung fibroblast IFN 
igamma 


90.8 


B lymphocytes CD40L 
and IL-4 


9 9 Dermal fibroblast 
CCD 1070 rest 


54.3 


EOL-1 dbcAMP 


j ^ q Dermal fibroblast 

(CCD 1 070 TNF alpha 


84.1 


EOL-1 dbcAMP 
PMA/ionomycin 


o , iDermal fibroblast 
JCCD1070 IL-1 beta 


30.6 


Dendritic cells none 


,^ 2 JDermal fibroblast IFN 

: gam ma 


31.4 


Dendritic cells 1 PS 


31.9 Dermal fibroblast IL-4 


45.4 


Dendriti c cells anti- 
|CD40 


37.1 |lBD Colitis 2 


2.5 


Monocytes rest 


0 4 IBD Crohn's 


4.7 


iMonocyles LPS 


0.6 


Colon 


45.4 


Macrophages rest 


12.6 


Lung 


16.6 


Macrophages LPS 


5.8 


Thymus 


100.0 


HUVEC none 


16.0 Kidney 


20.6 


fir\E( hulI 


1 ^ 1 





Table BUD . Panel 5 Islet 



Tissue Name 


Rel. E\p.<%) 
Ag3424,Run 
242385366 


Tissue Name 

; 


Rel. Exp.(%) 
Ag3424, Run 
242385366 


97457_Patient- 
02go adipose 


8.2 


94709JDonor 2 AM - A_adipose 


6.7 


97476_ >atient- 
07sk_skeletal muscle 


3.9 


94710_Donor 2 AM - B adipose 


— — — 


97477_Patient- 
07ut_uterus 


7.4 




94711 Donor 2 AM -C adipose 




5.9 


97478_ 5 atient- 
07pl placenta 


7.9 19471 2_Donor 2 AD - A adipose 


8.3 


99167 Jayer Patient 
1 


4.8 |947 1 3_Donor 2 AD - B_adipose 


5.4 


97482 'alienl- 
08ut_uterus 


4. 1 |947 1 4_Donor 2 AD - C_adipose 


7.7 


97483_ >atient- 
08pl_placenta 


24 l94712_Donor3U- 

A Mess.ii'-h>mal Si cm iciL 


3.0 


Q7486_ >atient- 
09sk skeletal muscle 


0 0 94743_Donor 3 II - 

B Mesenchymal Stem Cells 


9.9 
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97487_Patient- 
09ul uterus 


5.3 


94730_Donor 3 AM - A adipose 


22.5 


97488_Patient- 
09pl_placenta 


,3 


94731_Donor3 AM - B_adipose i 


15.1 


97492_Patient- 
lOut uterus 


4.8 


94732_Donor 3 AM - C_adipose \ 


10.2 


97493_Patient- 
1 Op 1 placenta 


6.6 


94733_Donor 3 AD - A adipose 


29.7 


97495_Patient- 
llgoadipose 


0.9 


94734_Donor 3 AD - B adipose 


6.4 


97496_Patient- 

1 1 sk skeletal muscle 


0.3 


94735_Donor 3 AD - C_adipose 


34.9 


97497_Patient- 
llut uterus 


9.8 


77 1 3 8_Liver_HepG2untreated 


57.8 


97498_Patient- 
1 lpl_placenta 


3.0 


73556_Heart_Cardiac stromal 
cells (primary) 


10.3 


97500_Patient- 
12go_aclipose 




81735_Small Intestine 


23.7 


97501_Patient- 

1 2sk_skeletal mus< le 


1 4 


72409_Kidney_Proximal 
L\nnolured Tubule 


— — 

4 5 


97502_Patient- 
1 2ut uterus 


10.2 


82685_Small 
intestine Duodenum 


1.2 


97503_Patient- 
12pl_placenta 


3.2 


90650_Adrenal_Adrenocortical 
adenoma 


1.4 


94721_Donor2 U - 
AMesenchymal 
Stem Cells 


6.8 


72410_Kidney_HRCE 


100.0 


94722_Donor 2 1 I - 
BMesenchymal 
Stem Cells 


2.8 


72411_Kidney_HRE 


69.7 


94723_Donor2 1 1 - 
CMesenchymal 
Stem Cells 


5.1 


73139_Uterus_Uterine smooth 
muscle cells 


22.4 



CNS_neurodegeneration_vl.O Summary: Ag3424 This panel confirms the expression of 
the CG59561-01 gene at low levels in the brain in an independent group of individuals. 
However, no differential expression of this gene was detected between Alzheimer's 
diseased postmortem brains and those of non-demented controls in this experiment. This 
expression profile suggests that this gene may play a role in central nervous system 
disorders such as Alzheimer's disease, Parkinson's disease, epilepsy, multiple sclerosis, 
schizophrenia and depression. 
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General_screening_panel_vl.4 Summary: Ag3424 Results from one experiment with 
the CG59561-01 gene are not included. The amp plot indicates that there were 
experimental difficulties with this run. (Data not shown.) 

Panel 4D Summary: Ag3424 The CG59561-01 gene encodes a protein homologous to 
5 cytosolic acyl coenzyme A thioester hydrolase (Brain acyl-CoA hydrolase, BACH). 

Among the tissue samples used in this panel, highest expression of this gene is detected in 
thymus (CT=29.6). In addition, expression of this gene is stimulated in activated primary 
and secondary - Thl, Th2 and Trl cells. Therefore, this gene product may play an 
important role in T cell development. Thus, therapeutics designed with the protein encoded 
10 for by this transcript could be important in regulating T cell function and treating T cell 
y3 mediated diseases such as emphysema, asthma, arthritis, psoriasis, IBD, and systemic lupus 

JS erythematosus. 

0 Interestingly, expression of this gene is also seen in activated PBMCs (CTs=30) as 

compared to resting PBMCs (CT=36) suggesting a role for this gene product in B-cell and 
1 5 T-cell proliferation. Therefore, small molecules that antagonize the function of this gene 
~- product may be useful as therapeutic drugs to reduce or eliminate the symptoms in patients 

with autoimmune and inflammatory diseases in which B cells play a part in the initiation or 
progression of the disease process, such as systemic lupus erythematosus, Crohn's disease, 
ulcerative colitis, multiple sclerosis, chronic obstructive pulmonary disease, asthma, 
20 emphysema, rheumatoid arthritis, or psoriasis. 

Panel 5 Islet Summary: Ag3424 The CG59561-01 gene is expressed at low levels in 
adipose and placenta, with highest expression in the kidney (CT=30.8). As an enzyme 
involved in lipid homeostasis, therapeutic modulation of this gene product may be a 
treatment for obesity and obesity-related diseases, including Type 2 diabetes. 

25 BV. CG59452-01 : CELL PROLIFERATION RELATED PROTEIN CAP - 

Expression of gene CG59452-01 was assessed using the primer-probe set Ag3443, 
described in Table BVA. Results of the RTQ-PCR runs are shown in Tables BVB and 
BVC. 



Table BVA . Probe Name Ag3443 
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Primers 


Sequences 


Length 


Start 
Position 


! SEQID 
1 NO: 


1 Of l\an 1 '■> ' ni-^-)^ - gga tt a 3 ' 


22 


387 


623 


Probe 


TET-5 1 -catctacaacaagcctggagatgaca-3 1 - 

TAMRA 


26 


431 


1 624 


Reverse 


5 ' -tttccagagcttctgccatt-3 ' 


20 


464 


; 625 



Table BVB . CNS_neurodegeneration_vl.O 



Tissue Name 


Kel. iLxp.(yo) Ag3443, 
Run 210374885 


Tissue Name 


Rel. Exp.(%) Ag3443, 
Run 210374885 


AD 1 Hippo 


10.2 


Control (Path) 3 
Temporal Ctx 


6.2 


AD 2 Hippo 


28.3 


Control (Path) 4 
Temporal Ctx 


27.0 


AD 3 Hippo 


8.0 


AD 1 Occipital Ctx 


13.6 


AD 4 Hippo 


4.8 


AD 2 Occipital Ctx 
(Missing) 


0.0 


AD 5 Hippo 


X4.I 


AD 3 Occipital Ctx 


6.0 


AD 6 Hippo 


7L7 


AD 4 Occipital Ctx' 


" 17.9 


Control 2 Hippo 


31 .4 


AD 5 Occipital Ctx 


~ 54.3" """" 


Control 4 1 lippo 


(..1 


AD 6 Occipital Ctv 


' ' 22.7 


Control (Path) 3 
Hippo 


3 3 


Control 1 Occipital 
Ctx 


4 0 


AD 1 Temporal 




Control 2 Oct ipital 
Ctx 


73.7 


AD 2 Temporal 
Ctx 


33.4 


Control 3 Occipital 
Ctx 


13 ' 7 


AD 3 Temporal 
Ctx 


7.4 


Control 4 Occipital 
Ctx 


7.7 


AD 4 Temporal 
Ctx 


16.8 


Control (Path) 1 
Occipital Ctx 


100.0 


AD 5 II I Temporal 
Ctx 


74.7 


Control (Path) 2 
Occipital Ctx 


11.0 


AD 5 Sup 
Temporal Ctx 


36.1 


Control (Path) 3 
Occipital Ctx 


1.3 


AD 6 Inf Temporal 
Ctx 


79.0 


( in i . <l I'.iihi 1 

Occipital Ctx 


15.3 


AD 6 Sup 
Temporal Ctx 


85.9 


Control 1 Parietal 
Ctx 


6.1 


Control 1 
Temporal Ctx 


8.6 


Control 2 Parietal 
Ctx 


35.8 


Control 2 
Temporal Ctx 


48.3 


Control 3 Parietal 
Ctx 


12.9 


Control 3 


114 


Control (Path) 1 


59.5 
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temporal Ctx 




Parietal Ctx 




[Control 3 

1 smporal Ctx | 


6.5 


Control (Path) 2 
Parietal Ctx 


25.5 


JControl (Path) 1 j 
Temporal Ctx | 


93.3 


Control (Path) 3 
Parietal Ctx 


2.4 


Control (Path) 2 
(Temporal Ctx [ 


23.8 


Control (Path) 4 
Parietal Ctx 


23.2 



Table B V C. Panel 4D 



Tissue Name 


Rel. Exp.(%) 

Ag3443, Run j Tissue Name 
166397102 


Rel. Exp.(%) 
Ag3443, Run 
166397102 


Secondary Till act 


22.7 HUVEC IL-lbeta 


is.: 


Second; ry Th2 act 


25.7 JHUVEC IFN gamma j 18.6 


(Secondary Trl act 


3? |HUVEC TNF alpha + 
jlFN gamma 


16.6 


Secondary Thl rest 


HL"VE< rNl alpha + 

JIL4 


17.0 


iSecondary Th2 rest 


11.2 Til X I C 11-11 


9.3 


[Secondary Trl rest 


. j j 5 (Lung Microvascular EC 
mone 


14.2 


Primary Thl act 


1 , , Lung Microvascular EC 
I \ V alpha + IL-lbeta 


12.9 


Primary Th2 act 


_ Q _ Microvascular Dermal 
|EC none 


17.7 


Primary Trl act 


1 jMicrosvasular Dermal 

EC rNFalpha II -lbeta 


17.1 


Primary Thl rest 


_ I'.i, ik 111 1 . .mi In nam 

TNFalpha + IL lbeta 


6.8 


Primary Th2 rest 


4 q g Small airway epithelium 
none 


8.5 


Primary Trl rest 


, : Small airway epithelium 
TNFalpha + IL-lbeta 


40.9 


CD45RA CD4 
lymphocyte act 


15.6 


Coronery artery SMC rest 


9.4 


CD45RO CD4 
lymphocyte act 


30.4 


Coronery artery SMC 
TNFalpha + IL-lbeta 


8.5 


CDX lynipho<-\te ad 


19.3 


A'-iR»',>tes rest 


16.3 


(Secondary CD8 
lymphocyte rest 


27.7 


Astrocytes TNFalpha + 
IL-lbeta 


30.4 


Secondary CD8 
hinpliocyte act 


16.3 |kU-8 12 (Basophil) rest j 26.8 


!cD4 lymphocyte none j ,2.4 |™; 8 ! 2 < BaS0 ? hil > | 80.7 
\ I IM \ ionom\cin 
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2ry Thl/Th2/Trl_anti- ! 


1?3 1CCD1106 

j(Keratinocytes) none 


13.6 


LAK cells rest 


ICCD1106 
10.7 J(Keratinocytes) 

(TNFalpha 4- IL-lbeta 


100.0 


LAK cells IL-2 


30.4 iLiver cirrhosis 


10.4 


LAK cells IL-2+IL- 12 ! 


24.5 Lupus kidney 


16.5 1 


LAK cells IL-2+IFN 
gamma 


34.2 


NCI-H292 none 


22.4 


1 AK cells 11. -2+ 1 1 - 1 S 


20.9 


NCI-H292 IL-4 


36.9 


LAK cells 
PMA/ionomycin 


13.3 


NCI-H292 IL-9 


27.5 


NK Cells IL-2 rest 


15.6 NCI-H292 1L-13 


15.9 


Two Way MLR 3 day 


19. 5 NCI-H292 IFN gamma 


15.9 


Two Way MLR 5 day 


16.2 


IIP \l C none 


9.3 


Two Way MLR 7 day 


15.1 


HPAEC I NI alpha - II - 
1 beta 


18.9 


PBMC rest 


13 1 


Lung fibroblast none 


20.4 


PBMC PWM 


26.6 


Lung fibroblast TNF 
alpha + IL-1 beta 


20.9 


PBMC PHA-L 


9.6 


Lung fibroblast IL-4 


19.5 


Ramos (B cell) none 


39.2 Lung fibroblast IL-9 12.2 


Ramos (B cell) 
, "in vein 


37.9 Tung fibroblast TL- 13 


10.5 


B lymphocytes PWM 


31.4 


Lung fibroblast IFN 
gamma 


29.1 


B lymphocytes CD40I 
and IL-4 


32.3 


Dermal fibroblast 
CCD1070 rest 


26.8 


EOL-1 dbcAMP 


18.7 


Dermal fibroblast 
CCD1070 TNFalpha 


46.3 


|EOL-l dbcAMP 
|P^MA/io nomy cin 


— 

45.7 


Dermal fibroblast 
CCD1070IL-1 beta 


16.0 


jDendritic cells none 


12.8 


Dermal fibroblast IFN 
gamma 


7.5 


Dendritic cells 1 PS 


9.6 


Dermal fibr^hkiit 11- 1 


17.1 


Dendritic cells anti- 
JCD40 


14.6 


IBD Colitis 2 


4.2 


Mi mi >c\ tes rest 


20.6 


IBD Crohn's 


2}) 


Monocj'tes LPS 


20.9 jColon 


49.0 


Macrophages rest 


15.8 |Lung 


1 1.1 


Macrophages LPS 


12.9 (Thymus 


20.2 


III VEC none 


:l k.dnc 


33.2 


|HUVEC starved 


33 - 7 
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CNS_neurodegeneration_vl.O Summary: Ag3443 This panel confirms the expression of 
the CG59452-01 gene at significant levels in the brain in an independent group of 
individuals. However, no differential expression of this gene was detected between 
Alzheimer's diseased postmortem brains and those of non-demented controls in this 
5 experiment. Expression of this gene in the brain suggests that it may play a role in central 
nervous system disorders other than Alzheimer's disease, such as Parkinson's disease, 
epilepsy, multiple sclerosis, schizophrenia and depression. 

Generall_screening_panel_vl.4 Summary: Ag3443 The amp plot indicates that there 
were experimental difficulties with this run. (Data not shown). 

10 Panel 4D Summary: Ag3443 Highest expression of the CG59452-01 gene is detected in 
TNFalpha + IL-lbeta treated keratin ocytes and PMA/ionomycin treated KU-812 basophil 
cells (CTs=24.5). Thus, antibody or small molecule therapies designed with the protein 
encoded for by this gene could block or inhibit inflammation or tissue damage due to 
basophil activation in response to asthma, allergies, hypersensitivity reactions, psoriasis, 

1 5 and viral infections. 

BW. CG59572-01 and CG59572-02: Pseudouridine Synthase 3 

Expression of gene CG59572-01 and CG59572-02 was assessed using the primer- 
probe set Ag3476, described in Table BWA. Results of the RTQ-PCR runs are shown in 
Tables BWB, BWC and BWD. Please note that CG59572-02 represents a full-length 
20 physical clone of the CG59572-01 gene, validating the prediction of the gene sequence. 



Table BWA . Probe Name Ag3476 



Primers 


Sequences 


iLength 


Start 
Position 


SEQED 
NO: 


Forward 


5 1 -acctacaacaactgtgggctaa-3 ' 




1070 


626 


Probe 


TET-5 ' -tcatgctgtcaaaactcacatgttgt-3 ' - 
TAMRA 


1 26 


1092 


627 


Reverse 


5 ' -ggaacagtgtccagtccttgta-3 ' 




1127 


628 



Table BWB . CNS_neurodegeneration_vl.0 



Tissue Name 


Rel. Exp.(%) Ag3476, 
Run 210377171 


Tissue Name 


Rel. Exp.(%) Ag3476, 
Run 210377171 


AD 1 Hippo 


16.7 


Control (Path) 3 


4.5 
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Temporal Ctx 




at) 9 u; nnn 


23 8 


Control (Path) 4 
Temporal Ctx 


31.6 


AD 3 Hippo 




AD 1 Occipital 
Ctx 


28.5 


AD 4 Hippo 


8.5 


AD 2 Occipital 
Ctx (Missing) 


0.0 


AD 5 hippo 


88.3 


AD 3 Occipital 
Ctx 


7.9 


AD O riippo 


77.4 


AD 4 Occipital 
Ctx 


20.6 


Control 2 Hippo 


37.4 


AD 5 Occipital 

Ctx 


35.4 


IControl 4 Hippo 


9.6 


AD 6 Occipital 
Ctx 


54.7 


Control (Path) 3 
Hippo 


8.4 


Control 1 Occipital 
Ctx 


4.7 


AD 1 Temporal Ctx 


18.6 


Control 2 Occipital 
Ctx 


67.8 


AD 2 Temporal Ctx 


32.3 


Control 3 Occipital 
Ctx 


15.6 


AD 3 Temporal Ctx 


8.0 


Control 4 Occipital 
Ctx 


5.6 


jAD 4 Temporal Ctx 


21.6 


Control (Path) 1 
Occipital Ctx 


100.0 


AD 5 Inf Temporal 
Ctx 


84.1 


Control (Path) 2 
Occipital Ctx 


7.8 


AD 5 SupTemporal 
Ctx 


41.5 


Control (Path) 3 
Occipital Ctx 


5.9 


|AD 6 Inf Temporal 

[Ctx 


77.4 


Control (Path) 4 
Occipital Ctx 


18.0 


LAD 6 Sup Temporal 
Ci\ 


^ 


Control 1 Parietal 
Ctx 


8.1 


Control 1 Temporal 
Ctx 




Control 2 Parietal 
Ctx 


47.0 

-1 


Control 2 Temporal 
Ctx 


40 1 


Control 3 Parietal 
Ctx 


18.2 


i Control 3 Temporal 
Ctx 


22.1 


Control (Path) 1 
Parietal Ctx 


71.7 


Control 4 Temporal 
( i \ 


5.1 


Control (Path) 2 
Parietal Ctx 


20.6 


Control (Path) 1 
Temporal Ctx 


55.1 


Control (Path) 3 
Parietal Ctx 


5.3 


Control (Path) 2 
Temporal Ctx 


31.6 


Control (Path) 4 
Parietal Ctx 


46.3 
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Table BWC . General_screening_panel_vl.4 



Tissue Name 


Rel. Exp.(%) 
Ag3476, Run 
217119004 


Tissue Name 


Rel. Exp.(%) 
Ag3476, Run 
217119004 


Adipose 


5.7 


Renal ca. TK- 10 


13.9 


Melanoma* 
Hs688(A).T 


13.4 


r» ladder 




Melanoma* 
Hs688(B).T 


14.6 


Gastric ca. (liver met.) 
NCI-N87 


42.9 


Melanoma* M14 


18.6 


Gastric ca. k A TO III 


92.0 


Melanoma* 
LOXIMVI 


38.7 


Colon ca. SW-948 


7.3 


Melanoma* SK- 
MEL-5 


27.0 


Colon ca. SW480 


40.9 


Squamous cell 
carcinoma SCC-4 


9.9 


Colon ca.* (SW480 
met) SW620 


27.5 


1 CM!'. I'.'ol 


5.9 


Colon ca. HT29 


24.0 


Prostate ca.* (bone 
met) PC-3 


34.4 


Colon ca. HCT-1 16 


32.5 


(Prostate Pool 


5.6 


Colon ca. CaCo-2 


52.1 


Placenta 


II 


Colon cancer tissue 


10.4 


1 terns Pool 


3.8 


Colon ca. SW1116 


3.9 


Ovarian ca. 
OVCAR-3 


15.4 


Colon ca. Colo-205 


6.1 


lOvarian ca. SK- 
OV " 3 




v_oion ca. o w -ho 


90 

' ™ J 


lOvarian ca. 
OVC KR-4 


14.5 


Colon Pool 


13.6 


Ovarian ca. 
IOVCAR-5 


44.8 


Small Intestine Pool 




Ovarian ca. 

K il\t >\ - 1 


18.4 


Stomach Pool 


7.0 

i 


jOvarian ca. 
OVCAK-8 


10.3 


Bone Marrow Pool 


4.0 


Ovary 


7.0 


Fetal Heart 


6.3 | 


Breast ca. MCF-7 


16.8 


1 leart Pool 


5.4 


iBreast ca. MDA- 
lMB-231 


31.0 


Lymph Node Pool 


14.5 


Breast ca. BT 540 


51.1 


Fetal Skeletal Muscle 


4.2 1 


Breast ui. T4 7D 


100.0 


Skeletal Muscle Pool 


10.7 | 


[Breast ca. MDA-N 


28.7 


Noletn Pool 


5.4 


Breast Pool 


11.0 


Thymus Pool 


| n 1 


iTrachea 


6.2 


CNS cancer 


11.9 
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(glio/astro) U87-MG j 


Lung 


2.0 


CNS cancer 
(glio/astro) U-118-MG 


44.1 


— 

Fetal Lung 


15.5 


CNS cancer 
(neuro;met) SK-N-AS 


29.9 


Lung ca. NCI-N417 


14.5 


CNS cancer (astro) SF- 
539 


13.6 


— 

Lung ca. LX-1 


31.0 


CNS cancer (astro) 
SNB-75 


43.8 


Lung ca. NCI-H146 




- - 

9.3 


CNS cancer (glio) 
SNB-19 


16.0 


Lung ca. SHP-77 


, 1CNS cancer (glio) SF- 

|295 


44.8 

______ 


Lung td. r\J^ry 


14.4 (Brain (Amygdala) Pool 


L__ . 


Lung ca. NCI-H526 


21.6 


Brain (cerebellum) 


. 


Idling Cd. lNol-rlZJ 


23.0 


Brain (fetal) 


7.6 


Lung ca. NCI-H460 


13.9 


Brain (Hippocampus) 
Pool " 


4.5 


1 ,ung ca. I K >P-62 


14.9 jCerebral Cortex Pool 


f> > 


Lung ca. NCI-H522 




nigra) Pool 




Liver 


1.9 jBiam i 1 i il mn : P . 1 


6.9 


Fetal Liver 


1 7.2 (Brain (whole) 


3.8 " " 


Liver ca. HepG2 


18.4 jSpinal Cord Pool 


_ 3.6 


Kicinej Pool 


12.9 (Adrenal Gland 


s 


Fetal Kidney 


15.3 


Pllt ltjl\ gL T.d PC < 1 


"3.2 ~ " 


Renal ea 786-0 


1 v 1 


Salivary Gland 


IS 


Renal ca. A498 


7.1 


Thyroid (female) 


4.4 


Renal ca. ACHN 


10.4 


Pancreatic ca. 
CAPAN2 


27.7 


Renal ca. LO-31 


22.1 


Pancreas Pool 


14.7 



Table BWD . Panel 4D 



Tissue Name 


Rel. Exp.(%) 
Ag3476, Run 
166420471 


Tissue Name 

j 


Rel. Exp.(%) 
Ag3476, Run 
166420471 


Secondary Thl act 


31.4 


JHUVEC IL-lbeta 


27.5 


Secondary Th2 act 


30.6 


fHUVEC IFN gamma 


23.7 


Secondary Trl act 


40.3 


jHUVEC TNF alpha + 
jIFN gamma 


15.4 


Secondary Thl rest 


11.6 


iHUVEC TNF alpha + 
111 


16.2 


Secondary Th2 rest 


9.3 


jHUVEC IL-1 1 


13.4 
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Secondary Trl rest 


9.2 


Lung Microvascular EC 
none 


12.3 


Primary Thl act 


23.5 


Lung Microvascular EC 
TNFalpha + IL-lbeta 


11.0 


Primary Th2 act 


^ g {Microvascular Dermal 
iEC none 


20.3 


Primary Trl act 


s , , jMicrosvasular Dermal 

EC TNFalpha + IL-lbeta 


13.6 j 


Primary Thl rest 


. „ , Bronchial epithelium 
fNFalpha - ILlbeta 


7.8 


Primary Th2 rest 


24.7 


Small airway epithelium 
none 


9.5 


Primary Trl rest 


22.8 


Small airway epithelium 
TNFalpha + IL-1 beta 


48.0 


CD45RA CD4 
lymphocyte act 


15.9 


Coronery artery SMC rest 


14.1 


CD45RO CD4 
lymphocyte act 


. jCoronery artery SMC 
(TNFalpha + IL-lbeta 


9.8 

j 


CD8 lymphocyte act 


26.2 {Astrocytes rest 


13.9 


Secondary CDS 
lymphocyte rest 


46.3 


Astrocytes TNFalpha + 
IL-lbeta 


14.2 


Secondary CD8 
lymphocyte act 


23.0 


KU-8 12 (Basophil) rest 


29.5 


CD4 lymphocyte none 


63 KU-8 12 (Basophil) 
hVlA/ionomycin 


55.1 


2ry Thl/Th2/Trl_anti- 
Cl >95 CH11 


,_ 8 CCD 1106 

j(Keratinocytes) none 


18.7 


LAK cells rest 


|CCD1106 
8.4 j(Keratinocytes) 

rNFalpha - IL-lbeta 


100.0 




29.1 Liver cirrhosis 


12.8 


il^/Vrs. CeilS 1L.-Z+1JL.-1Z 


35.8 Lupus kidney 


7.9 


It w tt 9+tfm 

U/a rv Cells xJu-ZTir' in 
Igamma 


42.6 .NCI-H292 none 


44.4 


1 Ak' <-r>llQ II .">+ 11.18 
L.v/ V IV CCI1S 1 1-J ^ ' tLj 1 O 


28.7 NCI-H292 IL-4 


52 5 


I, a ts. ceils 
[PTVlA/ionomycin 


14.0 |NCI-H292 IL-9 


55.5 


NK C l'I s II -2 rest 


19.9 \( I-H292 II -1 5 


29. 3 ' 


Two Way MLR 3 day 


23.0 (NCI-H292 IFN gamma 


30.1 


Two Way MLR 5 day 


31.0 iHPAECnone 


10.7 


Two Way MLR 7 day 


. o A 5HPAEC TNF alpha + IL- 

I o.U . , , 

Jl beta 


13.1 


PBMC rest ZIIZZ 


4.8 sLung fibroblast none 


23.3 


PBMC PWM 


LimL fibroblast TNF 
[alpha + IL-1 beta 


16.0 
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FhsivlL rHA-L 


11.4 


Lung lihrobl;tM it -4 


27.7 


Ramos (B cell) none 


18.2 


Lung fibroblast 11 .-9 


22.5 


Ramos (B cell) 
ionomycin 


16.2 


Lung fibroblast IL- 13 


19.6 


B lymphocytes PWM 


56.3 


Lung fibroblast IFN 
gamma 


48.3 


B lymphocytes CD40L 
and IL-4 


36.9 


Dermal fibroblast 
CCD 1070 rest 


28.5 


EOL-1 dbcAMP 


23.5 


Dermal fibroblast 
CCD! 070 TNF alpha 


38.7 


EOL-1 dbcAMP 


20.0 


Dermal fibroblast 


11.8 


Dendritic cells none 


10.1 


Dermal fibroblast IFN 
gamma 


9.9 


Dendritic cells LPS 


9.7 


Dermal fibroblast IL-4 


15.7 


Dendritic cells anti- 
CD40 


10.2 


IBD Colitis 2 


6.6 


Monocytes rest 


12.0 


IBD Crohn's 


62 . 1 


\1onoc\tes LPS 


1" . 


Colon 


-<> 0 


Macrophages rest 


13.3 


Lung 


12.0 


Macrophages LPS 


7.4 iThymus 


21.8 


HUVEC none 


24.3 


Kidney 


1 X 3 


HUVEC starv ed 


33.4 







CNS_neurodegeneration_vl.O Summary: Ag3476 This panel confirms the expression of 
the CG5 95 72-01 gene at low levels in the brain in an independent group of individuals. 
However, no differential expression of this gene was detected between Alzheimer's 
diseased! postmortem brains and those of non-demented controls in this experiment. Please 
5 see Panel 1.4 for a discussion of the potential utility of this gene in treatment of central 
nervous system disorders. 

General_screening_panel_vl.4 Summary: Ag3476 Highest expression of the CG59572- 
01 gene is detected in a breast cancer cell line sample (CT=27.4). Furthermore, moderate to 
high expression of this gene is detected in CNS cancer, colon cancer, gastric cancer, 
10 pancreatic cancer, lung cancer, ovarian cancer, and prostate cancer. Therefore, therapeutic 
modulation of the activity of this gene or its protein product, through the use of small 
molecule drugs, protein therapeutics or antibodies, might be beneficial in the treatment of 
these cancers. 
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This gene is expressed at low to moderate levels throughout the CNS, including in 
amygdala, substantia nigra, thalamus, cerebellum, cerebral cortex, and spinal cord. 
Therefore, this gene may play a role in central nervous system disorders such as 
Alzheimer's disease, Parkinson's disease, epilepsy, multiple sclerosis, schizophrenia and 
5 depression. 

Among tissues with metabolic function, this gene is expressed at moderate to low 
levels in pituitary, adipose, adrenal gland, pancreas, thyroid, and adult and fetal skeletal 
muscle, heart, and liver. This widespread expression among these tissues suggests that this 
gene product may play a role in normal neuroendocrine and metabolic and that disregulated 
1 0 expression of this gene may contribute to neuroendocrine disorders or metabolic diseases, 
such as obesity and diabetes. 

In addition, this gene is expressed at much higher levels in fetal lung and liver 
tissue (CTs=30) when compared to expression in the adult counterpart (CTs=33). Thus, 
expression of this gene may be used to differentiate between the fetal and adult source of 
15 these tissues. 

Panel 4D Summary: Ag3476 Highest expression of the CG59572-01 gene is detected in 
TNFalpha + IL-lbeta treated keratinocytes (CT=27.2). Expression of this gene appears to 
be stimulated in activated secondary Thl, Th2 and Trl cells, PWM treated PBMCs, PWM 
treated B-lymphocytes, IL-2/IL-2+IL-12/IL-2+IFN gamma/IL-2+IL-18 treated LAK cells, 

20 and TNFalpha + IL-lbeta treated small airway epithelium (CTs=28-30). Thus, this gene 
may be important in the activation of T and B cells or the function of activated T and B 
cells. Therefore, small molecules that antagonize the function of this gene product may 
reduce or eliminate the symptoms in patients with autoimmune and inflammatory diseases 
in which B and T cells play a part in the initiation or progression of the disease process, 

25 such as systemic lupus erythematosus, Crohn's disease, ulcerative colitis, multiple 

sclerosis, chronic obstructive pulmonary disease, asthma, emphysema, rheumatoid arthritis, 
or psoriasis. 

BX. CG59522-01: Myosin I 

Expression of gene CG59522-01 was assessed using the primer-probe set Ag3456, 
30 described in Table BXA. Results of the RTQ-PCR runs are shown in Table BXB. 
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Table BXA . Probe Name Ag3456 



Primers 


Sequences 


Length 


Start 
Position 


3 SEQID 

1 NO: 


Forward 


5 ' -atgaactgcacttggagagaaa-3 ' 


22 


664 


j 629 


Probe 


TET-5 ' -aatttcacacaccagggagcaggact-3 ' - 
TAMRA 


26 


699 


1 630 


Reverse 


5 1 -ctctgctcatcactcacagtca-3 1 


22 


730 


J 631 



Table BXB . Panel 4D 



Tissue Name 


Rel. Exp.(%) 
Ag3456, Run 
166397214 


Tissue Name 


Rel. Ei p. i '•'..» 
Ag3456, Run 
166397214 


Secondary Thl ik 1 


20.2 


HUVEC II -lbeta 


0.0 


Secondary Th2 act 


19.6 


HUVEC I FN gamma 


0.0 


Secondary Trl act 


35.4 


HUVEC TNF alpha + 
IFN gamma 


0.0 


Secondary Thl rest 


34.2 


HUVEC TNF alpha + 
IL4 


0.0 


Secondan f f 2 rest 


17.3 


HUVEC IL-1 1 


0.0 


Secondary Trl rest 


20.7 


Lung Microvascular EC 
none 


0.0 


Primary Thl act 


10.6 


Lung Microvascular EC 
INI alpha r [L- lbeta 


0.0 


Primary Th2 act 


13 3 


Microvascular Dermal 
EC none 




Primary Trl act 


25.2 


Microsvasular Dermal 
EC TNFalpha I- IL-1 beta 


0.0 


Primary Thl rest 


100.0 


Bronchial epithelium 
TNFalpha + IL lbeta 


0.0 


Primary Th2 rest 


35.1 


Small airway epithelium 
none 


0.0 


Primary Trl rest 


25.9 


Small airway epithelium 
TNFalpha i IL-1 beta 


0.0 


CD45RA CD4 
lymphocyte act 


9.9 


Coronery artery SMC rest 


0.0 

U - ; 


CD45RO CD4 
lymphocyte act 


28.9 


Coronery artery SMC 
TNFalpha IL-1 beta 


0.0 


CD8 lymphocyte act 


27.5 


Astrocytes rest 


0.0 


Secondary CD8 
lymphocyte rest 


15.6 


Astrocytes TNFalpha + 
IL-1 beta 


0.0 


Secondary CD8 
lymphocyte act 


26.6 


KU-812 (Basophil) rest 


2.2 


CD4 lymphocyte none 


- — 


KU-812 (Basophil) 


5.2 
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; PMA/ionomycin \ 


2ry (hi rii2/'l'rl_anli- 
CJL>yj LH 1 1 


312 CCD 1106 

(Keratinocytes) none 


0.0 


LAK cells rest 


CCD 1106 
4.7 (Keratinocytes) 

jlNFalpha + IL-lbeta 


1.0 


LAK cells IL-2 


41.5 Liver cirrhosis 


0.8 


LAK cells IL-2+IL-12 


22.4 iLupus kidney 




LAK cells IL-2+1FN 
gamma 


29.5 |NCI-H292 none 


0.3 


LAK cells IL-2+ IL-18 


25.7 


NCI-H292 IL-4 




LAK cells 

PMA/ionomycin 


4.4 


NCI-H292 IL-9 


0.0 


NK Cells IL-2 rest 


22.2 


NCI-H292 IL-1 3 


0.0 


Two Way MLR 3 day 


14.2 " 
16.0 


NCI-H292 IFN gamma 


0 0 




HPAEC none 


bTo 


Two Way MLR 7 day 


17 , IHPAEC TNF alpha + IL- 
... 1 1 beta 


0.0 


FBMC re. 


7.5 


Lung fibroblast none 


~ To 


PBMC PWM 


33.9 


Lung fibroblast TNF 
alpha + IL-1 beta 


0.0 


PBMl PHA-L 


12.7 


Luiij: fibniblust fl -4 


0.0 


Ka v is ( B lc'1) none 


4" 1 ung fibroblast 11 


0.0 


Ramos (B cell) 
ionomycin 


8.5 jLung fibroblast IL- 13 




B lymphocytes PWM 


. 1 ung fibroblast IFN 

igamma 


— — 

0.0 


B lymphocytes CD40L 
and IL-4 


, o o iDermal fibroblast 
JCCD1 070 rest 


0.0 


EOL-1 dbcAMP 


, _ t IDermal fibroblast 

CCD 1070 TNF alpha 


43.5 


EOL-1 dbcAMP 
PMA, ionomycin 


q „ IDermal fibroblast 

CCD1070 IL-1 beta 


0.0 


Dendritic cells none 


_ , Den al 11 r< blast 11 N 

Z.l i 

igamma 


o.o 


Dendritic cells LPS 


16.4 


Dermal fibroblast IL-4 


0.0 


Dendritic cells anti- 
CD40 


1.6 


IBD Colitis 2 


0.3 


Monocytes rest 


20.9 


IBD Crohn's 


„ P.i 


Monocytes LPS 


45.7 


Colon 


4.2 


Macrop lages rest 


1.3 


Lung 


f 2.5""" 


Mau<>p luc 1 P 1 - 


15.5 Ih\mus 


0.0 


ill \ 1 « II..IK- 


U.O Kidney 


11.3 


HUVEC starved 


0.0 
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CNSneurodegenerationvl.O Summary: Ag3456 Expression of CG59522-01 gene is 
iow/undetectable (CTs > 35) across all of the samples on this panel (data not shown). 

General_screening_panel_vl.4 Summary: Ag3456 Expression of CG59522-01 gene is 
low/undetectable (CTs > 35) across all of the samples on this panel (data not shown). 

5 Panel 4B Summary: Ag3456 Highest expression of the CG59522-01 gene is detected in 
sample derived from resting primary Thl cells (CT=29.8). Thus, expression of this gene 
can be used to distinguish this sample from other samples in this panel. This gene is also 
expressed at low but significant levels in T cells prepared under a number of conditions, 
LAK cells, macrophages and dendritic cells also express the transcript. The only non- 
10 hematopoietic cell type that expresses the transcript detected by this primer and probe at 
significant levels is dermal fibroblasts. Colon and kidney also express low levels of the 
transcript. Thus, this transcript or the protein it encodes could be used to detect 
hematopoietically-derived cells. Furthermore, therapeutics designed with the protein 
encoded by this transcript could be important in the regulation the function of antigen 
1 5 presenting cells (macrophages and dendritic cells)or T cells and be important in the 
treatment of asthma, emphysema, psoriasis, arthritis, and IBD. Therefore, therapeutics 
designed with the protein encoded for by this transcript could be important in regulating T 
cell function and treating T and B cell mediated diseases such as asthma, arthritis, 
psoriasis, IBD, and systemic lupus erythematosus. 

20 BY. CG59520-01 : FARNESYL PYROPHOSPHATE SYNTHETASE 

Expression of gene CG59520-01 was assessed using the primer-probe set Ag5923, 
described in Table BYA. Results of the RTQ-PCR runs are shown in Tables BYB and 
BYC. 

Table B YA. Probe Name Ag5923 



Primers 


Sequences 


Length 


Start 
Position 


SEQ n> 
NO: 


Forward 
Probe 


5 ' -gaatgggaaaccagaaatcag-3 ' 

TET-5 1 -tttatgcccaagcaaagcaggatttc-3 ' - 

TAMRA 


21 

26 


29 


632 
633 


Reverse 


5 ' -accctaacgatctgggagtagt-3 ' 


22 


62 


634 



25 Table BYB . General_screening_panel_vl.5 
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Tissue Name 


ReL Exp.(%) 
Ag5923, Run 
247608956 


— 

Tissue Name 


ReL Exp.(%) 
Ag5923, Run 
247608956 


Adipose 


0.6 


Renal ca. TK-10 


12.1 


Melanoma* 
rlSDoo(Aj.i 


1.9 


Bladder 


7.1 


Melanoma* 
HSD56(i5). i 


2.9 


Gastric ca. (liver met.) 
NCI-N87 


26.2 


Melanoma* M14 


8.9 


Gastric ca. KATO III 


31.0 


Melanoma* 
LOXIMVI 


3.9 


Colon ca. SW-948 


6.8 


Melanoma* SK- 
MEL-5 


2.4 


Colon ca. SW480 


15.3 


Scniamous cell 
carcinoma SCC-4 


1.8 


f n | nn * /"CWARO 
Luion Cd. ^SWtoU 

met) SW620 


8.2 






^oion ca. rxizy 




Prostate ca.* (bone 
met) PC-3 


5.6 


Colon ca. HCT-116 


9.3 


Prostate Pool 


2.3 


Colon ca. CaCo-2 


6.4 


Placenta 




Colon cancer tissue 


7.3 


Uterus Pool 


0.0 


Colon ca. SW1116 


_ _.. - 7A , 


Ovarian ca. 
OVCAR-3 


6.7 


Colon ca. Colo-205 


9.5 


Ovarian ca. SK- 
OV-3 


14.8 


Colon ca. SW-48 


3.8 


Ovarian ca. 
OVCAR-4 


4.2 


Colon Pool 


5.6 


Ovarian ca. 
OVCAR-5 


23.0 


Small Intestine Pool 


7.0 


Ovarian ca. 
T(TROV-1 

1 VJ I V W V I 


4.1 


Stomach Pool 


0.6 


Ovarian ca 
OVCAR-8 


0.7 


Bone Marrow Pool 


2.1 


Ovary 


12.1 


Fetal Heart 




4.1 


Breast ca. MCF-7 


4.4 


Heart Pool 


1.1 


Breast ca ^NIDA" 
MB-231 


6.8 


Lymph Node Pool 


7.3 


Breast ca. BT 549 


17.8 


Fetal Skeletal ^Nluscle 


1.0 


Breast ca 1 PI ) 


5-6 


Skeletal Muscle Pool 


5.3 


Breast ca. MDA-N 


2.5 


Spleen Pool 


2.0 


Breast Pool 


9.5 


1 hymus Pool 


8.0 


Trachea 


14.1 


CNS cancer 
(glio/astro) U87-MG 


1.7 


Lung 


13.9 


CNS cancer 

(Mil ) »tr< * L'-l 18-MC 


10.2 
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Fetal Lung 


16.7 


CNS cancer 
(neuro:met) SK-N-AS 


2.8 


Lung ca. NCI-N417 


1.7 


CNS cancer (astro) SI - 
539 


2.8 


Lungca. LX-1 


12.8 


CNS cancer (astro) 
SNB-75 


14.2 


Lung ca. NCI-H146 


1.6 


CNS cancer (glio) 
SNB-19 


3.9 


Lung ca. SHP-77 


1.5 


CMb cancer (glio) SF- 
295 


10.8 


Lung ca. A549 


1 1.0 


Brain (Amygdala) Pool 


1.8 


Lung ca. !nl i-i i>zo 


0.5 


Ri iin icoie'ielhno) 


4.0 


L Ling ca. iNL 1-H_.> 


10.9 


Brain (fetal) 


4.6 


Lungca. NCI-H460 | 


4.1 


Brain (Hippocampus) 
Pool 


3.3 


Lung ca. HOP-62 


2.3 


Cerebral Cortex Pool 


0 o 


Lung ca. NCI-H522 


5.2 


Brain (Substantia 
nigra) Pool 


3.2 


Liver 


0.5 


Brain (Thalamus) Pool 




Fetal Live" 


4.3 


Brain (whole) 


- :: 


Liver ca. HepG2 


0.3 


Spinal Cord Pool 




Kidney Pool 


13.6 


Adrenal Gland 




Fetal Kidney 


7.1 


Pituiiar\ gland Pool 


0.0 


Renal c.i. 786-0 


1 1.7 


Salivary Gland 




Renal ca. A498 


5.6 


Thyroid (female) 






Renal ca. ACHN 




6.2 


Pancreatic ca. 
CAPAN2 




100.0 


Renal ca.UO-31 




14.0 


Pancreas Pool 




15.3 


Table BYC. Panel 4. ID 










Tissue Name 


Rel. Exp.(%) 
Ag5923, Run 
24757<)<)4<» 


Tissue Name 




Rel. Exp.(%) 
Ag5923, Run 
247579946 


Secondary Thl act 


14.6 


HUVEC IL-lbeta 




5.3 


Secondajy i hi act 


36.6 


HUVEC 1FN gamma 




12.1 


Secondary Trl act 


5.6 


HUVEC TNF alpha + 
1 FK gamma 


0.0 


Secondary Thl rest 


0.0 


HUVEC TNF alpha + 
IL4 


0.0 


Secondary Th2 rest 


10.5 


HUVEC TL-11 


6.7 


Seconda.ry Trl rest 


0.0 


Lung Microvascular EC 
none 




Primary Thl act 


0.0 


1 nng Microvascular EC 
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j rNFalpha + IL-lbeta 




Primary Th2 act 


^ ^ j [Microvascular Dermal 
jEC none 


0.0 


Primary Trl act 


1 _ (Microsvasular Dermal 

EC TNFalpha + IL-lbeta 


5.0 


Primary Th 1 rest 


0.0 


Bronchial epithelium 
TNFalpha + ILlbeta 


13.5 


Primary Th2 rest 


1.5 


Small airway epithelium 
none 


7.2 


Primary Trl rest 


1.3 


Small airway epithelium 
TNFalpha + IL-lbeta 


44.8 


CD45RA CD4 
lymphocyte act 


13.5 


Coronery artery SMC rest 


1.6 


CD45RO CD4 
lymphocyte act 


. ? „ jCoronery artery SMC 
|TNFalpha + IL-lbeta 


6.7 


CD8 lymphocyte act 


1.8 J Astrocytes rest 


1.5 


Secondary CD8 
lymphocyte rest 


, (Astrocytes TNFalpha + 
IL-lbeta 


0.0 


Secondary CD8 
lymphocyte act 


0.0 KU-812 (Basophil) rest 


0.0 


CD4 lymphocyte none 


0Q jKU-8 12 (Basophil) 
|PMA/ionomycin 


15.3 


2ry Thl/Th2/Trl_anti- 
CD95 CHI 1 


00 1CCD1106 

(Keratinoc)tes) none 


21.5 


LAK cells rest 


CCD 1 106 
15.9 j(Keratinocytes) 

rNFalpha ' IL-lbeta 


31.9 


JL/VIv cens LL-Z 


3.5 jLiver cirrhosis 


4 0 


LAK cells IL-2+IL-12 


0.0 


t NCI-H292 none 


61.1 


LAK cells IL-2+IFN 
gamma 


2.3 


NCI-H292 IL-4 


100.0 


LAK cells IL-2+ IL-18 


0.0 


NCI-H292 IL-9 


45.1 


LAK cells 
PMA/ionomycin 


13.0 


NCI-H292 IL-13 


47.0 


iniv ^cijLb il-z rest 


'7.1 


NCI-H292 I FN gamma 


14.5 


I wo vva> iviLK _> dav 


4.9 


III' \l C none 


0.0 


Two Way MLR 5 day 


0.0 


HPAEC rNFalpha + IL- 
1 beta 


24.1 


Two Way MLR 7 day 


1.2 


Lung iihrohldst none 


6.0 


PBMC rest 


0.0 


Lung fibroblast 1 NT- 
alpha + IL- 1 beta 


3.7 


PBMCPWM 


0.0 


Lung fibroblast II -4 


1.0 


PBMC PHA-L 


0.0 


Lung fibroblast IL-9 


8.7 


Ramos (B cell) none 


0.0 


Lung fibroblast 11-13 


0.0 


Ramos (B cell) 


17.7 


1 ung fibroblast IFN 


3.8 
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ionomycin 


gamma | 


B lymphocytes PWM 


, . . iDermal fibroblast 

ICCD 1070 rest 5 * 


D lyilipilUCy ICS ^LJHUL 

and IL-4 


O o Dermal fibroblast 

CCD 1070 TNF alpha 


21.3 


EOL-1 dbcAMP 


_ .. „ i Dermal fibroblast 
CCD1070IL-1 beta 


8.6 


EOL-1 dbcAMP 
PMA/ionomycin 


3.5 


Dermal fibroblast IFN 
gamma 


12.9 


Dendritic cells none 


11.7 


Dermal fibroblast IL-4 


13.8 


Dendritic cells LPS 


8.4 


Dermal Fibroblasts rest 


0.0 


Dendriti c cells anti- 
CD40 


3.8 (Neutrophils TNFa+LPS 


1.5 


Monocytes rest 


0.0 Neutrophils rest 


18.0 


Monocytes LPS 


57.8 Colon 


0.0 


Macrophages rest 


1.7 ;Lung 


0.0 


Macropnages LPS 


9.8 Thymus 


8.S 


HUVEC none 


3.1 Kidney 


4.0 


HI i\ Lf carved 


2.5 





CNS_neurodegeneration_vl.O Summary: Ag5923 Expression of the CG59520-01 gene 
is low/undetectable (CTs > 34.5) across all of the samples on this panel (data not shown). 



General_screening_panel_vl.5 Summary: Ag5923 Highest expression of the CG59520- 
01 gene is detected in sample derived from a pancreatic cancer cell line (CT=31.5). Thus, 
5 expression of this gene can be used in distinguishing this sample from other samples from 
the panel and as a marker for pancreatic cancer. In addition low levels of expression of this 
gene are associated with samples derived from CNS, colon, gastric, renal, lung, breast, 
ovarian and melanoma cnacer cell lines. This gene encodes a farnesyl pyrophosphate 
synthetase, which is involved in cholesterol biosynthesis. It has been suggested that in 
10 several types of cancer, activation of p21 would be aided by continuous farnesylation due 
to stimulation of the cholesterol biosynthetic pathway in tumors (Rao, 1995). Therefore, 
therapeutic modulation of the activity of protein encoded by this gene may be beneficial in 
the treatment of these cancers. 

In addition, low but significant levels of expression in the pancreas suggest that this 
1 5 gene product may be useful in the treatment of type II diabetes. 

References: 
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1. Rao KN. (1995) The significance of the cholesterol biosynthetic pathway in cell 
growth and carcinogenesis (review). Anticancer Res 1995 Mar-Apr; 15(2):309-1 4 

Panel 4.1D Summary: Ag5923 High expression of the CG59520-01 gene is detected in 
sample derived from untreated and IL4 treated NCI-H292 cells (CTs=33). Thus, expression 
5 of this gene could be used to distinguish these samples from other samples from the panel. 
Also, therapeutic modulation of the activity of this gene product may be beneficial in the 
treatment asthma and emphysema. 

Panel 5 Islet Summary: Ag5923 Expression of the CG59520-01 gene is low/undetectable 
(CTs > 34.5) across all of the samples on this panel (data not shown). 

10 BZ. CG 59704-01: Serine/Threonine Kinase 

Expression of gene CG59704-01 was assessed using the primer-probe set Ag3509, 
described in Table BZA. Results of the RTQ-PCR runs are shown in Tables BZB, BZC and 
BZD. 

Table BZA . Probe Name Ag3509 



Primers Sequences 


— i 

Position 


SEQ ID NO: 


Forward 5 ' -gacttccctcacctagcttctg-3 ' 


22 jp~422X 


635 


p , TEl 5' ictg at jo ic actgccgagta - 3 ' - 

rrooe Stamra 


24 j 4257 


636 ' 


Reverse 5 ' -caccaacctagcaaacaaacag-3 ' 


22 j 4281 


637 



1 5 Table BZB . CN S_neurodegeneration_v 1 .0 



Tissue Name 


Rel. Exp.(%) Ag3509, 
Run 210499481 


j Tissue Name 


Rel. F\|).( "..) Ag3S09, 
Run 210499481 


AD 1 Hippo 


20.2 


JControl (Path) 3 
Temporal Ctx 


24.1 


AD 2 Hippo 


20.2 


C ontrol (Path) 4 
Temporal Ctx 


20.7 


\l> 3 Hippo 


23.3 


\\ ) 1 Occipital Ctx 


16.4 ™ 


AD 4 Hippo 


13.5 


AD 2 Occipital Ctx 
j(Missing) 


0.0 


AD 5 Hippo 


66.4 


\[> 3 Occipital Ctx 




AD Hippo 


84.7 


\)) I Occipital ( ix 


4.6 


Control 2 Hippo 


18.0 


JAD 5 Occipital Ctx 


30.4 
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Control 4 Hippo 


0.4 


AD 6 Occipital Ctx 


0.0 


Control (Path) 3 
Hippo 


0.3 


Control 1 Occipital 
Ctx 


10.0 


AD 1 Temporal 
Ctx 


23.7 


Control 2 Occipital 
Ctx 


35.6 


AD 2 Temporal 
Ctx 


15.4 


Control 3 Occipital 
Ctx 


.6.3 


AD 3 Temporal 
Ctx 


10.8 


Control 4 Occipital 
Ctx 


25.2 


AD 4 Temporal 
Ctx 


18.8 


Control (Path) 1 
Occipital Ctx 


57.4 


AD 5 In f Temporal 
(i\ 


100.0 


Control (Path) 2 
Occipital Ctx 


8.1 


AD 5 Sup 
Temporal Ctx 


72.2 


Control (Path) 3 
Occipital Ctx 


5.4 


AD 6 Inf Temporal 
Ctx 


21.9 


Control (Path) 4 
Occipital Ctx 


30.4 


AD 6 Sup 
Temporal Ctx 


62.4 


Control 1 Parietal 
Ctx 


12.1 


Control 1 
Temporal Ctx 


5.4 


Control 2 Parietal 
Ctx 


54.0 


Control 2 
Temporal Ct\ 


30.6 


Control 3 Parietal 
Ctx 


12.0 1 


Control 3 
Temporal Ctx 


— 

6.0 


Control (Path) 1 
I': rictal Ctx 


51.4 


Control 3 
rempon.il Ctx 


12.2 


Control (Path) 2 
Parietal Ctx 


16.0 


Control (Path) 1 
Temporal Ctx 


40.9 


Control (Path) 3 
Parietal Ctx 


2.1 


Control (Path) 2 
Temporal Ctx 


31.9 


Control (Path) 4 
Parietal Ctx 


29.7 



Table BZC . General_screeningjpanel_vl.4 



Tissue Name 


Rel. Exp.(%) 
Ag3509, Run 
217240617 


Tissue Name 


Rel. Exp.(%) 
Ag3509, Run 
217240617 


Adipose 


9.5 


Renal ca. TK- 10 


20.4 


Melanoma* 
Hs688(A).T 


8.7 


Bladder 


i 

16.6 


Melanoma* 
Hs688(B).T 


4.5 


Gastric ca. (liver met.) 
NCI-N87 


37.6 


Melanoma* Ml 4 


12.0 


Gastric ca. KATO III 


23.3 


Melanoma* 
LOX1MV1 


3.1 


Colon ca. SW-948 


6.9 
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Melanoma* SK- 
MEL-5 


11.0 

— -~ 


Colon ca. SW480 


35.8 


carcinoma SCC-4 


9.5 


met) SW620 


24.7 




5.4 


Pnlnn cr\ T4T9Q 


9 "? 


Prostate ca.* (bone 


24.1 


Colon ca. HCT-116 


62.4 


Prostate Pool 


4.5 


Colon ca. CaCo-2 


8.8 


Placenta 


1.9 


Colon cancer tissue 


10.9 


Uterus Pool 


1.8 


Colon ca.SWl 116 


3.7 


Ovarian ca. 
OVCAR-3 


43.2 


Colon ca. Colo-205 


1.2 


Ovarian ca. SK- 
OV-3 


42.3 


Colon ca. SW-48 


2.8 


Ovarian ca. 
OVCAR-4 


1.1 


Colon Pool 


9.2 


Ovarian ca. 
OVCAR-5 


19.1 


Small Intestine Pool 


8.2 


Ovarian ca. 

luKUV-1 


8.7 


Stomach Pool 


3.7 


Ovarian ca. 
OVCAR-8 


7.5 


Bone Marrow Pool 


4.6 




3.4 


Fetal Heart 


7.3 


Breast ca. MCF-7 


20.7 


Heart Pool 


5.6 


Breast ca. MDA- 
MB-231 


32.8 


Lymph Node Pool 


10.1 


Breast ca. BT 549 


35.1 


Fetal Skeletal Muscle 


6.7 


Breast ca. T47D 


34.4 


Skeletal Muscle Pool 


3.6 


Breast ca. MDA-N 


18.6 


Spleen Pool 


7.7 


Breast Pool 


15.7 


Thymus Pool 


11.0 


Trachea 


6.8 


CNS cancer 
(glio/astro) U87-MG 


27.4 


Lung 


15.3 


CNS cancer 
(glio/astro) U-118-MG 


45.4 


Fetal Lung 


17.6 


CNS cancer 
(neuro;met) SK-N-AS 




16.3 


Lung ca.NCI-N417 


11.8 


CNS cancer (astro) SF- 
539 


9.9 


Lung ca. LX-1 


29.5 


CNS cancer (astro) 
SNB-75 


50.7 


Lung ca. NCI-H146 


5.6 


CNS cancer (glio) 
SNB-19 


7.0 


Lung ca. SHP-77 


19.1 


CNS cancel (glio) SI 
295 


25.7 


Lung ca. A549 


17.2 


Brain ( \nn jJ ilai I < k<\ 


1.9 
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Lung ca. NCI-H526 




4.2 


Brain (cerebellum) 1 


13.7 


Lung ca. NCI-H23 


100.0 


Brain (fetal) 


14.8 "~ """"" 


Lung ca. NCI-H460 


27.2 


Brain (Hippocampus) 
Pool 


3.7 


Lung ca. HOP-62 


16.2 


Cerebral Cortex Pool 


3.0 


Lung ca . NCI-H522 




29.9 


Brain (Substantia 
nigra) Pool 


1.7 


Liver 


0.0 (Brain (Thalamus) Pool j 


5.6 


Fetal Liver 




10.1 


Brain (whole) 


6.1 


Liver ca.. HepG2 




17.7 


Spinal Cord Pool J 


2.0 


Kidne> Pool 




26.1 


Adrenal Gland 


8.8^ 


Fetal Kidney 




_ 1 i 


Piluil; r\ gland Pool 


3.4 


Renal ca. 786-0 




19.8 


Salivary Gland 


4.9 


Renal ca. A498 




2.4 


riis f oid < -female) 


3.4 


Renal ca. ACHN 




8.2 


Pancreatic ca. I 
CAPAN2 j 


34.2 


Renal ca. UO-31 




9.5 


Pancreas Pool 


16.7 


Table BZD. Panel 4D 


Tissue Name 


Rel. Exp.(%) 
Ag3509, Run 
166407201 


Tissue Name 


1 Rel. Exp.(%) 
| Ag3509, Run 
166407201 


Secondary Thl u t 


50.0 


III fVEC IL-lbeta 


4.7 


Secondary Th2 act 


45.4 


HUVEC I FN gamma 


4.8 


Secondary Trl act 


87.7 


HUVEC TNF alpha + 
I FN gamma 


10.0 


Secondary Ihl ic.st 


,7.4 


HUVEC TNF alpha + 
IL4 


19.9 


Secondary Th2 rest 


21.9 


HUVEC IL-11 


13 4 


Secondary Trl rest 


32.5 


Lung Microvascular EC 
none 


1 H.l 


Primary Thl act 


40.9 


Lung Microvascular EC 
I NI alpha I IL-lbeta 


24.8 


Primary Th2 act 


83.5 


Microvascular Dermal 
EC none 


22.1 

\ - 


Primary Trl act 


100.0 


Microsvasular Dermal 
EC TNFalpha + IL-lbeta 


12.2 


Primary Thl rest 


75.8 


Bronchial epithelium 
TNFalpha + ILlbeta 


9.9 


Primary Th2 rest 


31.0 


Small airway epithelium 
none 


7.7 


Primary Trl rest 


41.5 


Small airway epithelium 
TNFalpha + IL-lbeta 


23.0 
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CD45RACD4 
lymphocyte act 


Coronery artery SMC rest 


7.6 


CD45RO CD4 i 56 6 
lymphocyte act 


Coronery artery SMC 
TNFalpha + IL-1 beta 


2.0 


CDS lymphocyte act i 58.6 Astrocytes rest 


25.5 


Secondary CD 8 
lymphocyte rest 


. jAstrocytes TNFalpha + 
ilL-lbeta 


28.5 


Secondary CD8 
lymphocyte act 


52.1 


KU-812 (Basophil) rest 


25.5 


CD4 lymphocyte none 


21.0 


KU-812 (Basophil) 
PMA/ionomycin 


68.8 


2ry Thl Th2/Trl_anti- 
CD95 CH11 


41g [CCD1106 

j(Keratinocytes) none 


17.4 


LAK cells rest 


CCD1106 
27.9 j(Keratinocytes) 

jTNFalpha + IL-1 beta 


36.6 


LAK cells IL-2 


58.6 Liver cirrhosis 


— — 


L \K cells 11 -2 11-12 


61.1 Lupus kidney 


: 


LAK cells IL-2+IFN 


85.3 |NCI-H292 none 


20.0 


1-./ \ IV CClIi> 1 1 j -1 i IL-lO 


61.1 iNCI-H292 IL-4 


— ~ 


LAK cells 


49.3 |NCI-H292 IL-9 


24.5 


NK Cel ls IL-2 rest 


43.2 NCI-H292 IL-13 




Two Way MLR 3 day 


50.0 


NCI-H292 IFN gamma 


_\ 


Two Way MLR 5 day- 


39.5 


HPAEC none 




Two Way MLR 7 day 


„ JHPAEC TNF alpha + IL- 
5> A jlbeta 


25.9 


PBMC rest 


1 5.1 Lung fibroblast none 


19.6 


PBMC PWM 


R 1 .ung fibroblast TNF 
;alpha + IL-1 beta 


21.3 


PBMC PHA-L 


14.8 Lung fibroblast IL-4 


„ . 

18.8 


Ramos (B cell) none 


75.3 Lung fibroblast IL-9 


17.8 


Ramos (B cell) 
ionomycin 


1 7.8 jLung fibroblast IL-1 3 


21.9 


B lymphocytes PWM 


^ 34 Lung fibroblast IFN 
j gamma 


21.3 


B lymphocytes CD40L 
and IL-4 


40 _ JDermal fibroblast 
jCCDl 070 rest 


46.7 


EOL-1 dbcAMP 


9 ^ _ jDermal fibroblast 
1 CCD1070 TNF alpha 


69.3 


EOI -1 dbcAMP 
PMA/ionomycin 


18.9 


Dermal fibroblast 
CCD1070 IL-1 beta 


20.3 


Dendritic cells none 


10.7 


Dermal fibroblast IFN 
gamma 


17.4 
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Dendritic cells LPS 


13.6 


jPermal fibroblast 1 [ -4 


21.8 


Dendritic cells anti- 
CD40 




jltJU L-OlltlS Z 


7.1 


Monocytes rest 


10.0 


jIBD Crohn's 


5 5 


Monocytes LPS 


25.3 


1 Colon 


53.6 


Macrophages rest 


29.3 


jLung 


5.6 


Macrophages LPS 


7.2 


Thymus 


27.5 


HI VI C: none 


24.1 


Kidney 


51.4 


HUVEC starved 


33.9 


, L 





CNS_neurodegeneration_vl.O Summary: Ag3509 This panel confirms the expression of 
this gene at low levels in the brain in an independent group of individuals. However, no 
differential expression of this gene was detected between Alzheimer's diseased postmortem 
brains and those of non-demented controls in this experiment. 



5 Genera l_screening_panel_vl. 4 Summary: Ag3509 Highest expression of the CG59704- 
01 gene is detected in a sample derived from a lung cancer cell line (CT=31.69). Thus, 
expression of this gene can be used in distinguishing this sample from other samples in this 
panel. Furthermore, moderate expression of this gene is associated with cell lines derived 
from pancreatic, brain, colon, gastric, renal, lung, breast and ovarian cancers. Therefore, 
10 therapeutic modulation of the activity of this gene or its protein product, through the use of 
small molecule drugs, or antibodies, might be beneficial in the treatment of these cancers. 

Panel 4D Summary: Ag3509 Expression of the CG59704-01 gene is stimulated in T 
cells, LAK cells and B cells, with highest expression in primary activated Trl cells 
(CT=32). Therefore, therapeutics designed with the protein encoded for by this transcript 
15 could be important in regulating T and B cell function and treating T cell/B cell mediated 
diseases such as asthma, arthritis, psoriasis, IBD, allergies, hypersensitivity reactions, 
microbial and viral infections systemic lupus erythematosus, multiple sclerosis, chronic 
obstructive pulmonary disease and systemic lupus erythematosus. 

Furthermore, expression of this gene is decreased in colon samples from patients 
20 with IBD colitis and Crohn's disease relative to normal colon. Therefore, therapeutic 
modulat ion of the activity of this gene product may be useful in the treatment of 
inflammatory bowel disease. 
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Panel 5 Islet Summary: Ag3509 Expression of this gene is low/undetectable (CTs > 35) 
across alii of the samples on this panel (data not shown). 

CA. CG59628-01: short-chain dehydrogenase like homo sapiens 

Expression of gene CG59628-01 was assessed using the primer-probe set Ag3500, 
described in Table CAA. Results of the RTQ-PCR runs are shown in Tables CAB and 
CAC. 



Table CAA . Probe Name Ag3500 



Primers 


Sequences 


Length 


Start 
Position 


SEQ ID 
NO: 


Forward 


5 ' -gcagatgtggtgatgagtatga-3 ' 


1 22 


1159 


638 


Probe 


TET-5 ' -tcaggtaaactaaaaccaacaatggca-3 ' - 
TAMRA 




1207 


639 


Reverse 


5 ' -atcttcaatttccctgacatga-3 ' 


| 22 


1235 


640 



Table CAB . General_screening_panel_vl.4 



Tissue Name 


Rel. Exp.(%) 
Ag3500, Run 
217131378 


Tissue Name 


Rel. Exp.(%) 
Ag3500, Run 
217131378 


Adipose 


25.0 


Renal ca. TK-10 


82.4 


Melanoma* 
Hs688(A).T 


19.3 


Bladder 


15.6 


Melanoma* 
Hs688(B).T 


18.3 


Gastric ca. (liver met.) 
NCI-N87 


36.3 


Mcljn una ' M 1 -! 


33.9 


Gastric ca. (CATC) III 


51.1 


Melanoma* 
LOXIMVI 


36.3 


Colon ca. SW-948 


17.8 


Melanoma* SK- 
MEL-5 


61.1 


Colon ca. S W480 




66.4 


Squamous cell 
carcinoma SCC-4 


(Colon ca.* (SW480 
|met) SW620 


36., 


Testis Pool 


11.4 


Colon ca. HT29 


31.6 


Prostate ca.* (bone 
met) PC-3 


35.1 


Colon ca. HCT-116 


46.0 


Prostate Pool 


3.9 


Colon ca. CaCo-2 


58.6 


Placenta 


1.2 


Colon cancer tissue 


24.1 


Uterus Pool 




Colon ca. SW1116 


6.1 ~ 
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Ovarian ca. 
OVCAR-3 


12.2 


Colon ca. Colo-205 


12.1 


Ovarian ca. SK- 
OV-3 


55.9 


Colon ca. SW-48 


25.0 


Ovarian ca. 

U\ t Ak-4 


7.2 


Colon Pool 


16.2 


Ovarian ca. 
OVCAR-5 


47.6 


Small Intestine Pool 


9.9 1 

j 


Ovarian ca. 
IGROV-1 


23.8 


Stomach Pool 


6.2 


OVCAR-8 


16.5 


Bone Marrow Pool 


4.6 


Ovary 


31.2 


Fetal Heart 


27.0 


Breast ca. MCF-7 


9.7 


Heart Pool 


12.9 


Breast Ca. MDA"- 
MB-231 


33.2 


Lymph Node Pool 


13.1 


Rf^ocf r-n TIT 1 ^/IO 

;r>reasi ca. r> i j^-y 




Fetal Skeletal Muscle 




1 5.1 


Breast ca. T47D 


90.8 


Skeletal Muscle Pool 


40.9 


Breast ca. MDA-N 


IS •) 


Spleen Pool 


1 1 .6 


Breast Pool 


14.1 


Thymus Pool 


13.4 'j 


Trachea 


7.7 


CNS cancer 
(glio/astro) U87-MG 


27.0 


Lung 


8.1 


CNS cancer 
(glio/astro) U-118-MG 


39.0 


Fetal Lung 


27.7 


CNS cancer 
(neuro;met) SK-N-AS 


20.0 

- ] 


Lung ca. NCI-N417 


4.4 


CNS cancer (astro) SF- 
539 


15.0 


Lung ca. LX-1 


30.1 


CNS cancer (astro) 
SNB-75 


100.0 


Lung ca. NCI-H146 


1.7 


CNS cancer (glio) 
SNB-19 


22.5 

— — — j 


Lung ca. SHP-77 


27.4 


CNS cancer (glio) SF- 
295 


20.6 


Lung ca. A549 


24.5 Biam ( Aimiidala) Poo! 


4 5 


Tuner >JPT_T-TS9ft 


5.6 


Brain (cerebellum) 




Lung ca. NCI-H23 


23.8 


Brain (fetal) 


2 0 

™ 


Lung ca. NCI-H460 


22.2 


Brain (Hippocampus) 
Pool 


9.2 


Lung ca. HOP-62 


14.4 


Cerebral Cortex Pool 


11.7 


Lungca. NCI-H522 


24.0 


Brain (Substantia 
nigra) Pool 


7.7 


Liver 


1.2 


Bidin (Thalamus) 


10.1 


Fetal Liver 


28.7 


Brain (whole) 


2.8 
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Liver ca. HepG2 




36.6 


Spinal Cord Pool 


17.2 


Kidney Pool 




10.8 


Adrenal Gland 


18.6 


Fetal Kidney 




13.6 


Pitt ' ir\ gland Pool 


2.4 


Renal ca. 786-0 




44.1 


Salivary Gland 


1.7 


Renal c.i \498 


13.7 


Tin ro id (female) 


11.3 


Renal ca. ACHN 


19.1 


Pancreatic ca. 
CAPAN2 


34.9 


Renal ca. UO-31 


37.4 


I'.u.uvas Pool 


22.8 


Table CAC. Panel 4D 


Tissue Name 


Rel. Exp.(%) 
Ag3500, Run 
loo44iy4z 


Tissue Name 


Rel. Exp.(%) 
Ag3500, Run 

1 CCA At C\A^ 


Secondary Thl act 


6.0 


HUVEC IL-lbeta 


6,8 


Secondarv Th2 act 


10.1 


HUVEC IFN gamma 


17.1 


Secondary Trl act 


14.0 


HUVEC TNF alpha + 
I FN gamma 


5.6 


Secondary Thl rest 




4.1 


HUVEC TNF alpha + 
IL4 




6.2 


Secondary Th2 rest 




2.4 


HUVEC IL-11 




7.2 


Secondary Trl rest 


3.1 


Lung Microvascular EC 
none 


6.0 


Primary Thl act 


1.7 


Lung Microvascular EC 
INI alpha - L IL-lbeta 


4.5 


Primary Th2 act 




6.4 


Microvascular Dermal 
EC none 


20.0 


Primary Trl act 




10.5 


Microsvasular Dermal 
EC TNFalpha + IL-lbeta 


5.9 


Primary Thl rest 


20.3 


Bronchial epithelium 
rNFalpha [Llbeta 


6.8 


Primary Th2 rest 


9.8 


Small airway epithelium 
none 


11.2 


Primary Trl rest 


18.0 


Small airway epithelium 
TNFalpha + IL-lbeta 


50.7 


CD45RA CD4 
lymphocyte act 




11.5 


Coronery artery SMC rest 


6.8 


CD45RO CD4 
1 mph >c\tc act 


8.5 


Coronery artery SMC 
rNFalpha - IL-lbeta 




5.7 


CDS lymphocyte act 


13.2 


Astrocytes rest 




17.9 


Secondary CDS 
lymphocyte rest 


6.9 


Astrocytes TNFalpha + 
IL-lbeta 


26.2 


Secondary' CDS 
ly mph< icyte act 


6.2 


KU-812 (Basophil) rest 


13.2 
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CD4 lymphocyte none 


2.1 


jKU-812 (Basophil) 
•PMA/ionomycin 


23.0 


2ry fhl7Th2/Trl_anti- 
v ij 7 j n l i 


3.2 


ICCD1106 

i(Keratinocytes) none 


14.5 










LAK cells rest 


2.5 


J(Keratinocytes) 
fTNFalpha + IL-lbeta 


68.8 


L \K cells 1L-2 


12.2 


Liver cirrhosis 


20.4 


LAK cells II -2+1L-12 


9.8 


jLupus kidney 


13.5 


LAK cells IL-2+IFN 
gamma 


11.7 


i 

INCI-H292 none 


47.0 


LAK cells IL-2+IL-18 


8.0 


INCI-H292 IL-4 


21.8 


LAK cells 
PMA/ionomycin 


2.0 


INCI-H292 IL-9 


29.5 


NK Cells IL-2 rest 


6.3 


INCI-H292 IL-13 


13.3 


Two Way MLR 3 day 


64 


INCI-H292 IFN gamma 


12.3 


[\vo Wa> MLR 5 da\ 


12.3 


fHPAEC none 


10.5 


Two Way MLR 7 day 




|HPAEC TNF alpha + IL- 
ll beta 


7.3 


PB\1( re- 


3.3 


Lung fibroblast none 


10.1 


PBMC PWM 


11.6 


I Lung fibroblast TNF 

lolr\Vio -4- TT 1 Vi*ato 
jaipild <^ LL,- 1 ucla 


15.1 


I >I } \ 1 nuA T 


5.7 


iLung fibroblast IL-4 


9.1 


Ramos (B cell) none 




1 nut: fibroblast IL-9 


------ 


Ramos (B cell) 
ionomycin 


8.3 


— I-,.,, — «. — , 

I 

Lung fibroblast IL-13 


8.1 


B lymphocytes PWM 


27.4 


Lung fibroblast IFN 
gamma 


24.5 


B lymphocytes CD40L 


15.0 


Dermal fibroblast 


39.8 


onA TT A 
anil Uj-H 


L v_-L»iu/u rest 


EOL-1 dbcAMP 


20.7 


Dermal fibroblast 
el u IU/U 1 f\r alpha 


47.3 


1 ( tl ! I'i ^ \ 1 1 > 

HvJJL-1 UDCAlVIr 

PMA/ionomycin 


4.7 


Dermal fibroblast 
CCD1070 IL-1 beta 


12.4 


Dendritic cells none 


6.9 


(Dermal fibroblast IFN 
jgamma 


11.4 


Dendritic cells LPS 


11.6 


(Dermal fibroblast IL-4 


48.3 


Dendritic cells anti- 
CD40 


3.8 


f IBD Colitis 2 


2.2 


Monocytes rest 


6.0 


IBD Crohn's 


8.4 


Monocytes LPS 


2.3 


Colon 


100.0 


Macrophages rest 


12.2 


Lung 


14.0 


Macrophages 1 PS 


- 54 


jThymus 


73.2 ! 


HLJVEC none 


19.2 


kidney 


25.0 
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HUVEC v, dived " 31.9 "I 

CNS_neurodegeneration_vl.O Summary: Ag3500 Results from one experiment with the 
CG59628-01 gene are not included. The amp plot indicates that there were experimental 
difficulties with this run. 

Generall_screening_panel_vl.4 Summary: Ag3500 Highest expression of the CG59628- 
5 0 1 gene is detected in a sample derived from a CNS cancer cell line (CT=3 1.1). Therefore, 
expression of this gene may be used to distinguish this sample from the other samples on 
this panel. In addition, significant expression of this gene is associated with samples 
derived from colon, ovarian, breast, renal, lung, melanoma, and brain cancer cell lines. 
Therefore, therapeutic modulation of the activity of this gene or its protein product might 
1 0 be beneficial in the treatment of these cancers. 

Among tissues with metabolic function, this gene is expressed at low but significant 
levels in pituitary, adipose, adrenal gland, pancreas, thyroid, and adult and fetal skeletal 
muscle, heart, and liver. This widespread expression among these tissues suggests that this 
gene product may play a role in normal neuroendocrine and metabolic and that disregulated 
1 5 expression of this gene may contribute to neuroendocrine disorders or metabolic diseases, 
such as obesity and diabetes. 

This molecule is also expressed at low levels in the CNS, including the 
hippocampus, thalamus, substantia nigra and cerebral cortex. Therefore, therapeutic 
modulation of the expression or function of this gene may be useful in the treatment of 
20 neurologic disorders, such as Alzheimer's disease, Parkinson's disease, schizophrenia, 
multiple sclerosis, stroke and epilepsy. 

Panel 4D Summary: Ag3500 Highest expression of the CG59628-01 gene is detected in 
colon (CT=30.3). Therefore, expression of this gene may be used to distinguish colon from 
the other tissues on this panel. Furthermore, expression of this gene is decreased in colon 
25 samples from patients with IBD colitis and Crohn's disease relative to normal colon. 

Therefore, therapeutic modulation of the activity of the GPCR encoded by this gene may 
be useful in the treatment of inflammatory bowel disease. 

CB. CG 59671-02: acetyl-coenzyme A synthetase 
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Expression of gene CG59671-02 was assessed using the primer-probe sets Ag3506 
and Ag3581, described in Tables CBA and CBB. Results of the RTQ-PCR runs are shown 
in Tables CBC, CBD, CBE and CBF. 



Table CBA . Probe Name Ag3506 



Primers 


Sequences 


Length 


Start Position 


SEQID 
NO: 


Forward 


5 1 -aggacacagctacgtggtgtat-3 1 


22 


1072 


641 


Probe 


TET-5 ' -cctctctgcaatggtgccaccag-3 1 - 
TAMRA 


23 


1097 


642 


Reverse 


5 ' -gataaactggggtgctctcaa-3 ' 


:i 


1128 


643 


Table CBB. Probe Name Aa3581 


















Primers 


Sequences 


Length 


Start 
Position 


SEQID 
NO: 


Forward 


5 ' -aggacacagctacgtggtgtat-3 1 


22 


1072 


644 


Probe 


TET-5 1 -cctctctgcaatggtgccaccag-3 1 - 
TAMRA 


23 


1097 


645 


Reverse 


5 ' -gataaactggggtgctctcaa-3 ' 


21 


1 1 :s 


646 



Table CBC . CNSneurodegenerationvl.O 



Tissue 
Name 


Rel. Exp.(%) 
Ag3506, Run 
210497900 


Rel. Exp.(%) 
Ag3581, Run 
210643840 


Tissue T*el.Exp.(%) 
f? SSUe Ag3506,Run 

Name 210497900 


Rel. 1! \p.(%) 
Ag3581, Run 
2101.43840 


AD 1 Hippo 


30.1 


36.9 


Control 
(Path) 3 
Temporal 
Ctx 


15.8 


21.5 


AD 2 Hippo 


87.7 


89.5 


Control 
(Path) 4 
Temporal 
Ctx 


42.6 


53.6 


AD 3 Hippo 


12.5 


12.5 


AD 1 
Occipital 

Ctx 


35.6 


31.4 


AD 4 Hippo 

j 


29.1 


38.4 


AD 2 

Occipital 

Ctx 

(Missing) 


0.0 


0.0 


AD 5 Hippo 


55.5 


69.3 

, 


AD 3 
Occipital 

Ctx 


17.8 


18.7 
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AD 6 H ippo 


24.1 


33.4 


AD 4 

Occipital 

Ctx 


' " ~ 

47.0 




38.4 


v^oniroi z 
Hippo 


42.3 


56.3 


AD 5 

Occipital 

Ctx 


42.9 




26, 


Control 4 
Hippo 


42.6 


60.3 


AD 6 

Occipital 

Ctx 


24.3 


51.1 


Control 
(Path) 3 
Hippo 


19.8 


23.0 


Control 1 
Occipital 
Ctx 


20.3 


16.2 


AD 1 

Temporal 

Ctx 


63.7 


48.3 


Control 2 
Occipital 
Ctx 


71.2 


63.7 


AD 2 

Temporal 

Ctx 


60.7 


87.7 


Control 3 
Occipital 
Ctx 


35.4 


41.5 


AD 3 

Temporal 

Ctx 


15.9 


15.6 


Control 4 
Occipital 
Ctx 


20.0 

- 


27.9 


AD 4 

Temporal 

Ctx 


47.0 


57.0 


Control 
(Path) 1 
Occipital 
Ctx 


81.2 


100.0 


AD 5 Inf 
Temporal 
Ctx 


68.3 


72.2 


Control 
(Path) 2 
Occipital 
Ctx 


18.9 


. 

26.4 


AD 5 Sup 
Temporal 


65.5 


57.8 


Control 
(Path) 3 
Occipital 
Ctx 


I— 

15.7 


10.1 


AD 6 Inf 
Temporal 
Ctx 


29.5 


35.6 


Control 

(Path) 4 
Occipital 
Ctx 


24.8 


26.8 


AD 6 Sup 
Temporal 
Ctx 


31.2 


31.9 


Control 1 
Parietal Ctx 


21.9 


33.7 


Control 1 
Temporal 
Ctx 


22.5 


29.3 


Control 2 
Parietal Ctx 


51.8 


79.0 


Control 2 
Temporal 
Ctx 


55.1 


58.6 


Control 3 
Parietal Ctx 


31.4 


38.4 


Control 


26.4 


34.6 


Control 


100.0 


93.3 
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Temporal 
Ctx 






(Path) 1 
Parietal Ctx 






Control 3 
Temporal 
Ctx 


35.6 


35.8 


Control 
(Path) 2 
Parietal Ctx 


51.1 


57.4 


Control 
(Path) 1 
Temporal 
Ctx 


62.9 


73.7 


Control 
(Path) 3 
Parietal Ctx 


17.1 


17.2 


Control 
(Path) 2 
Temporal 
Ctx 


54.0 


55.1 


Control 
(Path) 4 
Parietal Ctx 


54.3 


55.9 



Table CBD . General_screening_panel_vl.4 



Tissue Name 


Rel. Exp.(%) 
Ag3506, Run 
217236187 


Rel. Exp.(%)| 

Ag3581,Run) Tissue Name 
217423588 I 


Rel. Exp.(%) 
Ag3506, Run 
217236187 


Rel. Exp.(%) 
Ag3581, Run 
217423588 


Adipose 


3.6 


3.5 iRenal ca.TK-10 


4.1 


3.4 


Melanoma* 
Hs688(A).T 


1.8 




1.0 


Bladder 


16.5 


12.2 


Melanoma* 
Hs688(B).T 


„ 


0.6 


Gastric ca. (liver 
met.) NCI-N87 


i 

21.5 


,73 


Melanoma* 
M14 


100.0 


30.4 


Gastric ca. 
KATO III 


63.3 


53.2 


Melanoma* 
LOXIMVI 


0.0 


0.0 


( iolon ca. SW- 
948 


8.0 


5.0 


Melanoma* 
SK-MEL-5 


z 


100.0 


Colon ca. 
SW480 


20.7 

l—t--- ■ 


16.6 


Squamous 
cell 

carcinoma 
SCC-4 


10.9 


6.3 


Colon ca.* 
(SW480 met) 
SW620 


0.0 


0.1 


Testis Pool 


11.9 


9.8 Colon ca. HT29 


24.7 


15.7 


Prostate ca.* 
(bone met) 
PC-3 


2.0 


7 ^ Colon ca. I ICT- 
Z ' 5 II 16 


0.0 


0.0 


Prostate Pool 


13.0 


7.5 


Colon ca. CaCo- 
2 


60.3 


40.1 


Placente 


87.1 


50.3 


Colon cancer 
tissue 


30.8 


18.8 


Uterus Pool 


7.0 


4.5 


Colon ca. 
SW1I16 


5.3 


2.4 


Ovarian ca. 
OVCAR-3 


28.5 


14.4 


Colon ca. Colo- 
205 




1.9 
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Ovarian ca. 
SK-OV-3 


7.7 


4.2 


Colon ca. SW-48 


33.2 


168 


Ovarian ca. 


2.6 


1.0 


Colon Pool 


14.3 


15.1 


Ovarian ca. 

OA I AD ^ 


28.3 


14.1 


Small Intestine 
Pool 


15.0 


7.4 


[Ovarian ca. 
ilGROV-1 


0.5 


0.2 

.„,,,„ 


Stomach Pool 


8.3 


4.8 


jOvarian ca. 
OVCAR-8 


1.4 


1.1 


Bone Marrow 
Pool 


7.3 


4.7 


(har\ 


7.9 




Fetal Heart 


16.0 


12.8 


Breast ca. 
MCF-7 


3.8 




2.1 


Heart Pool 


13.1 


10.1 


Breast ca. 

MDA-MB- 

231 


8.2 


6.0 


Lymph Node 
Pool 


16.3 


9.2 


Breast ca. H 1 

[549 


1.6 


1.2 


Fetal Skeletal 
Muscle 


6.9 


3.7 


Breast ca. 
jT47D 


62.9 


31.2 


Skeletal Muscle 
Pool 


18.9 


15.1 


Breast ca. 
MDA-M 


29.7 


15.4 


Spleen Pool 


8.8 


6.9 


Breast Fool 


12.5 


8.8 


Thymus Pool 


19.6 


11.6 


Trachea 


12.4 


8.2 


CNS cancer 
(glio/astro) U87- 
MG 


0.6 


0.3 


Lung 




1.7 


0.8 

1- — 


CNS cancer 
(glio/astro) U- 
118-MG 


0.4 


0.2 


Fetal Lung 


29.7 




18.0 


CNS cancer 
(neuro;met) SK- 
N-AS 


3.4 


2.2 


r 

Lung ca. 
NCI-N417 


0.1 


0.0 


CNS cancer 
(astro) SF-539 


mmmmm 

0.1 


0.0 


Lung ca. LX- 
1 


0.1 


0.1 


CNS cancer 
(astro) SNB-75 


10.4 


2.1 


Lung ca. 
NC1-H146 


0.3 


0.1 


CNS cancer 
(glio) SNB-19 


1.0 


0.2 


Lung ca. 
SHP~-77 


7.8 


5.0 


CNS cancer 
(glio) SI -295 


1.3 


0.6 


Luna ca. 
A549 


7.8 


5.1 


Brain 

(Amygdala) Pool 


10.3 


5.9 


Lung ca. 
NCI-H526 


10.2 


2.5 


Brain 

(cerebellum) 


32.3 


21.0 


Lung ca. 
NCI-H23 


3.4 


3.2 | 


Brain (fetal) 


9.7 


4.7 
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Lung ca. 
NCI-H460 


1.9 


0.8 


Brain 

(Hippocampus) 
Pool 


15.5 


10.6 


Lung ca. 
HOP-62 


0.9 


0.6 


Cerebral Cortex 
Pool 


20.7 


14.5 


Lung ca. 
NCI-H522 


0.1 


0.1 


Brain (Substantia 
nigra) Pool 


18.8 


9.9 


Liver 




0.8 


0.5 


Brain 

(Thalamus) Pool 


15.9 




10.4 


Fetal Liver 


37.1 


22.4 


Brain (whole) 


20.3 


r _j±i__ 


Liver ca. 
HepG2 


1.8 


0.8 


Spinal Cord Pool 


14.8 


9.0 


Kidney Pool 


18.7 | 14.9 


Adrenal Gland | 5.6 


3.8 


Fetal Kidney 


6.2 


6.0 


Pituitary gland 
Pool 


1.6 


1.0 


Renal ca. 
786-0 


0.3 


0.1 


Salivary Gland 


15.3 


8.6 


Renal ca. 
A498 


1.8 


1.1 


Thyroid (female) 


11.0 


11.4 


Renal ca. 
ACHN 


4.3 


1.4 


Pancreatic ca. 
CAPAN2 


0.3 


0.2 


Renal ca. 
UO-31 


9.3 


2.5 


Pancreas Pool 


16.7 


16.5 



Table CBE . Panel 4. ID 



Tissue Name 


Rel. Exp.(%) 
Ag3581, Run 
169910402 


Tissue Name 


Rel. Exp.(%) 
Ag3581, Run I 
169910402 


Secondary Thl act 


36.6 


HUVEC IL-lbeta 


4.0 


Secondary rh2 act 


38.2 


HUVEC I FN gamma 


19.5 


Secondary Trl act 


45 ! HUVEC TNF alpha i 
;1FN gamma 


5.3 


Secondary Thl rest 


HUVEC 'l\l alpha i 
' 111 


— 7T~1 


Secondary Hi 2 rest 


58.2 III \U II -II 


:::::ii5 ; i 


Secondary Trl rest 


^ ~ jLung Microvascular EC 
Jnone 


27.9 1 

1 


Primary Thl act 


_ Lung Microvascular EC 
TNFalpha + IL-lbeta 


28.1 


Primary Th2 act 


. 0 , Microvascular Dermal 
48.6 L_ 

1 ( none 


7.9 


Primary Trl act 


, iMicrosvasular Dermal 

LC I Nl'alpha i lL-lbeta 


10.6 


Primary fhl rest 


90.8 Bronchial epithelium 


6.9 
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z 


TNFalpha + ILlbeta 




Primary Th2 rest 


^ l Small airway epithelium 
jnone 


... 

1.7 


Primary Trl rest 


Qft „ (Small airway epithelium 
1 Nr alpha LL-l beta 


3.2 


CD45RA CD4 
1} mphocyte act 


1 

1 5 .2 ICoronery artery SMC rest 

_„ _ 


0.3 


CD45RO CD4 
lymphocyte act 


^ j |Coronery artery SMC 
I N 1 'alpha - IL-lbeta 


0.3 


CD8 lymphocyte act 


36.3 Astrocytes rest 


0.0 


Secondary CD8 
lymphocyte rest 


„ 7 n (Astrocytes TNFalpha + 
IL-lbeta 


0.0 


Secondary CD8 
lymphocyte act 


4 1 .2 Kl 1-812 ( Basophil) rest 


100.0 


CD4 lymphocyte none 


31Q KU-8 12 (Basophil) 
jPMA/ionomycin 


25.2 


2ry Thl/Th2/Trl_anti- 
CD95 CHI 1 


?74 CCD 1106 

(Keratinocytes) none 


13.0 


LAK cells rest 

! 


CCD1 106 
40.6 |(Keratinocytes) 

TNFalpha IL-lbeta 


19.3 


LAK cells IL-2 


66.0 


Liver cirrhosis 


8.9 


LAK cells IL-2+IL-12 


28.3 


NCI-H292 none 


0.5 


LAK cells IL-2+IFN 
|gamma 


34.9 |nCI-H292 IL-4 


0.8 


iLAK cells IL-2+ IL-1 8 


32T JNCI-H292 IL-9 0.8 


LAK cells 

PMA/ionomycin 


9.3 1NCI-H292 IL-13 1.2 

1 1 


NK Cells IL-2 rest 


82.4 NCI-H292 I FN gamma \ 3.0 


Two Way MLR 3 day 


59.0 HPAECnone 


8.6 


Two Way MLR 5 day 


, g _ HPAEC 1 NF alpha + IL- 
1 beta 


7.7 


Two Way MLR 7 day 


27.4 Lung fibroblast none 


1.9 


PBMC rest 


^ j jLung fibroblast TNF 
alpha + IL-1 beta 


0.1 


PBMC PWM 


19.5 Tung fibroblast IL-4 


1.2 


PBMC PHA-L 


28.1 


Lung fibroblast IL-9 


().•? 


Ramos (B cell) none 


47.0 


1 ung fibroblast IL-13 


0.6 


Ramos (B cell) 
ionomycin 


37.4 


Lung fibroblast IFN 
gamma 


2.6 


B lymphocytes PWM 


16.4 


Dermal fibroblast 
CCD 1070 rest 


3.6 


B lymphocytes CD40L 
and IL-4 


48.6 


Dermal fibroblast 
CCD 1070 TNF alpha 


73.7 


EOL-1 dbcAMP 


0.1 


1 >ermal fibn 'blast 


3.5 
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JtHJJL-1 UDCAJVli' 

PMA/ionomycin 


i\^^u i u / u ii^-i oeia 
q q (Dermal fibroblast IFN 
jgamma 


8.3 


Dendritic cells none 


24.0 .Dermal fibroblast IL-4 


6.1 


Dendritic cells LPS 


6.5 Dermal Fibroblasts rest 


23 0 


Dendritic cells anti- 
CD40 


24.1 (Neutrophils TNFa+LPS 


2.1 


Monocytes rest 


44.8 .Neutrophils rest 


8.7 


Monocytes LPS 


10.7 (Colon 


69.3 


Vlacrop lages rest 


38.7 


Lung 


27.7 


Maetvp lages LPS 


9.5 


Thymus 


75.3 


HUVEC none 


2.9 


Kidney 


84.7 


HUVEC: starved 


3.3 






Table CBF. Panel 4D 


Tissue Name 


Rel. Exp.(%) 1 

Ag3506, Run 1 Tissue Name 
166407188 | 


Rel. Exp.(%) 
Ag3506, Run 
16640 "INS 


Secondary Thl act 


8.4 HU\E( 1 L- 1 beta 


0.4 


Secondary Th2 act 


7.2 HUVEC IFN gamma 


6.7 


Secondary Trl act 


11.4 


HUVEC TNF alpha + 
\hU gamma 


1.1 


Secondary Thl rest 


31.6 


HUVEC TNF alpha + 
IL4 


1.0 


Second, ry 1 h2 rest 


1 v7 


HUVEC IL- 11 


2.3 


Secondary Trl rest 


22.5 

. - 


Lung Microvascular EC 
nunc 


3.8 


Primary Th 1 act 


~ g Lung Microvascular EC 
p'NFalpha -ML- 1 beta 


4.0 


Primary Th2 act 


, . , i Microvascular Dermal 
14.6 L_ 

(EC none 


3.4 


Primary Trl act 


. . iMicrosvasular Dermal 

EC I NI alpha - IL-lbeta 


2.8 


Primary Th 1 rest 


69.7 


Bronchial epithelium 
TNFalpha + ILlbeta 


0.9 


Primary Th2 rest 


44.1 


Small airway epithelium 
none 


0.2 


Primary Trl rest 


28.9 


Small airway epithelium 
1 Nl alpha + II -Ibeta 


1.2 


CD45RA ( 1)4 
htuph.K\T L act 


3.2 


Coronery artery SMC rest 


0.0 


CD45ROCD4 
lymphocyte act 


12.3 


Coronery artery SMC 
TNFalpha + IL-lbeta 


0.3 
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CDS lymphocyte act- 
Secondary CD8 
lymphocyte rest 


8.7 fAstrocytes rest 
p - Astrocytes TNFalpha + 
jlL-lbeta 


—™ 

0.1 


Secondary CD8 
lymphocyte act 


12.0 JKU-812 (Basophil) rest 


18.3 


CD4 lymphocyte none 


9.6 


KU-812 (Basophil) 
PMA/ionomycin 


14.9 


2ry Thl/Th2/Tr ^anti- 
CD 1 > 5 CH11 


23.8 


CCD1106 

(Keratinocytes) none 


2.0 


LAK cells rest 


6.8 


CCD1106 
(Keratinocytes) 
TNFalpha + IL-lbeta 


11.7 




21.2 Liver cirrhosis 


5 4 


1 A W r-pllc II 9-f-lT I 0 


8.5 iLupus kidney 




T AK" fpllc TT 9+TPM 

gamma 


12.2 JNCI-H292 none 


— — -~ 

0.2 


1 Ak' ivlk II ■>+ II In 
L , V i\- L/CliS I Li.' 1 L I O 


12.0 NCI-H292 IL-4 


(X2 -~ 


L/\Fv CcIlS 

PM \ i( homy- in 


1.8 1NCI-H292 IL-9 


0.2 


JNls. cells il-z rest 

— — . 


; '— ^ — ^ 

16.0 |NCI-H292 IL-13 


0.0 


i wo vv .iy iv ilk j> a ay 


16.6 NC1-H292 IFN gamma 


— _ = 


1 WO VV ,3V IV ILK j day 


7.2 'HPAEC none 


2.9 


Two Way MLR 7 day 


Q6 HPAEC LNF alpha I II - 
II beta 


2.8 


PBMC rest 


8.5 Lung fibroblast none 


0.5 


PBMC PWM 


, - Lung fibroblast TNF 
|alpha + IL-l beta 


0.6 


PBMC PHA-L 


2.8 Lung fibroblast IL-4 


0.1 


Ramos (B cell) none 


11.3 Tung fibroblast II.- 1 ) 


0.1 


Ramos (B cell) 
ionomycin 


5 .6 Lung fibroblast IL- 1 3 


0.0 


B lymphocytes PWM 


g 2 Lung fibroblast IFN 
gamma 


0.5 


15 KmphoevtesC O401 
and IL-4 


. _ _ jDermal fibroblast 1 . 

ICCD1070rest 1-1 


EOL-1 dbcAMP 


0.1 


Dermal fibroblast J _ 1 „ 
CCD] 070 "INF alpha j 


JbUL-I QDCAMr 
I'M \ ionomycin 


0.0 


Dermal fibroblast . n 
CCD1 070 IL-1 beta 


Dendritic cells none 


4.3 


Dermal fibroblast IFN 
gamma 


2.1 


Dendritic cells LPS 


M 


D rmai fibroblast IL-4 


1.8 


Dendritic cells anti- 
CD40 


4.6 


IBD Colitis 2 


2.6 


Monocytes rest 


1 1 .9 


IBD Crohn's 


5.5 " 



913 



[Monocytes LPS 


1.7 


JColon 


I 100.0 


Macrophages rest 


0.3 


jLung 


1 5.9 " 


Macrop lages LPS 


1.6 


JThymus 


37.9 


HUVE( none 


()» 


(Kidney 


24.7 


Nl'M < starved 


1.7 







CNS_neurodegeneration_vl.O Summary: Ag3506/Ag3581 This panel confirms the 
expression of the CG59671-02 gene at significant levels in the brain in an independent 
group of individuals. However, no differential expression of this gene was detected 
between Alzheimer's diseased postmortem brains and those of non-demented controls in 
this experiment. 



This gene is expressed at moderate levels throughout the CNS, including in 
amygdala, substantia nigra, thalamus, cerebellum, cerebral cortex, and spinal cord as 
observed in panel 1 .4. Therefore, this gene may play a role in other central nervous system 
disorders such as, Parkinson's disease, epilepsy, multiple sclerosis, schizophrenia and 
depression 

General_screening_panel_vl.4 Summary: Ag3506/Ag3581 Two experiments produce 
results that are in very good agreement. Highest expression of the CG59671-02 gene is 
observed in samples derived from melanoma cell lines (CTs=23-35). Thus, expression of 
this gene can be used in distinguishing these samples from other samples in the panel. In 
addition, significant levels of expression of this gene are also associated with colon cancer, 
ovarian cancer, breast cancer, and lung cancer cell lines. Therefore, therapeutic modulation 
of the activity of this gene or its protein product might be beneficial in the treatment of 
these cancers. 

This gene is also expressed at low to moderate levels in a number of tissues with 
metabolic or endocrine function, including adipose, adrenal gland, gastrointestinal tract, 
pancreas, skeletal muscle and thyroid. Therefore, therapeutic modulation of the activity of 
this gene may prove useful in the treatment of endocrine/metabolically related diseases, 
such as obesity and diabetes. 

This gene is also expressed at high to moderate levels in all regions of the CNS 
examined. Please see Panel CNS_neurodegeneration_vl.O for discussion of utility of this 
gene in the central nervous system. 



914 



Panel 4.1D Summary: Ag3581 Highest expression of the CG59671-02 gene is observed 
in the resting KU-812 sample (CT=29.18). In addition, high expression of this gene is 
detected in colon, lung, thymus and kidney. Therefore, therapies designed with the protein 
encoded for by this gene could be important in the treatment of inflammatory or 
5 autoimmune diseases that affect the kidney, lung and kidney including, asthma, allergies, 
lupus and glomerulonephritis. Expression of this gene is decreased in colon samples from 
patients with IBD colitis and Crohn's disease relative to normal colon. Therefore, 
therapeutic modulation of the activity of the protein encoded by this gene may also be 
useful in the treatment of inflammatory bowel disease. 

10 Expression of this gene appears to be down-regulated in activated primary and 

secondary Thl, Th2, and Trl cells. Also, TNF alpha stimulates the expression of this gene 
in resting dermal fibroblasts. Therefore, therapeutics designed with the protein encoded by 
this transcript could be important in regulating T cell function and treating diseases such as 
asthma, arthritis, psoriasis, IBD, and systemic lupus erythematosus. 

15 Panel 4D Summary: Ag3506 Highest expression of CG59671-02 is observed colon 
sample (CT=27.3). Overall, the expression pattern using this probe is in excellent 
agreement with results in panel 4. ID for Ag3581. Please see that panel for discussion of 
utility of this gene in inflammation. 

CC. CG56870-01: NDR3 

20 Expression of gene CG56870-01 was assessed using the primer-probe set Ag2075, 

described in Table CCA. Results of the RTQ-PCR runs are shown in Tables CCB, CCC, 
CCD and CCE. Please note that CG56870-02 represents a full-length physical clone of the 
CG56870-01 gene, validating the prediction of the gene sequence. 



Table CCA . Probe Name Ag2075 



Primers 


Sequences 


Length 


Start 
Position 


SEQID 
NO: 


I or ward 


5 ' -catggatgaacttcaggatgtt-3 ' 


22 


70 


647 


Probe 


TET-5 1 -cagctcacagagatcaaaccacttct-3 1 - 
TAMRA 


26 


92 


648 


Reverse 


5 ' - tgacagtcaaagtcctggaagt-3 1 


22 


141 


649 



25 Table CCB . Panel 1.3D 
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Tissue Name 


Rel. Exp.(%) j 
Ag2075, Run I Tissue Name 
152355202 ( 


Rel. Exp.(%) 
AgZU /D, Kun 
152355202 


Liver adenocarcinoma 


11.9 Kidney (fetal) 


1.4 


Pancreas 


0.8 


Renal ca. 786-0 


3.5 


Pancreatic ca. CAPAN 1 
2 


1.6 


Renal ca. A498 


12.9 


Adrenal gland 


2.4 


Renal ca.RXI 393 


1.5 


Thyroid 


1.8 


Renal ca. ACHN 


1.7 


Salivary gland 


1.2 


Kenai ca. uvj-j i 


6.8 


Pituitary gland 


2.4 


Renal ca. TK-10 


2.8 


Brain (fetal) 


3.2 


Liver 


0.4 


Brain (whole) 


25.2 


1 iver ( fetal i 


0.8 


Brain (amygdala) 


14.1 


Liver ca. 

(hepatoblast) HepG2 


6.7 


Brain (ccreHcilutn t 


10.8 


Lung 


2.1 


Brain (hippocampus) 1 39.8 


Lung (fetal) 


3.2 


Brain (substantia nigra) 


2.7 


Lung ca. (small cell) 
LX-1 


4.1 




Brain (thalamus) 


12.4 


Lung ca. (small cell) 
NCI-H69 


3.8 


Cerebral Cortex 


100.0 


Lung ca. (s.cell var.) 
SHP-77 


3.8 


Spinal cord 


, , JLung ca. (large 
cetl)NCl-H460 


0.6 


glio/astro U87-MG 


6.4 


Lung ca. (non-sm. 
cell) A549 


3.0 


glio/astroU-118-MG 


10.4 


Lung ca. (non-s.cell) 
NCI-H23 




astrocytoma SW1783 


5.1 


Lung ca. (non-s.cell) 
HOP-62 


3.1 


neuro*; met SK-N-AS 


6.2 


Lung ca. (non-s.cl) 
NCI-H522 


6.3 


astrocytoma SF-539 


5.4 


Lung ca. (squam.) 
SW 900 


1.4 


astrocytoma SNB-75 


6.7 


Lung ca. (squam.) 
NCI-H596 


1.4 


glioma SNB-19 


3.1 


Mammary gland 


2 - 4 


glioma U251 


2.0 


Breast ca.* (pl.ef) 
MCF-7 


1.8 


glioma SF-295 


3.4 


Breast ca.* (pl.ef) 
MDA-MB-231 


11.6 


Heart (fetal) 


6.7 


Breast ca.* (pl.ef) 
T47D 


9.0 


Heart 


1.4 


Breast ca. BT-549 


4.8 
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Skelciai muscle (fetal) 


1 8.0 Breast ca. MDA-N 


4.2 


Skeletal muscle 


0.7 jOvary 


13.4 


Bone marrow 


0 q Ovarian ca. 

fOVCAR-3 


1.8 


Thymus 


0 q Ovarian ca. 

JOVCAR-4 


1.6 


Spleen 




- _ lOvarian ca. 

jOVCAR-5 


- | 

1.8 


Lymph itiode 


1.9 


Ovarian ca. 
OVCAR-8 




4.9 


Colorectal 


3.3 


Ovarian ca. IGROV- 
1 


1.6 


Stomach 


2.7 


Ovarian ca.* 
(ascites) SK-OV-3 


5.8 


Small intestine 


2.5 


Uterus 


2.4 


Colon ca SW480 


10.4 


Placenta 


1.2 


Colon ca.* 
SW620($W480 met) 


3.7 


Prostate 


6.0 


Colon ca. HT29 


4.6 


Prostate ca.* (bone 1 . _ 
met)PC-3 


Coloi ci I IC 1-1 16 


3.6 


Testis 


17 


Colon ca. CaCo-2 


10.3 


Melanoma 
Hs688(A).T 


3.8 


Colon ca. 
tissue(OD03866) 


~ 9 [Melanoma* (met) 
[Hs688(B).T 


4.4 


Colon ca. HCC-2998 


2.6 


Melanoma UACC- 
62 


1.4 


Gastric ca.* (liver met) 
NCI-N87 


3.1 


Melanoma M 14 


1.8 


Bladder 


0.8 


Melanoma LOX 
IMVI 


3.3 


Trachea 


2.0 


Melanoma* (met) 
SK-MEL-5 


2.4 


Kidney 


1.3 


Adipose 


1.2 



Table C CC. Panel 2.2 



Tissue Name 


Rel. Exp.(%) 
Ag2075, Run 
174255357 


Tissue Name 


Rel. Exp.(%) 
Ag2075, Run 
174255357 


Normal Colon 


27.7 


Kidney Margin 
(OD04348) 


46.3 


Colon cancer 
(01)06064) 


52.9 


Kidney malignant 
cancer (OD06204B) 




19.2 


Colon Margin 


30.1 


Kidney normal 


14.3 
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2 



(OD06C64) 


jadjacent tissue 
|(OD06204E) 




Colon cancer 
(OD06159) 


„ , Kidney Cancer 
|(OD04450-01) 


66.0 


Colon Margin 
(OD06159)" 


?1 „ jKidney Margin 
j(OD04450-03) 


19.8 


Colon cancer 
(OD06297-04) 


99 _ jKidney Cancer 
|8120613 


3.0 


Colon Margin 
(OD06297-05) 


41 o iKidney Margin 
18120614 


11.0 


CC Gr.2 ascend colon 
(ODG3921) 


4.0 


Kidney Cancer 
9010320 


6.8 


CC Margin (OD03921) 


6.7 


Kidney Margin 
9010321 


9.7 


Colon cancer metastasis 
(OD06104) 


1 1 1 Kidney Cancer 
(8120607 


35.6 


Lung Margin 
(OD06104) 


47 fi jKidney Margin 
18120608 


13.7 


Colon mets to lung 
(OD04451-01) 


17.8. jNormal Uterus 


59.5 


Lung Margin 
(< ! ! ^- *-i 5 1 -02> 


9.3 juterine Cancer 0640 1 1 


9.7 


Normal Prostate 


51.4 Normal Thyroid 


8.8 


Prostate Cancer 
(OD04410) 


? , n jThyroid Cancer 
1064010 


11.2 


Prostate Margin 
(OD04410) 


19.3 


Thyroid Cancer 
A302152 


17.9 


Normal Ovary 


22.8 


Thyroid Margin 
A302153 


5.5 


Ovarian cancer 
(OD06283-03) 


9.9 Normal Breast 

1 


33.2 


Ovarian Margin 
(OD06283-07) 


1 7 1 [Breast Cancer 
.(OD04566) 


8.5 


Ovarian Cancer 064008 


18.0 


l!iv;i-4 Cancer 1(124 


36T ' 


Ovarian cancer 
(OD06145) 


6.6 


Breast Cancer 
(OD04590-01) 


18.4 


Ovarian Margin 


1 ? , Breast Cancer Mets 
|(OD04590-03) 


31.9 


Ovarian cancer 
(OD06455-03) 


14.7 


Breast Cancer 
Metastasis (OD04655- 
05) 


45.4 


Ovarian Margin 
(OD06455-07) 


21.8 


Breast Cancer 064006 


11.5 

I 


Normal Lung 


21.9 


Breast Cancer 9100266 


20.9 


Invasive poor diff. lung 


17.6 


Breast Margin 


35.1 
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adeno (ODO4945-01 




9100265 




Lung Margin 
(ODO4945-03) 


12.2 


Breast Cancer 
A209073 


9.7 


Lung Malignant Cancer 
(OD03126) 


8.7 


Breast Margin 
A2090734 


22.2 


Lung Margin 
COD03126) 


7.4 


Breast cancer 
(OD06083) 


100.0 


Lung Cancer 
(OD05014A) 


9.9 


Breast cancer node 
metastasis (OD06083) 


63.7 


Lung Margin 
(OD05014B) 


21.8 


Normal Liver 


9.9 


Lung cancer 
(OD06081) 


5.1 


Liver Cancer 1 026 


5.6 


Lung Margin 
(OD06081) 


7.9 


Liver Cancer 1025 


13.5 


Lung Cancer 
(OD04237-01) 


17.4 

, 


Liver Cancer 6004-T 


4.8 


Lung Margin 
(OD04237-02) 


24.0 


Liver Tissue 6004-N 


9.3 


Ocular Melanoma 
Metastasis 


9.7 


Liver Cancer 6005-T 


15.7 


Ocular Melanoma 
Margin (Liver) 


■ 46 


Liver Tissue 6005-N 


20.4 


Melanoma Metastasis 


l l >.8 


1 iver Cancer 064003 


10.7 


Melanoma Margin 
(Lung) 


21.6 


Normal Bladder 


8.0 


Normal Kidney 


11.0 


Bladder Cancer 1023 


10.5 


Kidney Ca, Nuclear 
grade 2 (OD04338) 


37.6 

— „™ m ™ 


Bladder Cancer 
A302173 


17.3 


Kidney Margin 
(OD04338) 


22.1 


Normal Stomach 


37.9 


Kidney Ca Nuclear 
grade 1/2 (OD04339) 


21.6 

.„,„. 


Gastric Cancer 
9060397 


11.1 


Kidney Margin 
(OD04339) 


12.9 


Stomach Margin 
9060396 


20.6 


Kidney Ca, Clear cell 
type (OD04340) 


6.5 


Gastric Cancer 
9060395 


22.7 


Kidney Margin 
(OD04340) 


18.4 


Stomach Margin 
9060394 


36.1 


Kidney Ca, Nuclear 
grade 3 (OD04348) 


12.9 

: 


Gastric Cancer 064005 


8.1 


Table CCD. Panel 3D 


Tissue Name 


f Rel.Exp.(%)j 


Tissue Name 


Rel. Exp.(%) 



919 



j Ag2075,Run 
J 164750734 




Ag2075, Run 
164750734 


Daoy- Myoblastoma 1 6.7 | Ca Ski " Ce ^ vic f 1 ^dermoid 
carcinoma i metastasis) 


50.3 


TE671- I £ 0 IeS-2- Ovarian clear cell 
Medulloblastoma carcinoma 


13 9 


D283 Med- 
Medulloblastoma 


^ : Ramos- Stimulated with 
PM X ionomycin 6h 


2.7 


PFSK-1- Primitive 
Neuroectodermal 


j ^ ^ jRamos- Stimulated with 
P\l \ loin mi} ,m 1 -Ih 


3.3 


XF-498- CNS 


4.4 


MEG-01- Chronic 
myelogenous leukemia 
(megokaryoblast) 


3.4 


SNB-78- Glioma 


23.3 


Raji- Burkitt's lymphoma 


3.7 


SF-268- Glioblastoma 


13.6 


Daudi- Burkitt's lymphoma 


6.5 


T98G- Glioblastoma 


18.6 


U266- B-cell plasmacytoma 


8.4 


SK-N-SH- 

Neuroblastoma 

(metastasis) 


17.0 


CA46- Burkitt's lymphoma 


7.2 


SF-295- Glioblastoma 


g ^ RL- non-Hodgkin's B-cell 
lymphoma 


,,, 

3.5 


Cerebellum 


74.7 


JM1- pre-B-cell lymphoma 


3 "! 


Cerebellum 


62.4 


Jurkat- T cell leukemia 


11.6 


NCI-H292- 
Mucoepidermoid lung 
carcinoma 


45.7 


TF- 1 - Erythroleukemia 


13.4 


DMS-114- Small cell 
lung cancer 


10.9 


HUT 78- T-cell lymphoma 


— - - - 

10.0 


DMS-79- small cell 
lung cancer 


100.0 


U937- Histiocytic lymphoma 


18.2 


NCI-H146- Small cell 
lung cancer 


30.6 


KU-812- Myelogenous 
leukemia 


6.0 


NCI-H526- Small cell I 

, 57.4 

Jung cancer j 


769-P- Clear cell renal 
carcinoma 


1 1 .3 


NCI-N4 17- Small cell 
lung cancer 


15.2 


Caki-2- Clear cell renal 
carcinoma 


10.4 


NCI-H82- Small cell 
lung cancer 


36.9 


SW 839- Clear cell renal 
carcinoma 


2.2 


NCI-Ill 57- Squamous 
cell lung cancer 
(metastasis) 


51.1 


G401- Wilms' tumor 


6.2 


NCI-H1155- Large cell 
lung cancer 


26.6 


Hs7''>hr- Pancreatic 
carcinoma (LN metastasis) 


42.9 


NCI-H1299- Large cell 
lung cancer 


44.4 


CAPAN-1- Pancreatic 
adenocarcinoma (liver 
metastasis) 




5.0 



NCI-H727- Lung 
carcinoid 


47.0 


SU86.86- Pancreatic 
carcinoma (liver metastasis) 


28.1 


NCI-UMC-ll-Lung 
carcinoid 


60.3 


BxPC-3- Pancreatic 
adenocarcinoma 


6.3 


LX-1- Small cell lung 
cancer 


23.5 


HPAC- Pancreatic 
adenocarcinoma 


7.4 


Colo-205- Colon cancer 


29 5 |MIA PaCa-2- Pancreatic 

Jcarcinoma 


3.6 




KM12- Colon cancer 


24 Q jCFPAC- 1 - Pancreatic ductal 
^adenocarcinoma 


40.9 


KM20L2- Colon cancer 


8 4 PANC- 1 - Pancreatic 

epithelioid ductal carcinoma 


- 

20.9 


NCI-H7 16- Colon 
cancer 


, :T24- Bladder carcinma 
|(transitional cell) 


.3.1 


SW-48- Colon 
adenocarcinoma 


27.0 (5637- Bladder carcinoma 


11.0 


SW111 6- Colon 
adenocarcinoma 


— — , 

iHT-1 197- Bladder 
1U.U i 

(carcinoma 


9.2 


LS 174T- Colon 
adenocarcinoma 


9.9 


UM-UC-3- Bladder 
carcinma (transitional cell) 


7.2 


SW-948- Colon 
adenocarcinoma 


hi 


A204- Rhabdomyosarcoma 


7.0 


SW-480- Colon 
adenocarcinoma 


8.8 


HT- 1 080- Fibrosarcoma 


16.6 


NCI-SNU-5- Gastric 
carcinoma 


7.2 


MG-63- Osteosarcoma 


16.7 


KATO 11- Gastric 
carcinoma 


32.8 


SK-LMS-1- 

Leiomyosarcoma (vulva) 


26.4 


NCI-SNU-16- Gastric 
carcinoma 


13.6 


SJRH30- 

Rhabdomyosarcoma (met to 
bone marrow) 


16.8 


NCI-SNU-1- Gastric 
carcinoma 


j 6 0 jA43 1 - Epidermoid 
carcinoma 


9.8 


RF-1- Gastric 
adenoc; rcinoma 


2.1 WM266-4- Melanoma 


12.0 


RF-4.S- Gastric 
adenocarcinoma 


^ q |DU 145- Prostate carcinoma 
i(brain metastasis) 


0.1 


MKN-45- Gastric 
carcinoma 


28.7 


MDA-MB-468- Breast 
adenocarcinoma 


14.7 


NCI-N87- Gastric 
carcinoma 


13.3 


SCC-4- Squamous cell 
carcinoma of tongue 


0.9 


OVCAR-5- Ovarian 
carcinoma 


2.6 


SCC-9- Squamous cell 
< a t rH<ind >>t t< iLi e 


0.3 


RL95-2- Uterine 
carcinoma 


7.2 


SCC-15- Squamous cell 
carcinoma of tongue 


0.5 


Nda^ 3- Cervical 


13.4 




21.2 
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adenocarcinoma carcinoma < >f tongue 

Table CCE . Panel 4D 



Tissue Name 


Rel. Ex !>.(%) I Rel. Exp.(%) 
Ag2075,Run I Tissue Name Ag2075,Run 
152787491 I 152787491 


- -- 

S< 2 >ndar\ s hi act 


46.3 HUVEC IL-lbeta 


25.6 


Second tp 1 h2 act 


39.2 HUVEC I FN gamma 


33.0 


Secondary Trl act 


319 IHIVI.CTN1- alpha 4 
U N gamma 


13.1 


Secondary Thl rest 


n _ HI S EC l\l alpha ' 
II I 


20.7 


Secondary Th2 iv->i 


13.9 III \U II -II 


20.0 


Secondary Trl rest 


2 j g (Lung Microvascular EC 
none 


22.7 


Primary Thl act 


_ 0 1 jLung Microvascular EC 
TNFalpha + IL-lbeta 


15.0 


Primary Th2 act 


2 I Microvascular Dermal 
EC none 


26.8 


Primary Trl act 


_ iMicrosvasular Dermal 

)EC TNFalpha + IL-lbeta 


16.6 


Primary Thl rest 


84.1 


Bronchial epithelium 
TNFalpha + IL1 beta 


4.9 


Primary Th2 rest 


48.6 


Small airway epithelium 
none 


19.9 


Primary Trl rest 


39.0 


Small airway epithelium 
TNFalpha + IL-lbeta 


72.7 


CD45RA CD4 
h mphocyte act 


21.8 ICoronery artery SMC rest 


27.9 


CD45RO CD4 
lymphocyte act 


33.0 


Coronery artery SMC 
TNFalpha + IL-lbeta 


19.6 


C DS hi aphonic act 


25.5 


Astrocj tes rest 


26.4 


Secondary CD 8 
lymphocyte rest 


21.5 


Astrocytes 1 Nl alpha ■ 
IL-lbeta 


13.2 


Secondary CD8 
lymphocyte act 


29.3 |KU-812 (Basophil) rest 1 


6.3 


CD4 lymphocyte none 


12? KU-812 (Basophil) 
iPMA/ionomycin 


16.0 


2i\ !h1 lh2, hi mti- 
CD95 CH11 


25.5 


CCD 1 106 

(Keratinocytes) none 


46.0 


LAK cells rest 


26.1 


CCD1106 
(Keratinocytes) 
TNFalpha IL-lbeta 


4.3 


LAK cells TL-2 


30.6 


Liver cirrhosis 


2.7 
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LAK cells II -2 II -12 


18.2 |Lupus kidney 


* (i 


LAK cells IL-2+IFN 
gamma 


31.0 1NCI-H292 none 


76.8 


LAK cells 1L-2 - II -18 


31.2 1NCI-H292 IL-4 


94.6 


LAK cells 
PMA/ionomycin 


9.5 JNCI-H292 IL-9 


97.3 


NK Cells IL-2 rest 


37.4 


NCI-H292 IL-1 3 


59.5 


Two Way MLR 3 day 


24.0 


NCI-H292 IFN gamma 


51.8 


Two Way MLR 5 day 


23.0 


HPAEC none 


23.3 


Two Way MLR 7 day 


19.6 


HPAJ C TNF alpha + || - 
1 beta 


15.8 


PBMC rest 


11.4 


Lung fibroblast none 


184 


PBMC PWM 


?2 2 (Lung fibroblast TNF 
(alpha + IL-1 beta 


12.0 


rtSiVR... rHA-JL 


33.9 Lung fibroblast II -4 


35.8 


Ramos | P, cell) none 


19.6 Lung fibroblast IL-9 


25.5 


Ramos (B cell) 
ionomycin 


100.0 (Lung fibroblast IL-1 3 


18.7 


B lymphocytes PWM 
B l>mphoo} o i L -Jul 

„„ J TT A 

ana IL.-4 


8 j 8 Lung fibroblast IFN 
gamma 

42 , Dermal fibroblast 
CCD1070rest 


38.4 



48.0 


EOL-1 dbcAMP 


? ~ 7 jbermal fibroblast 

(CCD 1070 TNF alpha 


83.5 


r/\| i jl„ \ \ 4 r> 
tiKJL-l uOCAMr 

PMA/ionomycin 


13.5 


Dermal fibroblast 
CCD1070 II -1 beta 


13.6 


Dendritic cells none 


20.9 


Dermal fibroblast IFN 
gamma 


— — 

13.1 


Dendritic cells LPS 


11.5 


Dermal fibroblast IL-4 


36.6 


Dendritic cells anti- 
CD40 


23.2 


IBD Colitis 2 


1 8 


Monocytes rest 


19.2 


IBD Crohn's 


2.4 


Monocytes LPS 


6-5 


Colon 


26.8 


Macrophages test 


36.1 iLung 




Macn tphages LIS 


13.3 iThymus 


41.5 j 


111 \ \ < none 


3 7.6 Kidney 


" 24.3 " 1 


HUVEC starved 


58.6 





Panel 1.3D Summary: Ag2075 Highest expression of the CG56870-01 gene is detected in 
the cerebral cortex (CT=24.2). Thus expression of this gene can be used in distinguishing 
this sam ple from other samples in the panel. Furthermore, significant expression of this 
gene is observed throughout the CNS, including in amygdala, substantia nigra, thalamus, 
cerebellum, cerebral cortex, and spinal cord. The CG56870-01 gene encodes an Ndr3 
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homolog which is a putative member of Ndr family. This family consists of proteins from 
different gene families: Ndr 1 /RTP/Drg 1 /NDRG 1 , Ndr2, and Ndr3 (PFAM: IPR004142). 
NDRG1 is a cytoplasmic protein involved in stress responses, hormone responses, cell 
growth, and differentiation. Mutation of this gene was reported to be causative for 
hereditary motor and sensory neuropathy-Lorn. Recently, NDRG4, another memember of 
Ndr family, was shown to be expressed in neurons of the brain and spinal cord. Its 
expression was markedly decreased in the brain of Alzheimer's disease patient (Zhou et al., 
2001). irherefore, this gene may play a role in central nervous system disorders such as 
Alzheimer's disease, Parkinson's disease, epilepsy, multiple sclerosis, schizophrenia and 
depression. 

This gene also has moderate levels of expression in adipose, adrenal, thyroid, liver, 
heart, thyroid and skeletal muscle. Thus, this gene product may be important in the 
pathogenesis, diagnosis and/or treatment of metabolic and endocrine disease, including 
Types 1 and 2 diabetes and obesity. 

In addition, there appears to be substantial expression in other samples derived from 
breast cancer cell lines, lung cancer cell lines, renal cancer cell lines and colon cancer cell 
lines. Thus, therapeutic modulation of this gene could be of benefit in the treatment of 
breast, lung, renal or colon cancer. 

RefereEices: 

I. Zhou RH, Kokame K, Tsukamoto Y, Yutani C, Kato H, Miyata T. (2001) 
Characterization of the human NDRG gene family: a newly identified member, NDRG4, is 
specifically expressed in brain and heart. Genomics 73(l):86-97 

Ag2075 The expression of this gene appears to be highest in a sample derived from 
a normal brain tissue. In addition, there appears to be substantial expression in other 
samples derived from breast cancer cell lines, lung cancer cell lines, renal cancer cell lines 
and colon cancer cell lines. Thus, the expression of this gene could be used to distinguish 
normal brain tissue from other samples in the panel. Moreover, therapeutic modulation of 
this gene could be of benefit in the treatment of breast, lung, renal or colon cancer. 
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Panel 2,.2 Summary: Ag2075 Highest expression of CG56870-01 is detected in breast 
cancer sample (CT=29.89). Thus expression of this gene can be used in distinguishing this 
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sample from other samples in the panel. In addition, there appears to be substantial 
expression in other samples derived from breast cancers, kidney cancers and colon cancers. 
Therefore, therapeutic modulation of this could be of benefit in the treatment of breast, 
kidney or colon cancer. 

5 Panel 3D Summary: Ag2075 The expression of this gene appears to be highest in a 

sample derived from a lung cancer cell line (DMS-79)(CT=26.4). In addition, there appears 
to be substantial expression in other samples derived from pancreatic cancer cell lines, lung 
cancer cell lines, brain cancer cell lines and cervical cancer cell lines. Thus, the expression 
of this gene could be used to distinguish DMS-79 cells from other samples in the panel. 

=== 

10 Moreover, therapeutic modulation of this gene could be of benefit in the treatment of 
J pancreatic, lung, brain or cervical cancer. 

ry 

Panel 4D Summary: Ag2075 Expression of the CG56870-01 gene is ubiquitous 
q througout this panel, with highest in samples derived from ionomycin treated Ramos (B 

L cell) cells (CT=26.1). Furthermore, expression of this gene is also detected in PWM treated 

U 

yj 15 PBMC cells and PWM treated B lymphocytes. Therefore, therapeutic modulation of the 

*~1 express ion or function of this gene may reduce or eliminate the symptoms in patients with 

□ autoimmune and inflammatory diseases in which B cells play a part in the initiation or 

progression of the disease process, such as systemic lupus erythematosus, Crohn's disease, 
ulcerative colitis, multiple sclerosis, chronic obstructive pulmonary disease, asthma, 
20 emphysema, rheumatoid arthritis, or psoriasis. 

CD. CG56870-04: N-nryc downstream-regulated gene 3 

Expression of gene CG56870-04 was assessed using the primer-probe sets Ag5279 
and Ag2075, described in Tables CDA and CDB. Results of the RTQ-PCR runs are shown 
in Tables CDC, CDD, CDE, CDF, CDG, CDH and CDI. 

25 Table CPA . Probe Name Ag5279 



jPrimers Sequences 


Length 


Start 
Position 


SEQID 
NO: 


Forwards ' -aggctgtgatggcg a 


IS 873 


650 


IProbe | TET ~ 5 1 -ttcagcctgggaagttcaccgaggcc- 
] S3 ' -TAMRA 


26 


912 


651 


Reverse 5' j- CJ-jjtcatgctggcagat-3 ' 


20 


.21? 


o5: 
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Table CDB . Probe Name Ag2075 



Primers 


Sequences 


Length 


Start 
Position 


SEQID 
NO: 


Forward 


5 ' -catggatgaacttcaggatgtt-3 1 


22 


70 


653 


Probe 


TET-5 1 -cagctcacagagatcaaaccacttct-3 ' - 
IAMRA 


26 


92 




654 


Reverse 


5 ' -tgacagtcaaagtcctggaagt-3 ' 


22 


141 


.... 

655 



Table CDC . CNS_neurodegeneration_vl.O 



r— — - 
Tissue Name 


Rel. Exp.(%) Ag5279,T"~ 

Run 230512909 I ' lssue i>ame 


Rel. Exp.(%) Ag5279, 
Run 230512909 

■ 


AD 1 Hippo 


7.7 


Control (Path) 3 
Temporal Ctx 


1.0 


AD 2 Hippo 


8.4 


Control (Path) 4 
temporal Ctx 


13.2 


AD 3 Hippo 


3.4 


AD 1 Occipital 
Ctx 


7.2 


AD 4 Hippo 


2.8 


AD 2 Occipital 
Ctx (Missing) 


0.0 


AD 5 hippo 


100.0 


AD 3 Occipital 
Ctx 


2.1 


AD 6 Hippo 32.8 | AD 4 Occipital 

JCtx 


13.1 


Control 2 Hippo 


32.3 


AD 5 Occipital 

Ctx 


15.6 


Control 4 Hippo 


2.7 


AD 6 Occipital 
Ctx 


54.7 


Control (Path) 3 
Hippo 


1.2 


Control 1 Occipital 
Ctx 


1.1 


AD 1 Temporal Ctx 


6 4 jControl 2 Occipital 


81.2 


AD 2 Temporal Ctx 


. _ , jControl 3 Occipital 
jCtx 


8.1 


AD 3 Temporal Ctx 


7 , jControl 4 Occipital! 

JCtx 


2.5 


AD 4 Temporal Ctx 


8 7 Control (Path) 1 
' JOccipital Ctx 


66.0 


AD 5 Inf Temporal 
Ctx 


?9 Q IControl (Path) 2 
.Occipital Ctx 


5.6 


AD 5 SupTemporal 
Ctx 


6 4 jControl (Path) 3 
Occipital Ctx 


1.5 


AD 6 Inf Temporal 
Ctx 


^ s _ Control (Path) 4 
(Occipital Ctx 


7.6 
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AD 6 Sup Temporal 

Ctx 


24 j Control 1 Parietal 

jCtx 


2.4 


Control 1 Temporal 
Ctx 


0 n Control 2 Parietal 
Ctx 


10.7 


Control 2 Temporal 
Ctx 


. . ~ (Control 3 Parietal 
41 - 2 (Ctx 


16.6 


i^oniroi j i empordi 
Ctx 


4 fi [Control (Path) 1 
jParietal Ctx 


75.3 


Control 4 Temporal 
Ctx 


2.7 


Control (Path) 2 
Parietal Ctx 


13.7 


Control (Path) 1 
Temporal Ctx 


35.4 


Control (Path) 3 
Parietal C1\ 


1.6 


Control (Path) 2 
Temporal Ctx 


27.7 


Control (Path) 4 
Parietal Ctx 


19.3 



Table CDD . General_screening_panel_vl.5 



Tissue Name 


Rel. Exp.(%) 
Ag5279, Run 


Tissue Name 


Rel. Exp.(%) 
Ag5279, Run 
2oU5Uy99o 


Adipose 


1.3 


Renal ca. TK-10 


9.2 


Melanoma* 
Hs688(A).T 


7.6 


Bladder 


2.4 


Melanoma* 
Hs688(B). r 


8.0 


Gastric ca. (liver met.) 
NCI-N87 


8.7 


Melanoma* Ml 4 


12.2 


Gastric ca. KATO III 


21.8 


Melanoma* 
LOXIMVI 


13.8 


Colon ca. SW-948 


4.6 


Melanoma* SK- 
MEL-5 


8.2 


Colon ca. SW480 


14.1 


Squamous cell 
carcinoma SCC-4 


6.5 


Colon ca.* (SW480 
met) SW620 


10.4 


lestis Pool 


8.2 


Colon ca. HT29 


12.3 


Prostate ca.* (bone 
met) PC-3 


2.3 


Colon ca. HCT-116 


12.2 


Pix.bUte Pool 


6.2 


Colon ca. CaCo-2 


17.8 


Placenta 


3.5 


Colon cancer tissue 


4.9 


Uterus Pool 


1 I 


Colon cu. SWI 116 


2.6 


Ovarian ca. 
OVCAR-3 


8.8 


Colon ca. Colo-205 


7.3 


Ovarian ca. SK- 
OV-3 


20.4 


Colon ca. SW-48 


5.9 


Ovarian ca. 
OVCAR-4 


5.6 


Colon Pool 


5.1 


Ovarian ca. 


10.9 


Small Intestine Pool 


3.9 
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OVCAR-5 






iOvarian ca. 
[lGROV-1 


6.0 




— _ 

Stomach Pool 


2.3 


Ovarian ca. 
OVCAR-8 


5.5 


Bone Marrow Pool 


1.5 


[Ovary 


5.7 


I clal I Icart 


_ 7.5 


Breast ca. MCF-7 


5.0 




Heart Pool 


3.3 


Breast ca. MDA- 
MB-23I 


23.0 




Lymph Node Pool 


5.1 


{Breast ca. BT 549 


21.9 




Fetal Skeletal Muscle 


3.1 


iBreast ca. T47D 


10.6 




Skeletal Muscle Pool 


4.0 


Breast <.a V1DA-N 


5.S 




Spleen Pool 




jBreast Pool 


5.9 




Thymus Pool 


2.9 


Trachea 




5.0 


jCNS cancer 
j(glio/astro) U87-MG 


20.4 

„ 


Lung 






CNS cancer 
(glio/astro)U-118-MG 


16.8 


Fetal Lung 


8.7 


|CNS cancer 
j(neuro;met) SK-N-AS 


10.7 


Lung ca.NCI-N417 


3.7 


CNS cancer (astro) SF- 
539 


7.0 


Lung ca. LX-1 


MA 


CNS cancer (astro) 
SNB-75 


21.3 


Lung ca, NCI-H146 


5.8 




CNS cancer (glio) 
SNB-19 


5.8 


Lung ca. SHP-77 


7.9 




C NS cancer (glio) SI - 
295 


12.2 


Lung ca. A549 


14 1 




P>raai < \m\ gd.ila f Pool 


19.3 


Lung ca. NCI-H526 


S1 




Brain (cerebelluni ) 


100.0 


Lung ca. NCI-H23 


11.2 




Brain (fetal ) 


20.0 


Lung ca. NCI-H460 


2.9 


Brain (Hippocampus) 
IPool 


17.4 


Lung ca.HOP-62 


4.0 




Cerebral Cortex Pool 


23.7 


T una ca MPT-HS99 \ 


13.6 


(Brain (Substantia 
Jnigra) Pool 


17.9 


Liver 


0.5 




Brain (Thalamus) Pool 


34.2 


Fetal Liver 


i J 




Brain (whole) 


" 46.3 ' 


Liver ca. HcpG2 




iSpinal Cord Pool 




Kidney Pool 


7.1 




Adrenal Gland 


11.8 


Petal Kidney 


5.7 


H 


Pituitary gland Pool 


2.9 


Renal ca. 786-0 


6.3 




Salivary Gland 




Renal ca. A498 


11.0 


..... 


1 hyroid (female) 





Renal ca. ACHN 


6.0 




Pancreatic ca. 
CAPAN2 


5.6 
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;Renal_ca_. UO-31 | 8.0 Pancreas Pool 6.2 

Table CPE . Panel 1.3D 



Tissue Name 


Rel. Exp.(%) 
Ag2075, Run 
152355202 


Tissue Name 


rtei. Ji,xp.( /o ) 
Ag2075, Run 
152355202 


Liver adenocarcinoma 


11.9 


kidney (fetal) 


1.4 


Pancreas 


0.8 


Renal ca. 786-0 


3.5 


Pancreatic ca. CAP AN 
2 


1.6 


Renal ca. A498 


12.9 


Adrenal gland 


2.4 


Renal ca. RXF 393 


1.5 


[Thyroid 


1.8 iRenal ca. ACHN 


17 


Salivan gland 


1.2 


Renal ca. UO-31 


6.8 


Pituitar gland 


2.4 


Renal ca.TK-10 


2.8 


Brain I fetal) 


3.2 


Liver 


0.4 


[Brain (whole) 


_ — 

25.2 


I ivei (fetal) 


0.8 


|Brain (amygdala) 


14.1 


(hepatoblast) HepG2 


6.7 


[Brain (cerebellum) 


10.8 




2.1 


Brain (hippocampus) 


39.8 


Lung (fcttil) 


3.2 


Brain (substantia nigra) 


2.7 


Lung ca. (small cell) 
LX-1 


4.1 


Brain (thalamus) 


12.4 


Lung ca. (small cell) 
NCI-H69 


3.8 


Cerebral Cortex 


100.0 


Lung ca. (s.cell var.) 
SHP-77 


3.8 


Spinal cord 


3.5 


Lung ca. (large 
cell)NCI-H460 


0.6 


at in/a stir* T TR7-Mfl 


6.4 


Lung ca. (non-sm. 
cell) A549 


3 0 


glio/astro U-l 1 8-MG 


10.4 


Lung ca. (non-s.cell) 
NCI-H23 


7.7 


astrocytoma SW1783 


5.1 


Lung ca. (non-s.cell) 
HOP-62 


3.1 


neuro*; met SK-N-AS 


6.2 


Lung ca. (non-s.cl) 
NCI-H522 


6.3 


astrocytoma SF-539 


5.4 


Lung ca. (squam.) 
SW 900 


1.4 


astrocytoma SNB-75 


6.7 


Lung ca. (squam.) 
NCI-H596 


1.4 


glioma SNB-19 


3.1 


Mammary gland 


2.4 


glioma U25 1 


2.0 


Breast ca.* (pl.ef) 
MCF-7 


1.8 
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glioma SF-295 


- . (Breast ca.* (pl.ef) 
J jMDA-MB-231 


11.6 


Heart (fetal) 


, 7 Breast ca.* (pl.ef) 
t>J __JT47D 


9.0 


Heart 


1.4 (Breast ca. BT-549 


4.8 


Skeletal muscle (fetal) 


1RO Rrpaot ra lVmA-W 
lo.u ixjlcaal td.. 1VIJ_7/V i > 


4 2 


Skeletal muscle 


0.7 jOvary 


13.4 


Bone marrow 


n „ lOvarian ca. 

(OVCAR-3 


1.8 


Thymus; 


„ g lOvarian ca. 

(OVCAR-4 


1.6 


Spleen 


_ „ lOvarian ca. 

pVCAR-5 


1.8 


Lymph node 


1.9 


Ovarian ca. 
OVCAR-8 


4.9 


Colorectal 


3.3 


Ovarian ca. IGROV- 
1 


1.6 


Stomach 


2.7 


Ovarian ca.* 
(ascites) SK-OV-3 


5.8 


Small intestine 


2.5 


Uterus 


2.4 


Colon ca. SW480 


10.4 


Placenta 


1.2 


Colon ca.* 
SW620(SW480 met) 


3.7 


Prostate 


6.0 


Colon ca. HT29 


„ , IProstate ca.* (bone „ ^ 
4 - 6 Anr , 4.5 
iinct)l > C-3 I 


Colon , 1 k 1 - i 16 3.6 jTestis 1.7 


Colon ca. CaCo-2 


10.3 


Melanoma 
Hs688(A).T 


3.8 


Colon ca. 
tissue(OD03866) 


3.2 


Melanoma* (met) 
Hs688(B).T 


4.4 


Colon ca. HCC-2998 


2.6 


Melanoma UACC- 
62 


1.4 


Gastric ca.* (liver met) 
NCI-N87 


3.1 


Melanoma Ml 4 


1.8 


Bladder 


0.8 


Melanoma LOX 
IMVI 


3.3 


Tracheal 


2.0 


Melanoma* (met) 
SK-MEL-5 


2.4 

i 


Kidney 


1.3 


Adipose 


..1,2 


Table CDF. Panel 2.2 


Tissue Name 


Rel. Exp.(%) 
Ag2075, Run 
174255357 


Tissue Name 


Rel. Exp.(%) 
Ag2075, Run 
174255357 
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Normal Colon 


27 7 jKidney Margin 
(OD04348) 


46.3 


Colon cancer 
(OD06064) 


52 n jKidney malignant 

jcancer (OD06204B) 


19.2 


Colon Margin 
(OD06064) 


jKidney normal 
30.1 jadjacent tissue 
((OD06204E) 


14.3 


Colon cancer 
(OD06159) 


, ^ jKidney Cancer 
D " .(OD04450-01) 


66.0 


Colon Margin 
(OD06159) 


, n (Kidney Margin 
|(OD04450-03) 


19.8 


Colon cancer 
(OD06297-04) 


? Kidney Cancer 
8 1 2061 3 


3.0 


Colon Margin 
(OD06297-05) 


Kidney Margin 
41-5 8120614 


11.0 


CC Gr.2 ascend colon 
(OD03921) 


. „ jKidney Cancer 
19010320 


6.8 


CC Margin (OD03921) 


, „ jKidney Margin 
9010321 


9.7 


Colon cancer metastasis 

(C.)UU<> 1 U4 I 


. 1 . jKidney Cancer 
b IzUoU / 


35.6 


Lung Margin 

/Vm^a^i f\A\ 
(UL>UoIU4) 


. ? r jKidney Margin 


13.7 


( ol( n mets to lung 
iTinn/t/i^i ^1^ 

[L>IJU £ *H J l-vl) 


17.8 "Normal Uterus 

1 


59.5 


Lung Margin 

[KJLJU'iHj l -UZ ) 


9.3 Uterine Cancer 0640 1 1 


9.7 


Normal Prostate 


5 1 .4 Normal Thyroid 


8.8 


Prostate Cancer 
(OD04410) 


? „ „ jThyroid Cancer 
(064010 


11.2 


Prostate Margin 


. Q - Thyroid Cancer 
A302152 


17.9 


Normal Ovary 


?? „ (Thyroid Margin 
8 IA302153 


5.5 


Ovarian cancer 


9.9 j Normal Breast 


33.2 


Ovarian Margin 
(OD06283-07) 


17.1 


Breast Cancer 
K >D04566) 


8.5 


Oval ian C dicer 064008 


18.0 


Breast Cancer 1024 


36.1 


Ovarian cancer 
(OD06145) 


6.6 


Breast Cancer 
(OD04590-01) 


18.4 


Ovarian Margin 
(OD06I45) " 


_ jBreast Cancer Mets 
(OD04590-03) 


31.9 


Ovarian cancer 
(OD06455-03) 


jBreast Cancer 
14.7 (Metastasis (OD04655- 
|05) 


45.4 
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Ovarian Margin 
(OD06455-07) 


21.8 


Breast Cancer 064006 


11.5 


Normal Lung 


21.9 


Breast Cancer 9100266 


20.9 


Invasive poor diff. lung 
adeno (ODO4945-01 


17.6 


Breast Margin 
9100265 


35.1 


Lung Margin 
(ODO4945-03) 


12.2 


Breast Cancer 
A209073 


9.7 


Lung Malignant Cancer 
(OD03126) 


8.7 


Breast Margin 
A2090734 


22.2 


Lung Margin 
(OD03126) 


7.4 


Breast cancer 
(OD06083) 


100.0 


Lung Cancer 
(OD05014A) 


9.9 


Breast cancer node 
metastasis (OD06083) 


63.7 


Lung Margin 
(OD05014B) 


21.8 

— — — — 


Normal Liver 


9.9 


Lung cancer 
(OD06081) 


5.1 


Liver Cancer 1026 


5.6 


Lung Margin 
(OD06C81) 


7.9 


Liver Cancer 1025 


13.5 


Lung Cancer 

{KJUVJtZJ /-Ulj 


17.4 


Liver Cancer 6004-T 


r . 

4.8 


Lung Margin 

\\JIJ\J'+Zj 1 -UZ ) 


24.0 




Liver Tissue 6004-N 


9.3 


Ocular iVlelanoma 
Metastasis 


9.7 

— 


Liver Cancer 6005-T 


15.7 


Ocular iVlelanoma 
Margin (Liver) 


4.6 


Liver Tissue 6005-N 


20.4 


Melanoma Metastasis 




Liver cancer ut>4Uuj 


10.7 


Melanoma Margin 
(Lung) 


21.6 


Normal Bladder 


8.0 


Normal Kidney 


11.0 


Bladder Cancer 1023 


10.5 


Kidney Ca, Nuclear 
grade 2 (OD04338) 


37.6 


Bladder Cancer 
A302173 


17.3 


Kidney Margin 
(OD04338) 


22.1 


Normal Stomach 


37.9 


Kidney Ca Nuclear 
grade 1/2 (OD04339) 


21.6 


Gastric Cancer 
9060397 


11.1 


Kidney Margin 
(OD04339) 


12.9 


Stomach Margin 
9060396 


20.6 


Kidney Ca, Clear cell 
type (OD04340) 


6.5 


Gastric Cancer 
9060395 


22.7 ] 


Kidney Margin 
(OD04340) 


18.4 


Stomach Margin 
9060394 


36.1 


Kidney Ca, Nuclear 
grade 3 (OD04348) 


12.9 


Gastric Cancer 064005 


8.1 
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Table CDG . Panel 3D 



Tissue Name 


Rel. Exp.(%) 
Ag2075, Run 
164750734 


..... , — ...... , , 

Tissue Name 


Rel. Exp.(%) 
Ag2075, Run 
104750734 


Daoy- Medulloblastoma 


6 7 


Ca Ski- Cerv ical epidermoid 
carcinoma (metastasis) 


50 3 


TE671- 

Medulloblastoma 


9.0 


ES-2- Ovarian clear cell 
carcinoma 


13.9 


D2S3 Med- 
M«_Jii!1<»Mjs1< ma 


35.4 


Ramos- Stimulated with 
PMA/ionomycin 6h 


2.7 


PI SK I - Primitive 
Neuroectodermal 


15.4 


Ramos- Stimulated with 
PM \ iom . ivcin 14h 


33 


XF-498- CNS 

— — — — - - , 


4.4 


MEG-01- Chronic 
myelogenous leukemia 
(megokaryoblast) 


3.4 


SNB-78 - Glioma 


23.3 Raji- Burkitt's lymphoma 


3.7 


SF-268- Glioblastoma 


13.6 jDaudi- Burkitt's lymphoma 


6.5 


T98G Glioblastoma 


18.6 :U266- B-cell plasmacytoma 


8.4 


SK-N-SH- 

Neuroblastoma 
(metastasis) 


17.0 


CA46- Burkitt's lymphoma 


7.2 


SF-295-- Glioblastoma 


8.4 


RL- non-Hodgkin's B-cell 
lymphoma 


3.5 


Cerebellum 


74.7 


JM1- prc-B-cell lymphoma 


3.1 


Cerebellum 


62 1 


Jurkat- T cell leukemia 




NCI-H292- 
Mucoepidermoid lung 
carcinoma 


45.7 


TF-1- Erythroleukemia 


13.4 


DMS-1 14- Small cell 
lung cancer 


10.9 


HUT 78- T-cell lymphoma 


10.0 


DMS-79- Small cell 
lung cancer 


100.0 


U937- Histiocytic lymphoma 


18.2 


NCI-H146- Small cell 
lung cancer 


g jKU-8 1 2- Myelogenous 
J ' leukemia 


6.0 


NCI-H526- Small cell 
lung cancer 


5? 4 J69-P- Clear cell renal 
carcinoma 


.,.3 

— j 


NC1-N417- Small cell 
lung cancer 


15.2 


Caki-2- Clear cell renal 
carcinoma 


10.4 


NCI-H82- Small cell 
lung cancer 


36.9 


SW 839- Clear cell renal 
carcinoma 


2.2 


NC1-1 1 157- Squamous 
cell lung cancer 
(metastasis) 


51.1 


G401- Wilms' tumor 


6.2 


NCI-IFI 155- I argecell 


26.6 


1 ls7()6T- Pancreatic 


42 9 
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lung cancer 




uiininnr;] (1 N metastasi: ) 




NCI-H1299- Large cell 
lung cancer 


44.4 


C A.PAN I - Pancreatic 
adenocarcinoma (liver 
metastasis) 


— — — 
5.0 


NCI-H727- Lung 
carcinoid 


47 Q 1SU86.86- Pancreatic 

[carcinoma (liver metastasis) 


28.1 


NCI-UMC-ll-Lung 
carcinoid 


60.3 


BxPC-3- Pancreatic 
adenocarcinoma 


6.3 


LX-1- Small cell lung 
cancer 


23.5 


HPAC- Pancreatic 
adenocarcinoma 


7.4 


Colo-205- Colon cancer 


29.5 


MIA PaCa-2- Pancreatic 
carcinoma 


3.6 


KM 12- Colon cancer 


24.0 


CFPAC-1- Pancreatic ductal 
adenocarcinoma 


40.9 


KM20L2- Colon cancer 


8.4 


PANC-1- Pancreatic 
epithelioid ductal carcinoma 


20.9 


NCI-H716- Colon 
cancer 


23.3 


T24- Bladder carcinma 
(transitional cell) 


13.1 


SW-48- Colon 
adenocarcinoma 


27.0 


5637- Bladder carcinoma 


11.0 


SW1116- Colon 
adenocarcinoma 


10.0 


HT-1 l"7- Bladder 
carcinoma 


9.2 


LS 174T- Colon 
adenocarcinoma 


9.9 


UM-UC-3- Bladder 
carcinma (transitional cell) 


7.2 


SW-948- Colon 
adenocarcinoma 


1.1 


A204- Rhabdomyosarcoma 


7.0 


SW-480- Colon 
adenocarcinoma 


8.8 


HT-1 080- Fibrosarcoma 


16.6 


NCI-SNU-5- Gastric 
carcinoma 


7.2 MG-63- Osteosarcoma 


— 

16.7 


KATO III- Gastric 
carcinoma 


32.8 


SK-LMS-1- 

Leiomyosarcoma (vulva) 


26.4 


NCI-SNU-16- Gastric 
carcinoma 


13.6 


SJRH30- 

Rhabdomyosarcoma (met to 
bone marrow) 


16.8 


N'CI-SM -!- Gastric 
carcinoma 


^ q 1A43 1 - Epidermoid 
(carcinoma 


9.8 


RF-l-Grastric 
3-dc no c 3.rc inonici 


2.1 WM266-4- Melanoma 




12.0 


RT-48- Gastric 
adciKX urcinoma 


4.0 


DU 145- Prostate carcinoma 
(brain metastasis) 


0-1 


MKN-45- Gastric 
carcinoma 


28.7 


MDA-MB-468- Breast 
adenocarcinoma 


1 

14.7 


NCI-N87- Gastric 
carcinoma 


13.3 


SCC-4- Squamous cell 
carcinoma of tongue 


0.9 


OVCAR-5- Ovarian 


« .... 


SCC-9- Squamous cell 


0.3 
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carcinoma 


[carcinoma of tongue 






RL95-2- Uterine 
carcinoma 


f 1 
7.2 j 


5CC-15- Squamous cell 
carcinoma of tongue 


- 

0.5 


HelaS3-- Cervical 
adenocarcinoma 


! .3.4 


CAL 27- Squamous cell 
carcinoma of tongue 


21.2 


Table CDH. Panel 4. ID 


Tissue Name 


Rel. Exp.(%) 
Ag5279, Run 
230472927 


Tissue Name 




Rel. Exp.(%) 
Ag5279, Run 
23(>4"'2«)27 


Secondary Th 1 act 


59.0 


HUVEC IL-lbeta 


_ 


II - 


Secondary Th2 act 


91.4 


HI \ 1 (' 11 \ LMiiima 




_ 71.7 


Secondary Trl act 


27.5 




HUVEC TNF alpha + 
I FN gamma 




17.4 


Secondary Thl rest 


7.5 


HUVEC TNF alpha + 
11 \ 


20.7 


Sceond-.irv Th2 rest 


8.4 


W \ i ' II -1 ! 


32.3 


Secondary Trl rest 


2.6 


Lung Microvascular EC 
mone 




Primary Thl act 


26.8 


Lung Microvascular EC 
1 NFalpha • IL-lbeta 


14.6 


Primary Th2 act 


55.1 


Microvascular Dermal 
EC none 


10.9 

... 


Primary Trl act 


■ — 


Microsvasular Dermal 
:EC TNFalpha + IL-lbeta 


13.2 


Primary Thl rest 


3.2 


Bronchial epithelium 
TNFalpha - ILIbUa 


17.4 


Primary Th2 rest 


13.7 


Small airway epithelium 
hone 


18.8 


Primary Trl rest 


7.6 


Small airway epithelium 
TNFalpha + IL-lbeta 


56.3 


CD45RA CD4 
lymphocyte act 




31.9 


Coronery artery SMC rest 


31.9 


CD45RO CD4 
lymphocyte act 
t DS K influx. \\c at l 
Secondary CD8 
l> mphoc} te rest 


48.3 
^1.2 
36.1 


Coronery artery SMC 
riNFaipb^ + II -1 beta 
Astrocytes rest 
Astrocytes TNFalpha + 
IL-lbeta 




37.1 

28.3 
15.8 


Secondary CD8 
!> mphocvle act 


8.0 


KU-812 (Basophil) rest 


15.2 


CD4 lymphocyte none ; 


8.4 


KU-812 (Basophil) 
PMA/ionomycin 


17.0 


2ry Thl/Th2/Trl anti- 
CD95 CHI 1 


8.9 


CCD 1106 

(Keratinocytes) none 


71.2 
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" 1 

LAK cells rest 


)CCD1106 
26.1 f (Keratinocytes) 

jTNFalpha + IL-lbeia 


42.0 1 


LAK ceils IL-2 


16.3 Liver cirrhosis 


15.5 


LAK cells IL-2+IL-12 


2.0 


NCI-H292 none 


45.4 


LAK cells IL-2+IFN 


12.5 


NCI-H292 IL-4 


63.7 


1 AK cells IL-2+ IL-18 


7.5 


NCI-H292 1L-9 


70.7 


LAK cells 
PMA/ionomycin 


21.9 


NCI-H292 IL-13 


99.3 


NK Cells IL-2 rest 


44 4 


NCI-H292 1FN gamma 


45.4 j 


Two Way MLR 3 day 


21.9 


HPAEC none 


25.9 | 


Two Way MLR 5 day 


13.5 


I IP \T( T\IF nlnlvi ; 1 1 - 

1 beta 


34.9 


I wo Wa\ Ml R 7 das 


1 1.3 


Lung fibroblast none 


____ 


PBMC l est 


11.0 


Lung fibroblast TNF 

dtt-*Vio _4_ TT 1 l-w^f o 


21.2 


PBMC PWM 


4.5 


Ltitu- fibroblast U -4 


31.6 


irJtiML, JriA-JL 


18.4 


I li r. Iihi tniast II -Q 




Ramos (B cell) none 


14.0 


Lung fibroblast IL- 1 3 


9.0 


Ramos (B cell) 
iionomycin 


42.9 


; .ii" libroblast IFN 

gamma 


"" , " " ,r 1 

58.6 


B lymphocytes PWM 


18.9 


Dermal fibroblast 
prr>i r>7n rpit 

OUL/lU/U rct>L 


43.2 


B lymphocytes CD40L 
|and IL-4 


31.0 


Dermal fibroblast 
CCD 1070 TNF alpha 


100.0 


EOL-1 dbcAMP 


37.9 


Dermal fibroblast 
CCD1070IL-1 beta 


23.5 

. _ — 


E( >L-I dbcAMP 
j PM A/io nomycin 




Dermal fibroblast IFN 
gamma 


31.4 


Dendritic cells none 




Dermal fibroblast IL-4 


45.4 


Dendritic cells LPS 


132 


Dermal Fibroblasts rest 


61.6 


jDendritic cells anti- 

1CD40 


17.0 


Neutrophils TNFa+LPS 


4.2 


Monocytes rest 


1 1.6 


Neutrophils rest 


31.4 


Mi >n> >c\ ts ] PS 


-— 8 5 - - 


Colon 


1.0 


Macrophages rest 


12 9 


Lung 


1.6 


Macrophages LPS 


"""2 6 


Thymus 


I "i'js 


ll('\T( none 


36.1 


Kidney 




llliVFC starved 


36.9 






Table CDI. Panel 4D 


j Tissue Name 


Rel. Exp.(%) 


Tissue Name 


| Rel. Exp.(%) 
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Ag2075, Run 
152787491 




152787491 


Secondary Thl act 


46.3 JHUVEC IL-lbeta 


25.0 


Seconds.ry Th2 act 


39.2 -HUVEC IFN gamma 


33.0 


Secondary Trl act 


319 HI iVECTNF alpha + 
(IFN gamma 


13.1 


Secondsiry Thl rest 


11.7 


HUVEC TNF alpha + 
IL4 


20.7 


Second; ry Th2 rest 


13.9 


HUVEC IL-il 


20.0 


Secondary Trl rest 


21.6 


Lung Microvascular EC 
none 


22.7 


Primary Thl act 


~ n . iLung Microvascular EC 
,TNFalpha + IL-lbeta 


15.0 


Primary Th2 act 


2p 2 [Microvascular Dermal 
EC none 


26.8 


Primary Trl act 


54.7 


Microsvasular Dermal 
EC TNFalpha + IL-lbeta 


16.6 


Primary Thl rest 


84.1 


Bronchial epithelium 
TNFalpha + IL1 beta 


4.9 


Primary Th2 rest 


48.6 


Small airway epithelium 
none 


19.9 


Primary Trl rest 


39.0 


Small airway epithelium 
TNFalpha + IL-lbeta 


72.7 


CD45RACD4 
l\ niphocyte act 


21.8 jCoronery artery SMC rest 


27.9 


CD45RO CD4 
lymphocyte act 


33.0 


Coronery artery SMC 
TNFalpha + IL-lbeta 


19.6 


jCD8 lymphocyte act 


25.5 


Astrocytes rest 


26.4 


JSecondary CD8 
lymphocyte rest 


21.5 


Astrocytes TNFalpha + 
IL-lbeta 


13.2 


Secondary CD8 
lymphocyte act 


29.3 


KU-812 (Basophil) rest 


6.3 


|cD4 lymphocyte none 


12.7 


KU-812 (Basophil) 
PMA/ionomycin 


16.0 


2ry Thl/Th2/Trl_anti- 

J/t^Q'n rm i 
|CUy.) Lxll 1 


25 5 (CCD 1106 

I(Keratinocytes) none 


46.0 


LAK cells rest 


26.1 


CCD1106 
(Keratinocytes) 
TNFalpha + IL-lbeta 


4.3 


LAK cells IL-2 


30.6 


Liver cirrhosis 




LAK cells IL-2+IL-12 


18.2 


Lupus kidne\ 


30 


L AK cells 1L-2+IFN 
I gamma 


31.0 (NCI-H292 none 


76.8 


'LAK cells IL-2+IL-18 


31.2 |NCI-H292 IL-4 


94.6 


|LAK cells 


9.5 JNCI-H292 IL-9 


97.:. 



937 



PMA/ionomycin 


| " 




NK Cells IL-2 rest 


37.4 'NCI-H292 IL-13 


59.5 


Two Way MLR 3 day 


24.0 |NCI-H292 I FN gamma 


51.8 


Two Way MLR 5 day \ 


23.0 JHPAEC none 


23.3 


Two Way MLR 7 day | 


l9(t 1 IP-VECTNF alpha + IL- 
|l beta 


15.8 


PBMC rest 


1 1.4 


Lung fibroblast none 


18.4 


PBMC PWM 


72.^ 


Lung fibroblast TNF 
alpha + IL-1 beta 


12.0 


irrJiVIL, J: rlA-JL 


33.9 ; Lung fibroblast IL-4 


35 8 


Ramos (B cell 1 none 


19.6 Lung fibroblast IL-9 


25.5 


Ramos (B cell) 
ionomycin 


100.0 SLung fibroblast IL-13 j 


18.7 


B lymphocytes PWM 


g jjLung fibroblast IFN 
gamma 


38.4 


B lymphocytes C D40I 
and IL-4 


A0 _ I Dermal fibroblast 
CCD1 070 rest 


48.0 


EOL-1 dbcAMP 


7 jDermal fibroblast 

JCCD1070TNF alpha 


83.5 

— - - -J 


EOL-1 dbcAMP 
PMA/ionomycin 


. _ _ JDermal fibroblast 
|CCD1070 IL-1 beta 


13.6 

— 


Dendritic cells none 


2Q 9 Dermal fibroblast IFN 
gamma 


13.1 


Dendritic cells LPS 


S ILL. 


Dermal fibroblast IL-4 


' 36.6 


Dendritic cells anti- 
CD40 


23.2 


IBD Colitis 2 


1.8 


Monocytes rest 


10.2 


IBD Crohn's 


2-4 


Monocytes LPS 


6.5 


Colon 


:«. s 


Macrophages rest 


36.1 JLung 


21.3 


Macrophages LPs 


13.3 i h\raih 


41.5 


HUVEC none 


37.6 Kidney 


24.3 


HUVEC starved 


58.6 \ 





CNS_neurodegeneration_vl.O Summary: Ag5279 This panel confirms the expression of 
the CG56870-04 gene at low levels in the brain in an independent group of individuals. 
However, no differential expression of this gene was detected between Alzheimer's 
diseased postmortem brains and those of non-demented controls in this experiment. Please 
see Panel 1 .5 for a discussion of the potential utility of this gene in treatment of central 
nervous system disorders. 

General_screening_panel_vl.5 Summary: Ag5279 Highest expression of the CG56870- 
01 is detected in cerebral cortex (CT=25.02). Thus, expression of this gene can be used in 
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distinguishing this sample from other samples in the panel. Furthermore, significant 
expression of this gene is observed throughout the CNS, including in amygdala, substantia 
nigra, thalamus, cerebellum, cerebral cortex, and spinal cord. The CG56870-01 gene 
encodes a Ndr3 protein homolog. The Ndr family is comprised of members from different 
5 gene families: Ndr 1 /RTP/Drg 1 /NDRG 1 , Ndr2 , and Ndr3 (PFAM: IPR004142). NDRG1 is 
a cytoplasmic protein involved in stress responses, hormone responses, cell growth, and 
different iation. Mutation of this gene was reported to be causative for hereditary motor and 
sensory neuropathy-Lorn. Recently, NDRG4, another memember of Ndr family, was 
shown to be expressed in neurons of the brain and spinal cord. Its expression was markedly 
10 decreased in the brain of Alzheimer's disease patient (Zhou et al., 2001). Therefore, this 
gene may play a role in central nervous system disorders such as Alzheimer's disease, 
Parkinson's disease, epilepsy, multiple sclerosis, schizophrenia and depression. 

Among metabolic tissues, this gene is moderately expressed in adipose, adrenal, 
heart, thyroid, liver, pancreas, pituitary, and skeletal muscle. Thus, this gene product may 
15 be impo rtant for the pathogenesis, diagnosis, and/or treatment of metabolic and endocrine 
disease, including Types 1 and 2 diabetes and obesity. 

In addition, there appears to be substantial expression in other samples derived from 
brain cancer cell lines, colon cancer cell lines, breast cancer cell lines and ovarian cancer 
cell lines. Moreover, therapeutic modulation of this gene could be of benefit in the 
20 treatment of brain, colon, breast or ovarian cancer. 

References: 

1 . Zhou RH, Kokame K, Tsukamoto Y, Yutani C, Kato H, Miyata T. (2001) 
Characterization of the human NDRG gene family: a newly identified member, NDRG4, is 
specifically expressed in brain and heart. Genomics 73(l):86-97 

25 Panel 1..3D Summary: Ag2075 Highest expression of the CG56870-01 gene is detected in 
the cerelbral cortex (CT=24.2). This expression is consistent with expression in Panel 1.5. 
Please see that panel for discussion of utility of this gene in the central nervous system. 

This gene also has moderate levels of expression in adipose, adrenal, thyroid, liver, 
heart, thyroid and skeletal muscle. Thus, this gene product may be important in the 
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pathogenesis, diagnosis and/or treatment of metabolic and endocrine disease, including 
Types 1 and 2 diabetes and obesity. 

In addition, there appears to be substantial expression in other samples derived from 
breast cancer cell lines, lung cancer cell lines, renal cancer cell lines and colon cancer cell 
5 lines. Thus, therapeutic modulation of this gene could be of benefit in the treatment of 
breast, lung, renal or colon cancer. 

Panel 2.2 Summary: Ag2075 Highest expression of CG5 6870-01 is detected in breast 
cancer sample (CT=29.89). Thus expression of this gene can be used in distinguishing this 
sample from other samples in the panel. In addition, there appears to be substantial 
10 expression in other samples derived from breast cancers, kidney cancers and colon cancers. 
Therefo re, therapeutic modulation of this gene could be of benefit in the treatment of 
breast, kidney or colon cancer. 

Panel 3D Summary: Ag2075 The expression of this gene appears to be highest in a 
sample derived from a lung cancer cell line (DMS-79)(CT=26.4). In addition, there appears 
15 to be substantial expression in other samples derived from pancreatic cancer cell lines, lung 
cancer cell lines, brain cancer cell lines and cervical cancer cell lines. Thus, the expression 
of this gene could be used to distinguish DMS-79 cells from other samples in the panel. 
Moreover, therapeutic modulation of this gene could be of benefit in the treatment of 
pancreatic, lung, brain or cervical cancer. 

20 Panel 4.1D Summary: Ag5279 Expression of the CG56870-01 gene is highest in samples 
derived from TNF alpha treated dermal fibroblast CCD1070 cells (CT=30.6). Expression 
of this gene is also prominent in activated secondary and primarey Thl, Th2 and Trl cells 
when compared expression in the corresponding resting cell lines. Thus, this gene may be 
involved in T lymphocyte function. Therefore, therapeutic modulation fo the expression or 

25 function of this gene may be as anti-inflammatory therapeutics for T cell-mediated 

autoimmune and inflammatory diseases, such as asthma, athritis, psoriasis, IBD, and lupus. 

Panel 4D Summary: Ag2075 Expression of the CG56870-01 gene is ubiquitous 
througout this panel, with highest in samples derived from ionomycin treated Ramos (B 
cell) cells (CT=26.1). Furthermore, expression of this gene is also detected in PWM treated 
30 PBMC cells and PWM treated B lymphocytes. Therefore, therapeutic modulation of the 
expression or function of this gene may reduce or eliminate the symptoms in patients with 
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autoimmune and inflammatory diseases in which B cells play a part in the initiation or 
progression of the disease process, such as systemic lupus erythematosus, Crohn's disease, 
ulcerative colitis, multiple sclerosis, chronic obstructive pulmonary disease, asthma, 
emphysema, rheumatoid arthritis, or psoriasis. 

CE. CG56870-05: N-myc downstream-regulated gene 3 

Expression of gene CG56870-05 was assessed using the primer-probe set Ag5265, 
described in Table CEA. Results of the RTQ-PCR runs are shown in Tables CEB and CEC. 



Table CEA . Probe Name Ag5265 



r— 

Primers 


Sequences 


I 

Length : 


Start 
Position 


SEQ ID 

1 NO: 


tForward 


5 ' -tgttcagctcacagagatcaaa-3 ' 


22 


18 


J 656 


— 

JProbe 


TET-5 ' -caagaaacttccaggactttgactgtca-3 ' - 
TAMRA 


28 ! 


65 


657 


[Reverse 


5 1 -catccattgtggggtactga-3 ' 


2d 


96 


it 658 . . 



Table CEB . CNS neurodegeneration vl.O 



Tissue Name 


Rel. Exp.(%) Ag5265, 
Run 230512714 


j Tissue Name j 


Rel. Exp.(%) Ag5265, 
Run 230512714 


AD 1 Hippo 


7.2 


(Control (Path) 3 j 
[Temporal C '.\ 


2.4 


AD 2 Hippo 


15.2 


Control (Path) 4 
(Temporal Ctx 


10.3 


j AD 3 Hippo 


2.0 


|AD 1 Occipital Ctx 


11.5 


jAD 4 Hippo 


3.3 


|AD 2 Occipital Ctx 

jTVlissing) 


0.0 


JAD 5 Hippo 


59.9 


AD 3 ( >ccipiuil Ctx 


7.0 


KD6H.ippo 


22.1 


jAD 4 Occipital Ctx 


6.9 


1 Control 2 Hippo 


14.5 


AD 5 Occipital Ctx 


54.7 


Control 4 Hippo 


2.6 


AD 6 Occipital Ctx 


12.2 


Control (Path) 3 
Hippo 


3.9 


Control 1 ( Jccipital 

(Ctx 


1.2 


jAD 1 Temporal 

[Ctx 


12.7 

! — 


IControl 2 Occipital 
Ctx 


60.3 


AD 2 Temporal 
Ctx 


16.0 


(Control 3 Occipital 
|Ctx 


8.2 


AD 3 Temporal 
Ctx 


5.5 


JControl 4 Occipital 
Ctx 




AD 4 Temporal 


13.0 


^Control (Path) 1 


53.6 
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Ctx 




Occipital Ctx 




AD 5 Inf Temporal 
Ctx 


64.6 


Control (Path) 2 
Occipital Ctx 


5.9 


AD 5 Sup 
Temporal Ctx 


28.9 


Control (Path) 3 
Occipital Ctx 


2.2 


AD 6 Inf Temporal 
Ctx 


27.0 


Control (Path) 4 
Occipital Ctx 


5.5 


AD 6 Sup 
Temporal Ctx 


100.0 


Control 1 Parietal 
Ctx 


3.0 


Control 1 
Temporal Ctx 


2.6 


Control 2 Parietal 
Ctx 


19.6 


Control 2 
I emporal Ctx 


38.2 


Control 3 Parietal 
Ctx 


18.8 


Control 3 
Temporal Ctx 


„ jControl (Path) 1 
[Parietal Ctx 


56.6 


Control 3 
Temporal Ctx 


1.5 


Control (Path) 2 
Parietal Ctx 


14.2 


Control (Path) 1 
Temporal ( tx 


31.4 


Control (Path) 3 
Parietal Ctx 


2.6 


Control (Fathi 2 
Temporal Ctx 


18.3 


Control (Path) 4 
Parietal Ctx 


25.0 


Table CEC. General screening_panel vl.5 


Tissue Name 
Adipose 


Rel. Exp.(%) 
j Ag5265, Run 
} 232936652 


Tissue Name 


Rel. £xp.(%) 
Ag5265, Run 
232936652 


1.1 


Renal ca. I K- 1 o 


14.8 


Melanoma* 
Hs688(A).T 


I 7 ' 9 


Bladder 


2.5 


Melanoma* 
Hs688(B).T 


10.1 

I- 


Gastric ca. (liver met.) 
NCI-N87 


_____ 


Melanoma 1 Ml -4 


12.4 


Gastric ci KATO III 


- J 78 


Melanoma* 
LOXIMVI 


13.7 

i_ 


Colon ca. S W-948 


1.8 


Melanoma* SK- 
MEL-5 


1 9.9 

i - 


Colon ca. S W480 


16.0 


Squamous cell 
carcinoma SCC-4 


8.3 


Colon ca.* (SW480 
met)SW620 


8.2 


Testis Pool 


2.2 


Colon ca. HT29 


6.7 


Prostate ca.* (bone 
met) PC-3 


18.0 


Colon ca. HCT-116 


15.8 


Prostate Pool 


4.4 


Colon ca. CaCo-2 


25.5 


Placenta 


2.1 


Colon cancel tissue 


2.5 


Uterus Pool 


1.9 


( lolon ca S\\ 1116 


2.3 
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Ovarian ca. 
OVCAEL-3 


9.0 


Colon ca. Colo-205 


4.8 


Ovarian ca. SK- 
OV-3 


16.0 


Colon ca. SW-48 


5.1 


Ovarian ca. 


4.5 


Colon Pool 


2.8 


Ovarian ca. 
OVCAR-5 


10.4 


Small Intestine Pool 


2.4 


Ovarian ca. 
IGROV-1 


12.0 


Stomach Pool 


1.8 


Ovarian ca. 
OVCAR-8 


4.5 


Bone Marrow Pool 


1.2 


Ovary 


2.8 


Fetal Heart 


2.3 


Breast ca. MCF-7 


4.9 


Heart Pool 


1.5 


Breast ca. MDA- 
MB-231 


30.1 


Lymph Node Pool 


3.0 


Breast ca. BT 549 


18.9 


Fetal Skeletal Muscle 


0.9 \ 


Breast ca. T47D 


4.0 


Skeletal Muscle Pool 


1.8 


Breast ca. MDA-N 


8.4 


Spleen Pool 


2.2 


iBreast Pool 


3.6 


Thymus Pool 




jTrachea 

h- - 


3.2 


CNS cancer 
(glio/astro) U87-MG 


18.3 


jLung 


1.7 


CNS cancer 
(glio/astro) U-118-MG 


19.5 


[Fetal Lung 


4.8 


CNS cancer 
(neuro;met) SK-N-AS 


8.4 


Lung ca.NCI-N417 


2.0 


CNS cancer (astro) SF- 
539 


8.8 


Lung ca. LX-1 


o ^ 1CNS cancer (astro) 
SNB-75 


29.1 


Lung ca.NCI-H146 


9.2 


CNS cancer (glio) 
SNB-19 


10.9 


Lung ca. SHP-77 


8.2 


CNS cancer (glio) SF- 
295 


17.2 


ILung ca. A549 


15.2 


Brain (Amygdala) Pool 


14.0 


Lung ca. NCI-H526 


3.2 


Brain (cerebellum) 


100.0 


Lung ca. NCI-H23 


16.2 


Brain (fetal) 


10.7 


Lung ca. NCI-H460 


2.7 


Brain (Hippocampus) 
Pool 


11.4 


Lung ca. HOP-62 


5.6 


Cerebral Cenex Pool 


' 15.5 


Lung ca. NCI-H522 


13.0 


Brain (Substantia 
nigra) Pool 


14.7 


Liver 


0.9 


Rutin ( ! habinuM Pi, >1 


17.9 


Fetal Liver 


2.4 


Brain (whole) 


25.9 
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Liver ca. HepG2 


6.2 


Spinal Cord Pool 


9.0 


Kidney Pool 


6.9 


Adrenal Gland 


5.0 


Fetal Kidney 


3.1 


Pituitar gland Pool 


1.5 


Renal ca. 786-0 


9.0 


Salivary Gland 


2.3 


Renal ca. A498 




Fhyroid ( female) 


2.0 


Renal ca. ACHN 


6.0 


Pancreatic ca. 
CAPAN2 


6.5 


Renal ca.UO-31 


'V - 


Pancreas Pool 


3.2 



CNS_neurodegeneration_vl.O Summary: Ag5265 This panel confirms the expression of 
the CG5 6870-04 gene at low levels in the brain in an independent group of individuals. 
However, no differential expression of this gene was detected between Alzheimer's 
diseased! postmortem brains and those of non-demented controls in this experiment. Please 
see Panel 1.5 for a discussion of the potential utility of this gene in treatment of central 
nervous system disorders. 

General_screening_panel_vl.5 Summary: Ag5265 Highest expression of the CG56870- 
05 gene is detected in cerebral cortex (CT=28.86). Thus, expression of this gene can be 
used in distinguishing this sample from other samples in the panel. Furthermore, significant 
expression of this gene is observed throughout the CNS, including in amygdala, substantia 
nigra, thalamus, cerebellum, cerebral cortex, and spinal cord. The CG56870-05 gene 
encodes a putative Ndr3 protein. This family consists of proteins from different gene 
families: Ndr 1 /RTP/Drg 1 /NDRG 1 , Ndr2, andNdr3 (PFAM: IPR004142). NDRG1 is a 
cytoplasmic protein involved in stress responses, hormone responses, cell growth, and 
differentiation. Mutation of this gene was reported to be causative for hereditary motor and 
sensory neuropathy-Lorn. Recently, NDRG4, another memember of Ndr family, was 
shown to be expressed in neurons of the brain and spinal cord. Its expression was markedly 
decreased in the brain of Alzheimer's disease patient (Zhou et al., 2001). Therefore, this 
gene may play a role in central nervous system disorders such as Alzheimer's disease, 
Parkinson's disease, epilepsy, multiple sclerosis, schizophrenia and depression. 

Among metabolic tissues, this gene has low levels of expression in heart, skeletal muscle, 
adrenal, thyroid, pancreas and pituitary. Therefore, this gene product may be important for 
the pathogenesis, diagnosis, and/or treatment of metabolic and endocrine disease, including 
Types 1 and 2 diabetes and obesity. 
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Overall, this gene is expressed in all the samples on this panel, with slightly higher 
levels of expression in the cancer cell lines compared to expression in the normal tissues 
samples. 

Panel 4.1D Summary: Ag5265 Expression of this gene is low/undetectable (CTs > 34.5) 
5 across all of the samples on this panel (data not shown). 

CF. CG59764-01: FERRITIN HEAVY CHAIN like protein 

Expression of gene CG59764-01 was assessed using the primer-probe set Ag3578, 
described in Table CFA. Results of the RTQ-PCR runs are shown in Tables CFB and CFC. 



Table CFA . Probe Name Ag3578 



Primers 


Sequences 


Lengthj 


Stilt 
Position 


SEQID 
NO: 


Forward 


5 ' - ctgcgacttcctggagaac- 3 1 


19 I 


430 


659 


Probe 


TET-5 1 -agcaggccaagaccatcaaagagct-3 ' - 
TAMRA 


25 j 


462 


660 


Reverse 


5 ' tqtgcag [ttgd - <ggta-3 ' 


19 


404 


661 1 



1 0 Table CFB . CNS_neurodegeneration_v 1 .0 



Tissue Name 


Rel. Exp.(%) Ag3578, 
Run 210642348 


Tissue Name 

_ - -. _ 


Rel. Exp.(%) Ag3578, 
Run 210642348 


AD 1 Hippo 


14.0 


Control (Path) 3 
Temporal Ctx 


3.2 


AD 2 Hippo 


8.2 


Control (Path) 4 
Temporal Ctx 


52.9 


U) 3 1 lippo 


6.7 


YD 1 < tccipitalCrx 


22.8 


AD 4 Hippo 


2.3 


M) 2 Occipital Ctx 
(Missing) 


0.0 


AD 5 Hippo 


83.5 


AD 3 Occipital Ctx 


7.3 


AD 6 Hippo 


29.1 


AD 4 Occipital Ctx 


14.9 


Control 2 Hippo 


1.6 


AD 5 Occipital Ctx 


11.4 


Control 4 Hippo 


3.1 


AD 6 Occipital Ctx 


19.8 


Control (Path) 3 
Hippo 


8.0 


Control 1 Occipital 
Ctx 


9.0 


AD 1 Temporal 
Ctx 


19.5 


Control 2 Occipital 

Ctx 


29.5 


AD 2 Temporal 


22.8 


Control 3 Occipital 
Ctx 


24.0 


AD 3 Temporal 


17.0 


Control 4 Occipital 


7.4 
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Ctx 




Ctx J 


AD 4 Temporal 
Ctx 


16.8 


Control (Path) 1 
Occipital Ctx 


38.4 


AJL> j ini 1 emporal 
Ctx 


50.3 


Control (Path) 2 
Occipital Ctx 


10.3 


AD 5 Sup 
Temporal Ctx 


40.6 


Control (Path) 3 
Occipital Ctx 


0.0 


AD 6 Inf Temporal 
Ctx 


63.7 


Control (Path) 4 
Occipital Ctx 


46.0 


AD 6 Sup 

Temporal Ctx | 


100.0 


Control 1 Parietal 
Ctx 


9.2 


Control 1 
Temporal Ctx 


Control 2 Parietal 
Ctx 


44.1 


Control 2 
Temporal Ctx 


13.3 


Control 3 Parietal 
Ctx 


21.0 


Control 3 
Temporal Ctx 


21.3 


Control (Path) 1 
Parietal Ctx 


20.2 


Control 3 
Temporal Ctx 


14.2 


Control (Path) 2 
Parietal Ctx 


16.7 


Control (Path) 1 
.Temporal Ctx 


35.1 


Control (Path) 3 
Parietal Ctx 


0.0 


Control (Path) 2 
Temporal L l\ 


25.3 


Control (Path) 4 
Parietal Ctx 


45.4 


Table CFC. General screening_panel vl.4 


? 

Tissme Name 


Rel. Exp.(%) 
Ag3578, Run 
217423081 


Tissue Name 


Rel. Exp.(%) 
Ag3578, Run 
217423081 


'Adipose j 8.6 


Renal ca. TK-10 


19.9 


Melanoma* _ 
! 1 .688(A).T 


Bladder 


0.0 


Melanoma* 
Hs688(B).T 


0.0 


Gastric ca. (liver met.) 
NCI-N87 


15.3 


Melanoma* Ml 4 


0.0 


Gastric ca. KATO III 


16.4 


Melanoma* 
LOXIMVI 


0.0 


Colon ca. SW-948 


22.8 


Melanoma* SK- 
MEL-5 


0.0 


Colon ca. SW480 


5.6 


Squamous cell 
carcinoma SCC-4 


8.3 


Colon ca.* <SW-IX0 
met) SW620 


42.0 


Testis PdoI 


27.4 


Colon ca. HT29 


_ 2.6 


Prostate ca.* (bone 
met) PC-3 


11.0 


Colon ca.HCT-116 


39.2 


Prostate Pool 


0.0 


Colon ca. CaCo-2 


13.5 
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Placenta. 


15.3 


Colon cancer tissue 


4.8 


Uterus Pool 


0.0 


Colon ca. SW1116 


0.0 


Ovarian ca. 
OVCAR-3 


0.0 


Colon ca. Colo-205 


4.2 


Ovarian ca. SK- 
OV-3 


14.4 


Colon ca. SW-48 


9.3 


( )varian ca. 
nvrAff 4. 

KJ v v^.tViV-'t 


1 

8.8 fColon Pool 


27.5 


Ovarian ca. 
OVCAR-5 


16.6 


Small Intestine Pool 


11.2 


IGROV-1 


13.0 


Stomach Pool 


11.3 


Ovarian ca. 
OVCAR-8 


4.3 


Bone Marrow Pool 


8.7 


Ovary 


0.0 


Fetal Heart 


17.7 


Breast ca. MCF-7 


0.0 


Heart Pool 


9.9 


Breast ca. MDA- 
MB-231 


5.7 


Lymph Node Pool 


22.4 


Breast ca. BT 549 


47.3 


Fetal Skeletal Muscle 


10.2 


Breast ca. T47D 


8.0 


Skeletal Muscle Pool 


100.0 


Breast ca. MDA-N 


0.0 


Spleen Pool 




Breast Pool 


7.6 


f hymus Pool 




Trachea 


0.0 


CNS cancer 
(glio/astro) U87-MG 


0.0 


[Lung 


3.1 


CNS cancer 
(glio/astro) U-118-MG 


8.1 


. _ 

[Fetal Lung 


7.1 


CNS cancer 
(neuro;met) SK-N-AS 


22.4 


Lung ca. NCI-N417 

L_____ 


0.0 


CNS cancer (astro) SF- 

539 


8.2 


Lung ca. LX-1 


49.7 


CNS cancer (astro) 
SNB-75 


25.2 


Lung ca. NCI-H146 

J__ 


0.0 


CNS cancer (glio) 
SNB-19 


10.1 




(Lung ca. SHP-77 


— - 

0.0 


CNS cancer (glio) SF- 
295 


24.5 


Lung ca. A549 


0.0 


Brain (Amygdala) Pool 


7.8 


Lung ca NCI-H526 


0.0 


Brain (cerebellum) 


20.3 


Lung ca. NCI-H23 


11.9 


Brain (fetal) 


18.7 


Lung ca. NCI-H460 


2.7 


Brain (Hippocampus) j ^ ^ 


Lung cj. H( >P-62 


10.9 


Cerebral ( ortex Pool 


v, . 


Lung ca . NCI-H522 


0.0 


Brain (Substantia 
nigra) Pool 


35.4 

, _ 
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1 iver 


0.0 


Brain < t hiiuttiUM s-Vol 


19.1 


Fetal Liver 


0.0 


Brain (whole) 


17.3 


Liver ca. HepG2 


IT" 


> pins ( i . Pool 


9.2 


Kidney Pool 


21.5 


Adrenal Gland 


3.7 


Fetal Ki>ine> 


21.2 


Pifuitan gland Pool 


no 


Renal ca. 786-0 


11.7 


Salivary Gland 


0.0 


Renal ca. A498 


9.2 


Thyroid (female) 


6.7 


Renal ca. ACHN 


11.5 


Pancreatic ca. 
CAPAN2 


48.3 


Renal ca.UO-31 


10.3 


Pancreas Pool 


5.5 



CNS_neurodegeneration_vl.O Summary: Ag3578 This panel confirms the expression of 
the CG5 9764-01 gene at low levels in the brain in an independent group of individuals. 
However, no differential expression of this gene was detected between Alzheimer's 
diseased postmortem brains and those of non-demented controls in this experiment. Please 
see Panel 1.4 for a discussion of the potential utility of this gene in treatment of central 
nervous system disorders. 

Genera l_screening_panel_vl .4 Summary: Ag3578 Highest expression of the CG59764- 
01 gene is detected in sample derived from skeletal muscle (CT=31.2). Thus expression of 
this gene can be used to distinguish skeletal muscle sample from other samples used in this 
panel. This gene is also expressed at low but significant levels in heart and adipose. Thus, 
this gene product may be useful in the treatment of metabolic disorders that involve these 
tissues, including obesity. 

Significant expression of this gene is also associated with samples derived from 
breast cancer, pancreatic cancer, colon cancer and lung cancer cell lines. Therefore, 
1 5 therapeutic modulation of the activity of this gene or its protein product might be beneficial 
in the treatment of these cancers. 

In addition, this gene is expressed at low levels throughout the CNS, including in 
amygdala, substantia nigra, thalamus, cerebellum, cerebral cortex, and spinal cord. The 
CG59764-01 gene encodes a homologue of ferritin heavy chain protein (H-feritin). It has 
20 been hypothesized that the up-regulation of the H-ferritin mRNA is part of a mechanism 
protecting the hippocampus, a seizure-prone area, against a possible overactivation during 
absence seizures (Lakaye et al., 2000). Therefore, therapeutic modulation of the expression 
or function of this gene may be useful in the treatment of seizure disorders, such as 
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epilepsy. Furthermore, this gene may play a role in central nervous system disorders such 
as Alzheimer's disease, Parkinson's disease, multiple sclerosis, schizophrenia and 
depression. 



References: 

5 1 . Lakaye B, de Borman B, Minet A, Arckens L, Vergnes M, Marescaux C, Grisar 

T. (2000) Increased expression of mRNA encoding ferritin heavy chain in brain structures 
of a rat model of absence epilepsy. Exp Neurol 162(1):1 12-20. 

Panel 4.1D Summary: Ag3578 Expression of this gene is low/undetectable (CTs > 35) 
across a ll of the samples on this panel (data not shown). 

10 CG. CG59710-01: P14 

Expression of gene CG59710-01 was assessed using the primer-probe set Ag3512, 
described in Table CGA. Results of the RTQ-PCR runs are shown in Tables CGB and 
CGC. 



Table CGA . Probe Name Ag35 12 



Primers 


Sequences 


Length 


Start I SEQID 
Position NO: 


Forw art 


5 ' -ctttgttctccagcacatctg-3 ' 


21 


211 | 662 


Probe 


TET-5 ' - ctacatcatggccgagat ctgcaatg- 3 ' - 
TAMRA 


26 


232 663 


Reverse 


5 1 -cctcgcatgtttaggatctg-3 ' 


20 


290 ' 664 



1 5 Table CGB . CNS_neurodegeneration_vl .0 



Tissue Name 


Rcl. Exp.(%) A S 3512. 
Run 211004862 


Tissue Name I ^^JlfJ ^ 
Run 21 1(KMS(.2 


AD 1 Hippo 


23.0 


Control (Path) 3 
Temporal Ctx 


7.2 


AD 2 Hippo 


31.2 


Control (Path) 4 
Temporal Ctx 


37.1 


AD 3 Hippo 


9.8 


AD 1 Occipital 
Ctx 


20.0 


AD 4 Hippo 


10.0 


AD 2 Occipital 
Ctx (Missing) 


0.0 


AD 5 hippo 


85.9 


AD 3 Occipital 
Ctx 


9.0 
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AD 6 Hippo 


56.6 


AD 4 Occipital 
Ctx 


21.3 


Control 2 Hippo 


41.5 


AD 5 Occipital 
Ctx 


15.2 


Control 4 Hippo 


21.6 


AD 6 Occipital 
Ctx 


49.0 


Control (Path) 3 
Hippo 


12.5 


Control 1 Occipital 
Ctx 


6.0 


AD 1 Temporal Ctx 


31.9 


Control 2 Occipital 
Ctx 


77.4 


AD 2 Temporal Ctx 


41.2 


Control 3 Occipital 
Ctx 


22.1 


AD 3 Temporal Ctx 


11.3 


Control 4 Occipital 
Ctx 


9.8 


AD 4 Temporal Ctx 


21.0 


Control (Path) 1 
Occipital Ctx 




77.9 


AD 5 Inf Temporal 

Ctx 


100.0 


Control (Path) 2 
Occipital Ctx 




12.6 


AD 5 SupTemporal 

Ctx 


42.6 


Control (Path) 3 
Occipital Ctx 


5.4 


AD 6 Inf Temporal 

Ctx 


35.1 


Control (Path) 4 
Occipital Ctx 


21.6 


AD 6 Sup Temporal 
Ctx 


56.6 


Control 1 Parietal 
Ctx 


11.4 


jControl 1 Temporal 
Ctx 


10.1 


Control 2 Parietal 
Ctx 


52.5 


JControl 2 Temporal 
Ctx 


53.2 


Control 3 Parietal 
Ctx 


21.5 


Control 3 Temporal 
Ctx 


17.2 


Control (Path) 1 
Parietal Ctx 


68.8 


Control 4 Temporal 
Ctx 


15.9 


Control (Path) 2 
Parietal Ctx 


29.7 


Control (Path) 1 
Temporal Ctx 


55.1 




Control (Path) 3 
Parietal Ctx 


10.2 


Control (Path) 2 
Temporal Ctx 


39.5 


Control (Path) 4 
Parietal Ctx 


48.3 


Table CGC. General screening panel vl.4 


1 Tissue Name 

| 


Rel. Exp.(%) 
Ag3512, Run 
217240776 


Tissue Name 


Rel. Exp.(%) 
Ag3512, Run 
217240776 


[Adipose 


3.3 


Renal ca. TK-10 


21.8 


| Melanoma* 
|Hs688(A).T 


9.7 


Bladder 


5.5 
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Melanoma* 
Hs688(B).T 


10.5 


Gastric ca. (liver met.) 
NCI-N87 


37.1 


Melanoma* M14 


26.4 


t ia>t! it ca. K Vi O 1 11 


48.6 


Melanoma* 
LOXIMVI 


35.4 


Colon ca. SW-948 


7.2 


Melanoma* SK- 
MEL-5 


14.4 


Colon ca. SW480 


62.4 


Squamous cell 
carcinoma SCC-4 


14.9 


Colon ca.* (SW480 
met) SW620 


29.7 


Testis Pool 


6.7 


Colon ca. HT29 


14.6 


Prostate ca.* (bone 
met) PC-3 


______ 

17.2 


Colon ca. HCT-116 


24.1 


Prostate Pool 


4.8 


Colon ca. CaCo-2 


46.0 


Pl&ccntft 


4.4 




13.9 


Uterus Pool 


3.1 


Colon ca. SWT 1 16 


2.1 


Ovarian ca. 
OVCAR-3 


20.0 


Colon ca. Colo-205 


10.8 


Ovarian ca. SK- 
OV-3 


31.6 


Colon ca. S W-48 


11.9 


Ovarian ca. 
ovrAP -4. 


19.6 




Colon Pool 


13.4 


Ovarian ca. 
OVCAR-5 


30.8 


Small Intestine Pool 


7.0 


Ov&ricin C3.. 
IGROV-l 


10.2 


Stomach Pool 


4.0 


Ovarian ca. 
OVCAR-8 


14.0 


_ 

Bone Marrow Pool 


2.4 


Ovary 


3.6 


Fetal Heart 


3.3 


Breast ca. MCF-7 


18.2 


I ieart Pool 


4.1 


Breast ca. MDA- 
MB-231 


33.7 


Lymph Node Pool 


7.5 


Breast ca. BT 549 


24.8 


heul SI t ietal Muscle 


1.8 


Breast ca. 1 47D 


100.0 


Skeletal Muscle Pool 


6.3 


Breast ca. MDA-N 


04 g 


opiccn ruui 


10 8 


Breast Pool 


7.3 


riivmus Pool 


5.9 


Trachea 


2.8 


CNS cancer 

(giio/a tro)U87-MG 


33.7 


Lung 


3.1 


(. MS cancer 
(glio/astro) U-l 18-MG 


30.1 


Fetal Lung 


7.0 


CNS cancer 
(neuroanet) SK-N-AS 


13.3 


Lung ca. NCI-N417 


7.3 


CNS cancer (astro) SF- 
539 


16.2 


Lung ca. LX-1 


17.6 


CNS cancer (astro) 
SNB-75 


46.3 
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Lung ca. NCI-H146 

"— ■" 


n iCNS cancer (glio) 
jSNB-19 


10.6 

j 


Lung ca. SHP-77 


FnS cancer (glio) SF- 
Zi ' U (295 


27.5 


Lung ca. A549 


25.7 Ruin (Amygdala) Pool 


4.1 


Lung ca. NCI-H526 


15.0 (Brain (cerebellum) 


6.7 


Lung ca. NCI-H23 


30.8 


Brain (fetal) 


6.7 J 


Lung ca. NCI-H460 


9.3 


Brain (Hippocampus) 
Pool 


""" " ™* — 

4.4 


Lung ca. HOP-62 


6.7 


Cerebral Cortex Pool 


6.3 


T nnir "MfT H^T') 
J_.ung Cd. IN V^l-rl J LL 


15.2 

. . 


Brain (Substantia 
nigra) Pool 


6.1 


Liver 


1.3 Brain (Thalamus) Pool 


5.6 


(Fetal Liver 
Liver ci . HepG2 


7.7 Brain (whole) 
1 1 .2 spinal Cord Pool 


4.7 


4.5 


iKidney Pool 


12.9 


Adrenal Gland 


3.3 


i Fetal Kidney 
Renal ca. 786-0 


4.2 


Pituitar) gland Pool 


2.6 


11.8 


Salivary Gland 


2.5 


Renal ca. A498 


6.1 jThyroid (female) 


3.5 


Renal ca. ACHN 


, n - (Pancreatic ca. 
110 CAPAN2 


15.3 


Renal ca. UO-31 


18.2 Pancreas Pool 


7 ; 6 



CNSneurodegenerationvl.O Summary: Ag3512 This panel confirms the expression of 
the CG59710-01 gene at low levels in the brain in an independent group of individuals. 
However, no differential expression of this gene was detected between Alzheimer's 
diseased postmortem brains and those of non-demented controls in this experiment. 
5 However, as seen in panel 1.4, this gene is expressed at low levels throughout the CNS, 
includin g in amygdala, substantia nigra, thalamus, cerebellum, cerebral cortex, and spinal 
cord. Therefore, this gene may play a role in other central nervous system disorders such as 
Parkinson's disease, epilepsy, multiple sclerosis, schizophrenia and depression. 

General_screening_panel_vl.4 Summary: Ag3512 Highest expression of the CG59710- 
10 01 gene is detected in a sample derived from a breast cancer cell line (CT=25.3). Therefore, 
expressi on of this gene could be used in distinguishing this sample from other samples in 
the panel. Overall, expression of this gene appears to be associated with the cancer cell 
lines suggesting a role for this gene product in cellular growth and proliferation. 
Specifically, significant expression of this gene is associated with CNS cancer, colon 
15 cancer, gastric cancer, renal cancer, lung cancer, breast cancer, ovarian cancer, and 



952 



melanoma cancer cell lines. Therefore, therapeutic modulation of the activity of this gene 
or its protein product might be beneficial in the treatment of these cancers. 

Panel 4.1D Summary: Ag3512 Results from one experiment with the CG59710-01 gene 
are not included. The amp plot indicates that there were experimental difficulties with this 
5 run. 

CH. CG59754-02 and CG59754-01: DOWN SYNDROME CELL ADHESION 
MOLECULE 

Expression of gene CG59754-02 and variant CG59754-01 was assessed using the 
primer-probe set Agl305, described in Table CHA. 

1 0 Table CHA . Probe Name Ag 1 305 



f 

jPrimers 


Sequences 


Length 


Start 
Position 


SEQID I 
NO: 


Forward 


5 1 -gtgagcattgtgtctccagaa-3 ' 


21 


291 


665 j 


Probe 


TET-5 ' -tttattacctaccacggcgggctgta-3 1 - 

TAMRA 


26 


321 


666 


[Reverse 


5 ' -tcctccttctgtacgtcagaga-3 ' 


22 


349 


667 



Panel 4D Summary: Agl305 Expression of this gene is low/undetectable (CTs > 35) 
across all of the samples on this panel (data not shown). 

CI. CG59800-01: HEPARAN SULFATE D-GLUCOSAMINYL 3-0- 
SULFOTRANSFERASE-3B 



15 Expression of gene CG59800-01 was assessed using the primer-probe set Ag3589, 

described in Table CIA. 



Table C IA. Probe Name Ag3589 



Primers 


Sequences 


Length 


Start 
Position 


SEQID 1 

NO: J 


Forward 


5 1 -tacatacctgccctgtccatac-3 1 


22 


88 


668 


Probe 


TET-5 1 -ctacatacctgccccgtccatacctg-3 ' - 
TAMRA 


26 


117 


669 


Reverse 


5 ' -gtatggacggggcaggtat-3 ' 


19 


121 


670 | 
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Results from Panels CNS neurodegeneration vl.O, 1.4, 2.2, and 4.1D are not 
included. The amp plots corresponding to these runs suggest that there were experimental 
difficulties with these runs. 

CJ. CG59761-01: AXIN 1 (AXIS INHIBITION PROTEIN 1) (HAXIN) - 
5 isoforml, submitted to study DDSMT on 03/21/01 by cmiller; clone 

status=FIS; novelty=Novel; ORF start=97, ORF stop=2833, frame=l; 2949 bp. 

Expression of gene CG59761-01 was assessed using the primer-probe set Ag3577, 
described in Table CJA. Results of the RTQ-PCR runs are shown in Tables CJB, CJC and 
CJD. 

10 Table CJ A. Probe Name Ag3577 



Primers 


Sequences 


Length 


Start 
Position 


SEQID 
NO: 


Forward 


5 ' -atacttgaagtgggctgagtca-3 1 


22 


486 


671 


Probe 


TET-5 1 -cattccctgctggatgaccaagatg-3 ' - 
TAMRA 


25 


511 


672 


Reverse 


5 ' -aggaaagtcctgaacaggctta-3 1 


22 


539 





Table C JB. CNS_neurodegeneration_vl.O 



Tissue Name 


Rel. Exp.(%) Ag3577, | 

Run 210642177 1 1,ssueJ ^ ame 


Rel. Exp.(%) Ag3577, 
Run 210642177 


AD 1 Hippo 


26.1 


Control (Path) 3 
Temporal Ctx 


7.5 


AD 2 Hippo 


20.6 


Control (Path) 4 
Temporal Ctx 


26.8 


AD 3 Hippo 


10.8 


AD 1 Occipital 
Ctx 


23.5 


AD 4 Hippo 


9.1 


AD 2 Occipital 
Ctx(Missirui 


0.0 


AD 5 hippo 


86.5 


AD 3 Occipital 
Ctx 


12.3 


AD 6 Hippo 


48.3 


AD 4 Occipital 
Ctx 


18.9 


Control 2 Hippo 


21.8 


AD 5 Occipital 
Ctx 


24.3 


Control 4 Hippo 


16.6 


AD 6 Occipital 
Ctx 


34.4 


Control (Path) 3 

iiippo 


4.7 


Control 1 Occipital 
Ctx 


6.2 


AD 1 Temporal Ctx 


25.7 


Control 2 Occipital 


57.4 
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1 


Ctx 


\ 


AD 2 Temporal Ctx 


28.3 


Control 3 Occipital 
Ctx 


13.4 


AD 3 Temporal Ctx 


14.5 


Control 4 Occipital 
Ctx 


8.7 


AD 4 Temporal Ctx 


19.8 


Control (Path) 1 
Occipital Ctx 


62.4 


AD 5 Inf Temporal 
Ctx 


100.0 


Control (Path) 2 
Occipital Ctx 


10.5 


AD 5 SupTemporal 1 . . . 
Ctx J 44J 


Control (Path) 3 
Occipital Ctx 


5.3 


AD 6 Inf Temporal 
Ctx 


48.3 


Control (Path) 4 
Occipital Ctx 


20.4 


AD 6 Sup Temporal 
Ctx 


47.0 


Control 1 Parietal 

Ctx 


15.9 


Control 1 Temporal 
Ctx 


11.6 


Control 2 Parietal 
Ctx 


54.3 


Control 2 Temporal 
Ctx 


35.1 


Control 3 Parietal 
Ctx 


15.5 


Control 3 Temporal 
Ctx 


14.6 


Control (Path) 1 
Pari, tal Ctx 


43.5 


Control 4 Temporal 
Ctx 


fControl (Path) 2 
jParietal Ctx 


21.3 


Control (Path) 1 
Temporal Ctx 


. - (Control (Path) 3 
4/ - U jParietal Ctx 


7.0 


Control (Path) 2 
Temporal Ctx 


28.5 


Control (Path) 4 1 
Parietal Ctx 


Table CJC. General screening panel vl.4 


1 

Tissue Name 


Rel. Exp.(%) | 
Ag3577, Run Tissue Name 
217343282 1 


Rel. Exp.(%) 
Ag3577, Run 
217343282 


Adipose 


2.8 


Renal ca TK-10 


35.4 


Melanoma* 
Hs688(A).T 


13.9 


Bladder 


13.3 


Melanoma* 
Hs6S8(B).T 


13.7 


Gastric ca. (liver met.) 
NCI-N87 


70.7 


Melanoma* Ml 4 


28.5 


Gastric ca KATO III 


100.0 


Melanoma* 
1 OXIMV1 


21.2 


Colon ca. SW-948 


15.0 


Melanoma* SK- 
MEL-5 j 


24.3 


Colon ca. SW480 


43.5 


Squamous cell ) 
carcinoma SCC-4 i 


23.2 


Colon ca * (SW480 
met) SW620 


47.6 
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l ost is Pool 


6.4 ICoIon ca. HT29 


26.2 


Prostate ca.* (bone 
met) PC-3 


32.3 


Colon ca. HCT-116 


27.2 


Prostate Pool 




Colon ca. CaCo-2 


35.4 


Placenta 


6.2 


Colon cancer tissue 


13.1 


Uterus Pool 


2.8 


Colon ca S\\ 1 1 In 


14.2 


Ovarian ca. 
OVCAR-3 


12.5 


Colon ca. Colo-205 


10.1 


Ovarian ca. SK- 
OV-3 


56.6 


Colon ca. SW-48 


21.8 


Ovarian ca. 
OVCAR-4 


11.4 


Colon Pool 


10.7 


OVCAR-5 


42.0 


Small Intestine Pool 


10.4 


IGROV-1 


12.9 


Stomach Pool 


5.6 


Ovarian ca. 
OVCAR-8 


13.1 


Bone Marrow Pool 


5.2 


Ovary 


7.3 


Fetal Heart 


3 - 5 . . j 


Breast ca. MCF-7 


29.3 [Heart Pool 


4.6 J 


Breast ca. MDA- 
MB-231 


32.3 [Lymph Node Pool 


12.7 j 


(Breast ca. BT 549 


30.1 Fetal Skeletal Muscle 




|Breast ca. T47D 


7573 S k e letaF Muscle Pool 




rsreasi ca. ivjul'/\-in 


2I.S Spleen Pool 




Breast F'ool 


12.3 


i h) inns Pool 




Trachea 


10.8 


CNS cancer 
(glio/astro) U87-MG 


10.7 


Lung 


1.8 


CNS cancer 
(glio/astro) U-118-MG 


42.9 


Fetal Lung 


1 , n ICNS cancer 

j(neuro;met) SK-N-AS 


25.3 


[Lung ca. NCI-N417 


7 n ICNS cancer (astro) SF- 
AU '539 


6.0 


Lung ca . LX-1 


ICNS cancer (astro) 
jSNB-75 


| I 4 ; 2 


Lung ca. NCI-H146 


jCNS cancer (glio) 
SNB-19 


13.8 


Lung ca.. SHP-77 


CNS cancer (glio) SF- 


28.1 


Lung ca. A549 


29.1 Brain ( Amygdala) Pool 


5.3 


Lung ca. NCI-H526 


7.4 fBrain (cerebellum) 


34.9 


Lung ca. NCI-H23 


28.3 fBrain (fetal) 


21.5 


Lung ca. NCI-H460 


23.3 jBrain (Hippocampus) 


I 5.4 
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Pool 


" ~ ~~ 


Lung ca. HOP-62 


7.8 


Cerebral Cortex Pool 


6.3 


T nnft /-a \TrT T-T^OO 


28.5 


Brain (Substantia 
nigra) Pool 


6.6 


Liver 


0.9 


Buun i i iul imus) Pool 


8.1 


Fetal Liver 


12.5 


Brain (whole) 


11.6 


Liver ca. HepG2 


26.8 


Spinal Cord Pool 


_ ^ ,.. 


Kidney Pool 


14.1 


Adrenal Gland 


10.1 


Fetal Kidney 


7.2 


Pituitary gland Pool 


0.0 


Renal ca. 786-0 


13 6 (Salivary Gland 


5.9 


Renal ca. A498 


7.9 


Thyroid (female) 


"5.V* " 


Renal ca. ACHN 


22.1 


Pancreatic ca. 
CAPAN2 


30.8 


Renal ca.UO-31 


17.7 


Pancreas Pool 


13.9 



Table CJ D. Panel 4. ID 



Tissue Name 


Rel. Exp.(%) j 

Ag3577, Run I Tissue Name 
169851850 


Rel. K\p.(%) 
Ag3577, Run 
169851850 


Secondary Thl act 


50.0 JHUVEC IL-lbeta 


37.9 


Secondary Th2 act 


57 0 III VECIFN gamma 


25.5 


Secondary Trl act 


?1 , HUVECTNF alpha + 
U N gamma 


35.8 




Secondary Thl rest 


OQ IHUVEC TNF alpha + 
IL4 


36.9 


Secondary Th2 rest 


58.2 ;huvecil-h 


14.6 


Secondary Trl rest 


^ j (Lung Microvascular EC 
inone 


48.0 


Primary Thl act 


,~ „ jLung Microvascular EC 
|TNFaIpha + IL-lbeta 


45.7 


Primary Th2 act 


, (Microvascular Dermal 
oi.o |__, 

IX none 


27.5 


Primary Trl act 


, n _ iMicrosvasular Dermal 
" EC I NI alpha + IL-lbeta 


24.1 


Primary Thl rest 


s0 _ ^ Bronchial epithelium 
~ TNFalpha I ILlbeta 


39.2 


Primary Th2 rest 


^ j |Small airway epithelium 
none 


20.4 


Primary Trl rest 


R . . Small airway epithelium 
jTNFalpha + IL-lbeta 


57.8 


CD45RA CD4 
lymphocyte act 


47.6 


Coronery artery SMC rest 


13.9 


CD45RO CD4 


58.6 


Coronery artery SMC 


12.9 
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TNFalpha + IL-1 beta 




( I )8 lymphocyte act 


55.5 


Astrocytes rest 


16.8 


Secondary CD8 
lymphocyte rest 


51.4 


Astrocytes TNFalpha + 
IL-1 beta 


18.7 


Secondary CD8 
lymphocyte act 


31.0 


KU-812 (Basophil) rest 


37.1 


CD4 lymphocyte none 


33 2 iKU-812 (Basophil) 
PMA/ionomycin 


64.2 


2ryThl/Th2/Trl_anti- 


54.3 


CCD 1106 

(Keratinocytes) none 


48.0 


LAK cells rest 


52.1 


CCD1106 
(Keratinocytes) 
TNFalpha + IL-1 beta 


43.8 


LAK cells IL-2 


50.3 


Liver cirrhosis 


7.5 


1 AK cells II -2+IL-12 


47.0 


NCI-H292 none 


30.6 


LAK cells IL-2+IFN 
gamma 


62.9 


NCI-H292 IL-4 


60.3 


L AK cells II -2+ II -IS 


61.1 NCI-H292 IL-9 


72.2 


LAK cells 
PMA/ionomycin 


95.9 


NCI-H292 IL-1 3 


57.8 


NK Cells IL-2 rest 


100.0 


NCI-H292 IFN gamma 


85.9 


Two Way MLR 3 day 


74.7 


HPAEC none 


23.0 


Two Way MLR 5 day 


50.7 


HPAEC TNF alpha + IL- 
1 beta 


33.4 


Tv\o Wav MLR 7 day 


27.9 Lung fibroblast none 


15.3 


|PBMC rest 


5 8 0 Lung fibroblast TNF 
alpha - 1L-I beta 




PBMCPWM 


46.7 1 uny fibroblast IL-4 


23.7 


LtdJVIL. J riA-L 


29.9 l.uiiG fibroblast IL-9 


99 3 


Rn nos i B veil) none 


39.2 Lung fibroblast IL- 13 




Ramos (B cell) 
ionomycin 


42 g Lung fibroblast IFN 

{gamma 


36 ' 3 


|B lymphocytes PWM 


~ iDermal fibroblast 
■ (CCD 1070 rest 


47.3 


!D lympnocyies ^-l/^ul 
|and IL-4 


.p „ iDermal fibroblast 

JCCD1070 TNF alpha 


94.6 


1 

iEOL-1 dbcAMP 


jDermal fibroblast 
3 CCD1070 IL-1 beta 


20.6 
25.0 


K )l -1 dbc \MP 

jPMA/ionomycin 


5 ^ j Dermal fibroblast M M 
i gamma 


Dendritic cells none 


47.6 Dermal fibroblast IL-4 


34.6 


(Dendritic cells 1 PS 


4 1 .2 jDermal Fibroblasts rest 


22.4 


jjDendritic cells anti- 
|CD40 


62.4 (Neutrophils TNFa+LPS 


10.2 
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Monocytes rest ] 50.3 


Neutrophils rest 


28.9 


Monocytes LPS 63.7 


Colon 


20.2 


Macrophages rest 


46.0 


Lung 


21.9 


Macrophages LPS 


27.7 


Thymus 


57.0 


HUVEC none 


15.8 


Kidney 


15.8 


HUVEC starved 


29.9 







CNS_neurodegeneration_vl.O Summary: Ag3577 This panel confirms the expression of 
the CG59671-01 gene at low levels in the brain in an independent group of individuals. 
However, no differential expression of this gene was detected between Alzheimer's 
diseased postmortem brains and those of non-demented controls in this experiment. As 
5 seen in panel 1 .4, this gene is expressed at low levels throughout the CNS, including in 
amygdala, substantia nigra, thalamus, cerebellum, cerebral cortex, and spinal cord. 
Therefore, this gene may play a role in other central nervous system disorders such as 
Parkinson's disease, epilepsy, multiple sclerosis, schizophrenia and depression. 

General_screening_panel_vl.4 Summary: Ag3577 Highest expression of the CG59671- 

10 01 gene is detected in a gastric cancer cell line sample (CTs=27.3). In addition, significant 
expression of this gene is associated with clusters of cell lines derived from ovarian cancer, 
breast cancer, and gastric cancer. Therefore, expression of this gene might be used to 
differentiate between these samples and other samples on this panel and as a marker for 
these cancers. The CG59671-01 gene encodes an Axin 1 protein, which is known play an 

1 5 important role in Wnt signalling transduction pathway. The Wnt/Wingless signaling 
transduction pathway plays an important role in both embryonic development and 
tumorigenesis. Beta-Catenin, a key component of the Wnt signaling pathway, interacts with 
the TCF/LEF family of transcription factors and activates transcription of Wnt target genes. 
A number of proteins such as the tumor suppressor APC and Axin are also involved in the 

20 regulation of the Wnt signaling pathway. Furthermore, mutations in APC or beta-catenin 
have been found to be responsible for the genesis of human cancers (Akiyama T, 2000). 
Recently, Dahmen et al. (2001) have shown presence of a single somatic point mutation in 
exon 1 (Pro255Ser) and deletion of seven large of AXIN1 (12%) in 86 medulloblastoma 
(MB) samples and 1 1 MB cell lines. Therefore, AXIN1 may play a role as tumor 

25 suppressor gene in MBs. Furthermore, therapeutic modulation of the activity of this gene or 
its protein product might be beneficial in the treatment of these cancers. 
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Among tissues with metabolic function, this gene is expressed at moderate to low 
levels in pituitary, adipose, adrenal gland, pancreas, thyroid, and adult and fetal skeletal 
muscle, heart, and liver. This widespread expression among these tissues suggests that this 
gene product may play a role in normal neuroendocrine and metabolic and that disregulated 
5 expression of this gene may contribute to neuroendocrine disorders or metabolic diseases, 
such as obesity and diabetes. 

This gene is also expressed in all regions of the CNS examined. Please see Panel 
CNS_neurodegeneration_vl.O for discussion of utility of this gene in the central nervous 
system. 

10 References: 

1 . Akiyama T. (2000) Wnt/beta-catenin signaling. Cytokine Growth Factor Rev 
ll(4):273-82. 

2. Dahmen RP, Koch A, Denkhaus D, Tonn JC, Sorensen N, Berthold F, Behrens 
J, Birchmeier W, Wiestler OD, Pietsch T. (2001) Deletions of AXIN1, a component of the 

1 5 WNT/wingless pathway, in sporadic medulloblastomas. Cancer Res 2001 Oct 
l;61(19):7039-43 

Panel 4.1D Summary: Ag3577 Highest expression of the CG59671-01 gene is detected in 
resting NK Cells IL-2 cells (CTs=28.3). In addition, this gene is expressed at high to 
moderate levels in a wide range of cell types of significance in the immune response in 

20 health and disease. These cells include members of the T-cell, B-cell, endothelial cell, 

macrophage/monocyte, and peripheral blood mononuclear cell family, as well as epithelial 
and fibroblast cell types from lung and skin, and normal tissues represented by colon, lung, 
thymus and kidney. This ubiquitous pattern of expression suggests that this gene product 
may be involved in homeostatic processes for these and other cell types and tissues. 

25 Therefore, modulation of the gene product with a functional therapeutic may lead to the 
alteration of functions associated with these cell types and lead to improvement of the 
symptoms of patients suffering from autoimmune and inflammatory diseases such as 
asthma, allergies, inflammatory bowel disease, lupus erythematosus, psoriasis, rheumatoid 
arthritis, and osteoarthritis. 
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CK. CG59708-01 and CG59708-02 and CG59708-03: Ubiquitin carboxyl- 
terminal hydrolase 21 

Expression of gene CG59708-01, full length clone CG59708-03 and variant 
CG59708-02 was assessed using the primer-probe set Ag35 11, described in Table CKA. 
Results of the RTQ-PCR runs are shown in Tables CKB, CKC and CKD. Please note that 
CG59708-03 represents a full-length physical clone of the CG59708-01 gene, validating 
the prediction of the gene sequence. 



Table CK A. Probe Name Ag351 1 



Primers 


Sequences 


Length 


Start 
Position 


SEQID 

NO: 


Forw ard 


5 ' -acccaaaagggtagtagaacga-3 ' 


22 


2431 


674 


Probe 


TET-5 1 -cccttctggaacagtttgcagataaa-3 ' - 

TAMRA 


26 


2454 


675 


Reverse 


5 1 -gccaccttcataatgctgatt-3 ' 


21 


2503 


676 



Table CKB . CNS_neurodegeneration_vl.O 



Tissue Name 


Rel. Exp.(%) Ag3511, 
Run 210499621 


Tissue Name 


Rel. Exp.(%) Ag3511, 
Run 210499621 


AD 1 Hippo 


8.2 


Control (Path) 3 
Temporal Ctx 


4.1 ! 


AD 2 Hippo 


17.9 


Control (Path) I 
Temporal Ctx 


32.1 


AD 3 Hippo 


6.0 


AD 1 Occipital Ctx 


17.7 


AD 4 Hippo 


4.1 


AD 2 Occipital Ctx 
(Missing) 


0.0 


AD 5 Hippo 


87.1 


AD 3 Occipital Ctx 


3.1 


AD 6 Hippo 


52.5 


AD 4 Occipital Ctx 


20.0 


Control 2 Hippo 


13.0 


\D 5 Occif ita! Cl\ 


22.4 


( ontj 1 4 Hippo 




\FJ> 6 Occipital Clx 


21.6 


Control (Path) 3 
Hippo 


- - 


Control 1 Occipital 
Ctx 


2.5 


AD 1 Temporal 
Ctx 


15.0 


Control 2 Occipital 
Ctx 


39.5 


AD 2 Temporal 
Ctx 


22.7 


Control 3 Occipital 
Ctx 


18.6 


AD 3 Temporal 
Ctx 


4.5 


Control 4 Occipital 
(. tx 


________ 

3.8 


AD 4 Temporal 
Ctx 


20.0 


Control (Path) 1 
Occipital Ctx 


61.6 
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AD 5 Inf Temporal I 
( i\ | 


100.0 


Control (Path) 2 
( )ccipital (. tx 


10.4 


AD 5 Sup 

Temporal Ctx | 


36.6 


Control (Path) 3 
Occipital Ctx 


2.4 


AD 6 Inf Temporal 
Ctx 


46.7 


Control (Path) 4 
Occipital Ctx 


14.1 


AD 6 Sup 

Temporal Ctx | 


58.6 


Control 1 Parietal 
Ctx 


5.3 


Control 1 j 
Temporal Ctx i 


5.6 


Control 2 Parietal 
Ctx 


47.0 


Control 2 

Temporal Ctx | 


20.2 


Control 3 Parietal 
Ctx 


14.7 


Control 3 
Temporal Ctx 


15.8 


Control (Path) 1 
Parietal Ctx 


57.4 


Control 3 
Temporal Ctx 


6.1 


Control (Path) 2 
Parietal Ctx 


27.4 


Control (Path) 1 
Temporal Ctx 


50.0 


Control (Path) 3 
Parietal Ctx 


2.2 


Control (Path) 2 
Temporal Ctx 


33.2 


Control (Path) 4 
Parietal Ctx 




Table CKC. General screening panel vl.4 


Tissue Name 


j Rel. Exp.(%) 
I Ag3511,Run 
1 217240774 


Tissue Name 


Rel. Exp.(%) 
Ag3511, Run 
217240774 


Adipose 


i 4.9 


Renal ca. TK- 10 


18.4 

_____ 


Melanoma* 
Hs688(A).T 


15.9 


Bladder 




Melanoma* 
Hs688(B).T 


12.8 


Gastric ca. (liver met.) 
NCI-N87 


24.3 


(Melanoma* M14 


20.6 


Gastric ca. KATO III 


1 00.0 


(Melanoma* 
JLOXIMYI 


1 9.1 


Colon ca. SW-948 


4.7 


(Melanoma* SK- 
jMEL-5 


26.6 


Colon ca. SW480 




63.7 


(Squamous cell 
carcinoma SCC-4 


14.3 


Colon ca.* (SW480 
met) SW620 


26.1 


Testis Pool 


1 6.3 


Colon ca. 1 1 1 _ ( ) 


17.4 


Prostate ca.* (bone 
met) PC-3 


43.5 


Colon ca. HCT-116 


24.3 


Prostate Pool 


6.6 


Colon ca. CaCo-2 


54.7 


Placenta 


0.7 


Colon career tissue 


6.5 


I tcrus Poo! 


4.4 


Colon ca. SWT 1 16 


4.8 


Ovarian ca. 


1 1.7 


Colon ca. Colo-205 


2 -° 



962 





OVCAR-3 










Ovarian ca. SK- 
OV-3 


45.1 


Colon ca. SW-48 


4.0 




Ovarian ca. 

(J V L.AK.-4 


17.9 


Colon Pool 


13.0 




Ovarian ca. 
OVCAR-5 


26.1 


Small Intestine Pool 


14.1 




Ovarian ca. 
IGROV-1 


12.5 


Stomach Pool 


7.2 




Ovctricin cs.. 
OVCAR-8 


10.2 


Bone Marrow Pool 


6.5 




Ovary 


7.5 


Fetal Heart 


73.2 




Breast ca. MCF-7 


9.7 


Heart Pool 


19.1 


J3 


MB-231 


47.0 


Lymph Node Pool 


14.5 




D rpcjc f ~ Q DT *\AQ 

i3rca!>l L-a. J3 I 


58 2 


Fetal Skeletal Muscle 


34.9 




Brcasl ca. 1 }"!> 


44.1 


Skeletal Muscle Pool 


45.1 _____ 




Breast ca. MDA-N 


12.9 


Spleen Pool 


9.8 j 


yj 


Breast Pool 


15.2 


Thymus Pool 


TTo I 


Q 


Trachea 


7.1 


CNS cancer 
(glio/astro) U87-MG 


1 


m 


Lung 


3.8 


CNS cancer 
(glio/astro) I i-l 18-M(i 


33.4 




Fetal Lung 


27.5 


CNS cancer 
(noui-o;mct) Sk-N-AS 


8.5 




Lung ca.NCI-N417 


2.5 


CMS cancel (astro) SK 

.■ 5 - 3 ? 


11.6 






Lung ca. LX-1 


24.0 


CNS cancer (astro) 
SNB-75 


31.6 




Lung ca . NCI-H146 


3.2 


CNS cancer (glio) 
SNB-19 


12.1 




Lung ca . SHP-77 


13.0 


( \S cancer (alio) SI - 
295 


40.9 




Lun^ ca A549 


18.9 


Brain (Amygdala) Pool 


4.7 




Lung ca. NCI-H526 


20.6 


Brain (cerebellum) 


12.0 




Lung ca. NCI-H23 


23.5 


Brain (fetal) 


10.7 

__ 




Lungca.NCI-H460 


10.5 


Brain (1 Iippocumpu >> 
Pool 


3.8 




Lungca. HOP-62 


10.2 


Cerebral Cortex Pool 


7.5 




Lung ca. NCI-H522 


21.8 


Brain (Substantia 
nigra) Pool 






Li\cr 


i~6 


B'.iiii (.Thalamus) Po< >\ 


8.0 




iFetal Liver 


10.9 


Brain (who- > 


5.0 




jLiver ca. HepG2 


7.2 


Spinal Cord Pool 


3.8 
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Kidne\ Pool 


16.3 


Adrenal ( >\nw\ 


5.0 


Fetal Kidney 


16.3 


Pituitary gland Pool 


3.0 


Renal ca. 786-0 j 


11.3 


Salivary Gland 


3.1 


Renal ca. A498 j 


4.2 


Thyroid (female) 


3.1 


Renal ca. ACHN 


10.2 


Pancreatic ca. 
CAPAN2 


25.0 


Renal ca.UO-31 j 


17.1 


Pancreas Pool 


13.0 



Table CKD . Panel 4D 



i 

Tissue Name 


Rel. Exp.(%) 

Ag3511,Run j Tissue Name 
166407112 


Rel. Exp.(%) | 
Ag3511, Run 
166407112 


Secondary Thl act 


26.6 


HUVEC IL-1 beta 


14.8 


Secondary Th2 act 


31.6 


HUVEC IFN gamma 


16.8 1 


Secondary Trl act 


33.7 


HUVEC TNF alpha + 
IFN gamma 


9.7 


Secondary Thl rest 


22.8 


HUVEC TNF alpha + 
IL4 


9.1 


Secondary Th2 rest 


1 7.2 JHUVEC IL-1 1 




Secondetry Trl rest 


2Q ^ |Lung Microvascular EC 
(none 


13.2 


Primary Thl act 


ft . {Lung Microvascular EC 
ITNFalpha + IL-1 beta 


8.3 


Primary Th2 act 


17.2 


Microvascular Dermal 
EC none 


22.7 


Primary Trl act 


25.7 


Microsvasular Dermal 
EC TNFalpha I IL-1 beta 


11 9 


Primary Thl rest 


. (Bronchial epithelium 
[TNFalpha + ILlbeta 


8.0 


JPrimary Th2 rest 


2g j I Small airway epithelium 
Inone 


3.7 


Primary Trl rest 


ISmall airway epithelium 
TNFalpha + IL-1 beta 


24.3 


CD45R \ CD4 
l>mphocyte act 


11.0 


Coronery artery SMC rest 


11.9 


CD45RO CD4 
lymphocyte act 


28.1 


Coronery artery SMC 
TNFalpha + IL-1 beta 


7.6 


CDS lymphocyte act 


19.2 


Astrocytes rest 10.6 


Secondary CD8 
i lymphocyte rest 


15.3 


Astrocytes TNFalpha + 
IL-lbeta 


12.4 


jSecondary CD8 
^lymphocyte act 


20.3 


KU-812 (Basophil) rest 


20.2 


CD4 lymphocyte none 


— 
8.4 


KU-812 (Basophil) 
PMA/ionomycin 


46.7 
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2ry Thl/Th2/Trl_anti- ! 
CD95 CHI 1 


24.8 


CCD 1106 

(Keratinocytes) none 


12.6 


LAK cells rest 


12.2 


CCD 1 106 
(Keratinocytes) 
TNFalpha + IL-1 beta 


53.2 


LAK cells IL-2 


49.3 


Liver cirrhosis 


7.7 


LAK cells IL-2+IL-I2 


27.5 


Lupus kidney 


9.7 


LAK cells IL-2+1FN 
gamma 


47.6 


NCI-H292 none 


59.5 


LAK cells IL-2+IL-18 


35.4 


NCI-H292 IL-4 


100.0 


LAK cells 
PMA/ionomycin 


3.6 


NCI-H292 IL-9 


69.7 


NK Cells IL-2 rest 


46.0 


NCI-H292IL-13 


46.7 


Two Way MLR 3 day 


24.7 


NCI-H292 IFN gamma 


36.6 


Two Way MLR 5 day 


22.1 


Ml' \l ( none 


1 1.8 


Two Way MLR 7 day 


13.9 


HPAEC TNF alpha - II - 
1 beta 


12.9 


PBMC rest 


13.0 


Lung fibroblast none 


13.3 


PBMC PWM 


17.3 


Lung riDroDJast liNr 
alpha + IL-1 beta 


31.9 


rbML r rlA-JL 




Lung fibroblast IL-4 




Ramos (B cell) none 


19.8 


Lung fibroblast IL-9 




Ramos (B cell) 
ionomycin 


18.2 


Lung fibroblast IL-1 3 


6.3 


B lymphocytes PWM 


37.1 


Lung fibroblast IFN 
gamma 


10.4 


B lymphocytes CD40L 

anrl TT A 


34.6 


Dermal fibroblast 
l u / u rci>L 


23.7 


EOL-1 dbcAMP 


14.4 


Dermal fibroblast 

CCT^^ 070 TKFP »lnha 
VvL/JJlU/U I IN JT dipild. 


75.8 

——————J 


DUL-1 1.1 [JL/ \ VI [ 

PMA/ionomycin 




21.6 


Dermal fibroblast 
CCD 1070 IL-1 beta 


18.7 

1 


Dendritic cells none 


10.8 


Dermal fibroblast IFN 
gamma 


5.6 [ 


Dendritic cells LPS 


13.5 


Dermal fibroblast IL-4 




Dendritic cells anti- 
CD40 


14.4 


IBD Colitis 2 


5.3 1 


Monocytes rest 


1 1.6 


IBDCrohiVs 


6.0 


Monocytes LPS 


l - 6 - . 


Colon 




Macrophages rest 


20.3 


Lung 


9.7 


" 1 rophages LPS 


15.0 


Thymus 


16.3 


HI VECnone 


16.8 


Kidney 


19.9 


Mi \ I ( i in I 


29.9 
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CNSjneurodegenerationvl.O Summary: Ag351 1 This panel confirms the expression of 
CG5970 8-01 gene at low levels in the brain in an independent group of individuals. 
However, no differential expression of this gene was detected between Alzheimer's 
diseased postmortem brains and those of non-demented controls in this experiment. 
5 However, as seen in panel 1 .4, this gene is expressed at low levels throughout the CNS, 
including in amygdala, substantia nigra, thalamus, cerebellum, cerebral cortex, and spinal 
cord. Therefore, this gene may play a role in other central nervous system disorders such as 
Parkinson's disease, epilepsy, multiple sclerosis, schizophrenia and depression. 

General_screening_panel_vl.4 Summary: Ag351 1 Highest expression of the CG59708- 
01 is detected in a gastric cancer cell line sample (CT=27.1). Thus, expression of this gene 
can be used to distinguish this sample from other samples in this panel. In addition, high 
levels of expression of this gene are associated with breast cancer, ovarian cancer, and 
gastric cancer cell lines. Therefore, therapeutic modulation of the activity of this gene or its 
protein product might be beneficial in the treatment of these cancers. 

Among tissues with metabolic function, this gene is expressed at moderate to low 
levels in pituitary, adipose, adrenal gland, pancreas, thyroid, and adult and fetal skeletal 
muscle, heart, and liver. This widespread expression among these tissues suggests that this 
gene product may play a role in normal neuroendocrine and metabolic and that disregulated 
expression of this gene may contribute to neuroendocrine disorders or metabolic diseases, 
such as obesity and diabetes. 

This gene is also expressed at moderate to low levels in all regions of the CNS 
examined. Please see Panel CNS_neurodegeneration_vl .0 for discussion of utility of this 
gene in the central nervous system. 

Panel 4D Summary: Ag351 1 Highest expression of the CG59708-01 gene is detected in a 
IL-4 treated NCI-H292 sample (CT=26.4). In addition, this gene is expressed at high to 
moderate levels in a wide range of cell types of significance in the immune response in 
health and disease. These cells include members of the T-cell, B-cell, endothelial cell, 
macrophage/monocyte, and peripheral blood mononuclear cell family, as well as epithelial 
and fibroblast cell types from lung and skin, and normal tissues represented by colon, lung, 
thymus and kidney. This ubiquitous pattern of expression suggests that this gene product 
may be involved in homeostatic processes for these and other cell types and tissues. This 



h 



25 



30 
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pattern is in agreement with the expression profile in General_screeningjpanel_vl .5 and 
also suggests a role for the gene product in cell survival and proliferation. Therefore, 
modulation of the gene product with a functional therapeutic may lead to the alteration of 
functions associated with these cell types and lead to improvement of the symptoms of 
5 patients suffering from autoimmune and inflammatory diseases such as asthma, allergies, 
inflammatory bowel disease, lupus erythematosus, psoriasis, rheumatoid arthritis, and 
osteoarthritis. 

CL. CG 59559-01: CPSase-related 

Expression of gene CG595 59-01 was assessed using the primer-probe set Ag3469, 
1 0 described in Table CLA. Results of the RTQ-PCR runs are shown in Tables CLB, CLC and 
CLD. 



Table CLA . Probe Name Ag3469 



iPrimers 

I 


Sequences 


Length 


Start 
Position 


SEQID j 

NO: [ 


Forvv ard 


5 ' -tccagtcggatcaattctattg-3 1 


22 


1213 


677" | 


jProbe 


TET-5 ' -attcagatgtcccctcatcagcccat-3 ' - 
TAMRA 


26 


1237 


IfLJ 


[Reverse 


5 ' -aattgtcttcgacgaagaaacc-3 1 


22 


1266 


679 | 



Table CLB . CNS neurodegeneration vl.O 



Tissue Name 


Rel. Exp.(%) Ag3469, 
Run 210376662 


Tissue Name 


Rel. Exp.(%) Ag3469, 
Run 210376662 


AD 1 Hippo 


23.5 


Control (Path) 3 
Temporal Ctx 


13.9 


AD 2 Hippo 

j 


31.0 


( out ml (Path) 4 
Temporal Ctx 


35.6 


AD 3 Hippo 


17.1 


AD 1 Occipital Ctx 


— - 


AD 4 Hippo 


23.0 


KDH (ccipital Ctx 
(Missing) 


0.0 


AD 5 Hippo 


21.0 


AD 3 Occipiti 1 Ctx 


2.0 


AD 6 Hippo 


69.3 


\l) 4 Occipital Ctx 25.9 


Control 2 Hippo 


56.6 


AD 5 Occipital Ctx 


31.9 


Control 4 Hippo 


52.5 


AD 6 Occipital Ctx 


10.4 


Control (Path) 3 
Hippo 


7.5 


Control 1 Occipital 
Ctx 


3.3 


AD 1 Temporal 
Ctx 


39.8 


Control 2 Occipital 
Ctx 


42.6 
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AD 2 Temporal 
Ctx 


35.8 


Control 3 Occipital 

[Ctx 


10.7 


AD 3 Temporal 
Ctx 


12.2 


i Control 4 Occipital 
Ctx 


17.6 


AD 4 Temporal 
Ctx 


30.1 


Control (Path) 1 
Occipital Ctx 


59.0 


AD 5 Inf Temporal 
Ctx 


37.6 


Control (Path) 2 
Occipital Ctx 


14.6 


AD 5 Sup 
Temporal Ctx 


33.4 


Control (Path) 3 
Occipital Ctx 


0.0 


AD 6 Inf Temporal 
Ctx 


52.1 


Control (Path) 4 
Occipital Ctx 


10.6 


AD 6 Sup 
Temporal Ctx 


31.0 


| Control 1 Parietal 
Ctx 


9.1 


Control 1 
Temporal Ctx 


12.7 


Control 2 Parietal 
Ctx 


28.5 


Control 2 
Temporal Ctx 


31.4 


Control 3 Parietal 
Ctx 


17.2 


Control 3 
Temporal Ctx 


36.3 


I Control (Path) 1 
Parietal Ctx 


64.2 


Control 3 
Temporal Ctx 


16.8 


Control (Path) 2 
Parietal Ctx 


25.7 


Control (Path) 1 
Temporal Ctx 


100.0 


Control (Path) 3 
Parietal Ctx 




Control (Path) 2 
Temporal Ctx 


62.4 


Control (Path) 4 
Parietal Ctx 


55.9 


Table CLC. General screening_panel vl.4 


Tissue Name 


Rel. Exp.(%) 1 
J Ag3469, Run 

217119419 1 


Tissue Name 


Rel. Exp.(%) 
Ag3469, Run 
217119419 


[Adipose 


3.8 


Renal ca. TK-10 


7.1 


Melanoma* 
Hs688(A).T 


9.2 


Bladder 


5.7 


Melanoma* 
Hs688(B).T 




3astric ca. (liver met.) 
STCI-N87 


4.9 


Melanoma* Ml 4 


0.1 


iastrie ca. KATO (II 


1.0 


Melanoma* 
jLOXIMVI 


0.8 


Colon ca. SW-948 




0.8 


(Melanoma* SK- 
MEL-5 


2.3 


Colon ca. SW480 




0.0 


Squamous cell 
carcinoma SCC-4 


| ..9 


Colon ca.* (SW480 
•net) SW620 


5.1 


Testis Pool 




Colon ca. HT29 


8.9 
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Prostate ca.* (bone 
met) PC-3 


3.6 


Colon ca. HCT-116 


0.7 


Prostate Pool 


1.8 


Colon ca. Ca( "-2 


8.1 


Placenta 


1 A 


Colon cancer tissue 




Litems Pool 


2.7 


Colon ca. SW1116 


0.3 


Ovarian ca. 
OVCAR-3 


7.9 


Colon ca. Colo-205 


1.6 


Ovarian ca. SK- 
OV-3 


11.3 


Colon ca. SW-48 


0.9 


Ovarian ca. 

UVLAK-4 


0.8 


Colon Pool 


10.4 


Ovarian ca. 
OVCAR-5 


9.2 


Small Intestine Pool 


3.6 


Ovarian ca. 
IGROV-1 


5.5 


Stomach Pool 

- 


2.7 


( )\ ^\[ ^^iii 
OVCAR-8 


2.0 


Bone Marrow Pool 


6.3 


Ovary 


16.7 


Ketal Heart 


0.9 


Breast ca. MCF-7 


31.6 


Heart Pool 


3.8 


Brc&st C3.. MDOA" 
MB-231 


4.0 


Lymph Node Pool 


11.9 


Breast ca. P. 1 549 


3.7 


Fetal Skeletal Muscle 


0.4 


Breast ca. T47D 


20.9"~~" 


Skeletal Muscle Pool 


0.5 


Breast ca. MDA-N 


0.1 


Spleen Pool 


10.7 


Breast 1 'ool 


11.3 


Thymus Pool 


7.5 


Trachea 


3.8 


CNS cancer 
(glio'astro) U87-MG 


10.2 


Lung 


9.3 


CNS cancer 
(glio/astro) U-118-MG 


1.3 


Fetal Lung 


3.8 


CNS cancer 
(neuro;met) SK-N-AS 


0.1 


Lung ca. NCI-N417 


0.0 


( NS cancel (astro) SI - 
539 


3.0 


Lung ca. LX-1 


13.8 


CNS cancer (astro) 
SNB-75 


, -. 

10.8 


Lung ca. NCI-H146 

L . 


0.1 


CNS cancer (glio) 
SNB-19 


4.9 


Lung ca. SHP-77 


6.0 


C NS cancel (glio) SF- 
295 


21.0 


Lung ca. A549 


100.0 


Brain (Amygdala) Pool 


I.I 


Lung ca. NCI-H526 


0.0 


Brain (cerebellum) 


2.0 


Lungca. NCI-H23 


0.1 


Brain (fetal) 


1.2 


Lung ca. NCI-H460 


8.9 


Brain (Hippocampus) 
Pool 


1.1 
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Lung ca. HOP-62 


45.4 


Cerebral Cortex Pool 


1.2 


T una MPT T4^9? 
juung La. iN^i-rijzz. 


4 2 


Brain (Substantia 
nigra) Pool 


1 3 


Liver 


0.6 


Br sin (Thai imus ) Pool 


1.2 


[ ; et '.1 Liver 


1.8 


Brain (whole) 


1.1 


Liver ca. IIepG2 


1.5 


Spina! Cord Pool 


1 9 


Kidney Pool 


8.8 


Adrenal Gland 


: s 


Fetal Kidney 


3.7 


Pituitary gland Pool 


«< 


Renal ca. 786-0 


3.9 


Salivary Gland 


_. 13 


Renal ca. A498 


10.5 


Thyroid (female) 


0.7 


Renal ca. ACHN 


,_ 62 


Pancreatic ca. 
CAPAN2 


20.2 


Renal ca. UO-31 


6.3 


Pancreas Pool 





Table C LP. Panel 4. ID 



Tissue Name 


Rel. Exp.(%) 
Ag3469, Run 
169839390 


l T' » 

1 Tissue Name 

I 


Rel. Exp.(%) 
Ag3469, Run 
169839390 


Secondary Thl act 


19.2 


1 II IVEC IL-lbeta 


1.3 


Secondary Th2 act 


25.2 


|HUVEC IFN gamma 


. " 


Secondary Trl act 


14.4 


jHUVEC TNF alpha + 
ll'N gamma 


1.3 


Secondary Thl rest 


26.1 


HUVI C TNF alpha I 

JIL4 


1.1 


Secondary 1 h2 rest 


46.3 


|HUVEC IL-11 


1.1 


Secondary Trl rest 


36.6 


jLung Microvascular EC 

jjnone 


1 0.7 1 


Primary Thl act 


13.6 


jLung Microvascular EC 
rNFalpha - IL-lbeta 


0.4 


Primary Th2 act 


27.7 


^Microvascular Dermal 
jEC none 


0.1 


Primary Trl act 


15.2 


jMicrosvasular Dermal 
EC TNFalpha I IL-lbeta 


0.0 


Primary 1 hi rest 


46.0 


jBronchial epithelium 
TNFalpha + 1L1 beta 


5.8 


Primary Th2 rest 


44.4 


(Small airway epithelium 
[none 


2.0 


Primary Trl rest 


45.1 


J Small airway epithelium 
TNFalpha + IL-lbeta 


6.9 


CD45R \ ( 1)4 
lymphocyte act 


25.2 


(Coronery artery SMC rest 


3.1 


CD45RO CD4 
lymphocyte act 


67.8 


jCoronery artery SMC 
I NI alpha - IL-lbeta 


3.4 
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CD8 lymphocyte act 


50.5 


Astrocytes rest 


1.4 


Secondary CD8 
lymphocj^te rest 


47.0 


Astrocytes TNFalpha + 
IL-lbeta 


3.2 


Secondary CD8 
lymphocyte act 


38.4 


KU-812 (Basophil) rest 


3.0 


CD4 lymphocyte none 


61.1 


KU-812 (Basophil) 
PMA/ionomycin 


6.4 


2ry Thl/Th2/Trl_anti- 
L Uyj LH 1 1 


27.2 


CCD 1106 

(Keratinocytes) none 


3.8 


LAK cells rest 


50.7 


vA^D 1 1 Uo 
(Keratinocytes) 
TNFalpha + IL-lbeta 


5.2 


LAK cells IL-2 


36.9 


Liver cirrhosis 


3.0 


LAK cells IL-2+IL-12 


49.0 


NCI-H292 none 


5.0 


LAK cells IL-2+IFN 
gamma 


59.5 


NCI-H292 IL-4 


5.5 


LAK cells IL-2+ IL-18 


58.2 


NCI-H292 IL-9 


6.8 


LAK cells 
PMA/ionomycin 


65.5 


NCI-H292 IL-13 


6.7 


NK Cells IL-2 rest 


47.0 


NCI-H292 IFN gamma 


8.0 


Two Way MLR 3 day 


54.0 


HPAEC none 


2.7 


Two Way MLR 5 day | 25.2 


HPAEC TNF alpha + IL- 
1 beta 


3.3 


Two Way MLR 7 da\ j 42.6 


Lung fibroblast none J 1 .5 


PBMC rest j 23.5 


Lung fibroblast TNF 
alpha + IL-1 beta 


1.1 


PBMC PWM 37.4 


Lung fibroblast IL-4 


0.7 


PBMC PHA-L ' 59.5 


Lung fibroblast IL-9 


3.1 


Ramos (B cell) none J 1 .8 


Lung fibroblast IL-13 


0.7 


Ramos (B cell) 
ionomycin 


2.5 


Lung fibroblast IFN 
gamma 


0.3 


B lymphocytes PWM 


26.8 


Dermal fibroblast 
CCD 1070 rest 


7.7 


B lymphocytes CD40L 
and IL-4 


100.0 


Dermal fibroblast 
CCD1070 TNF alpha 


36.9 


EOL-1 dbcAMP 


2.7 


Dermal fibroblast 
CCD 1070 IL-1 beta 


5.5 


EOL-1 dbcAMP 
PMA/ionomycin 


0.5 


Dermal fibroblast IFN 
gamma 


0.7 


Dendritic cells none 


10.4 


Dermal fibroblast IL-4 


3.7 


Dendritic cells LPS 


3.2 


Dermal Fibroblasts rest 


0.2 


Dendritic cells anti- 
CD40 


4.2 


Neutrophils TNFa+LPS 


0.8 


Monocytes rest 


1.7 


Neutrophils rest 


3.2 
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\lonoc} tes LPS 


2.6 


|Colon 


4.4 


Macrophages rest 


5.6 


|Lung 


- n 


Macrophages LPS 


2.6 


jThymus 


6.9 


HUVEC none 


0.7 


|Kidney 


2.0 


HUVEC starved 


1.6 




j 



CNS_neurodegeneration_vl.O Summary: Ag3469 This panel confirms the expression of 
the CG5 9559-01 gene at low levels in the brain in an independent group of individuals. 
However, no differential expression of this gene was detected between Alzheimer's 
diseased postmortem brains and those of non-demented controls in this experiment. 
5 However, as seen in panel 1.4, this gene is expressed at low levels throughout the CNS, 
including in amygdala, substantia nigra, thalamus, cerebellum, cerebral cortex, and spinal 
cord. Therefore, this gene may play a role in other central nervous system disorders such as 
Parkinson's disease, epilepsy, multiple sclerosis, schizophrenia and depression. 

General_screening_panel_vl.4 Summary: Ag3469 Highest expression of the CG59559- 
10 01 gene is detected in sample derived from a lung cancer cell line (CT=25.6). Thus, 

expression of this gene can be used to distinguish this sample from other samples used in 
this panel. Furthermore, significant expression of this gene is associated with pancreatic 
cancer, CNS cancer and breast cancer cell lines. Therefore, therapeutic modulation of the 
activity of this gene or its protein product might be beneficial in the treatment of these 
15 cancers. 

Among tissues with metabolic function, this gene is expressed in pituitary, adipose, 
adrenal gland, pancreas, thyroid, and adult and fetal skeletal muscle, heart, and liver. This 
widespread expression among these tissues suggests that this gene product may play a role 
in normal neuroendocrine and metabolic and that disregulated expression of this gene may 
20 contribute to neuroendocrine disorders or metabolic diseases, such as obesity and diabetes. 

This gene is also expressed at low but significant levels in all regions of the CNS 
examined. Please see Panel CNS_neurodegeneration_vl .0 for discussion of utility of this 
gene in the central nervous system. 

Panel 4.1D Summary: Ag3469 Highest expression of the CG59559-01 gene is detected in 
25 sample derived CD40L and IL-4 treated B lymphocytes (CT=27.2). Fur5hermore, this gene 
is expressed at significant levels in a wide range of cell types of significance in the immune 
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response in health and disease. These cells include members of the T-cell, B-cell, 
endothelial cell, and peripheral blood mononuclear cell family, as well as epithelial and 
fibroblast cell types from lung and skin, and normal tissues represented by colon, lung, 
thymus and kidney. This ubiquitous pattern of expression suggests that this gene product 
may be involved in homeostatic processes for these and other cell types and tissues. 
Therefore, modulation of the gene product with a functional therapeutic may lead to the 
alteration of functions associated with these cell types and lead to improvement of the 
symptoms of patients suffering from autoimmune and inflammatory diseases such as 
asthma, allergies, inflammatory bowel disease, lupus erythematosus, psoriasis, rheumatoid 
arthritis., and osteoarthritis. 

CM. CG59669-01: CARBONYL REDUCTASE 

Expression of gene CG59669-01 was assessed using the primer-probe set Ag3505, 
described in Table CMA. 



Table CMA . Probe Name Ag3505 



(Primers 


Sequences 


Length 


Start Position 


seq rn 

NO: 


1 orwarc 


5 ' -ccgggtcccagaatctagt-3 ' 


10 


4 


680 


iProbe 


TET-5 ' -cctacgccacggttttgaccacg-3 ' -TAMRA 


23 


23 


681 j 


[Reverse 


5 1 -gacacacggaccacctgat-3 ' 


19 


74 


6S2 



CNS_neurodegeneration_vl.O Summary: Ag3505 Expression of the CG59669-01 gene 
is low/undetectable (CTs > 35) across all of the samples on this panel (data not shown). 



Generai_screening_panel_vl.4 Summary: Ag3505 Results from one experiment with 
the CG59669-01 gene are not included. The amp plot indicates that there were 
experimental difficulties with this run. 

Panel 4.1D Summary: Ag3505 Expression of the CG59669-01 gene is low/undetectable 
(CTs > 35) across all of the samples on this panel due to a probable probe or chemistry 
failure (data not shown). 

Panel 5 Islet Summary: Ag3505 Expression of the CG59669-01 gene is low/undetectable 
(CTs > 35) across all of the samples on this panel (data not shown). 
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CN. CG59679-01: CARBONYL REDUCTASE 

Expression of gene CG59679-01 was assessed using the primer-probe set Ag3507, 
described in Table CNA. 



Table CNA . Probe Name Ag3507 



jPrimers 


Sequences 


Length 


Start 
Position 


SEQDD 1 

NO: 1 


[Forward 


5 ' -gactggagctaataagggcatt-3 1 


22 


130 


683 ! 


iProbe 


TET-5' -tcgtgacctgtgtcagcaattctcag-3 ' - 

TAMRA 


26 


166 


684 | 


[Reverse 


5 ' -gtgcagtgagcaccacatc-3 ' 


19 


194 


685 | 



CNS_neurodegeneration_vl.O Summary: Ag3507 Expression of the CG59679-01 gene 
is low/undetectable (CTs > 35) across all of the samples on this panel (data not shown). 



General_screening_panel_vl.4 Summary: Ag3507 Expression of the CG59679-01 gene 
is low/undetectable (CTs > 35) across all of the samples on this panel (data not shown). 

Panel 4„1D Summary: Ag3507 Expression of the CG59679-01 gene is low/undetectable 
(CTs > 35) across all of the samples on this panel (data not shown). The data suggest that 
there may have been experimental difficulties with this run. 

Panel 5 Islet Summary: Ag3507 Expression of CG59679-01 gene is low/undetectable 
(CTs > 35) across all of the samples on this panel (data not shown). 

CO. CG59644-01: Putative protein phosphatase 

Expression of gene CG59644-01 was assessed using the primer-probe set Ag3503, 
described in Table COA. Results of the RTQ-PCR runs are shown in Tables COB, COC 
and COD. 



Table CPA . Probe Name Ag3503 



Primers 


Sequences 


Length 


Start 
Position 


SEQID 
NO: 


1 orwar( 


5 ' -tcgtacctagtaatcccattgg-3 ' 




410 


763 


Probe 


TET-5 1 -ccacaagctactgtgagttgctgcaa-3 ' - 
TAMRA 


26 


440 


764 


Reverse 


5 ' -ctaccgagcaaagggactttat-3 ' 


22 ; 467 


765 



974 



Table COB . CNS_neurodegeneration_vl.O 



Tissue Name 


Rel. Exp.(%) Ag3503, 
Run 210938272 


Tissue Name 


Rel. Exp.(%) Ag3503, 
Run 210938272 


AD 1 Hippo 


22.1 


Control (Path) 3 
Temporal Ctx 


5.7 

[ 


AD 2 Hippo 


29 j [Control (Path) 4 
[Temporal Ctx 


25.3 


AD 3 Hippo 


j 08 IAD 1 Occipital 

jCtx 


21.8 


AD 4 Hippo 


9 4 JAD 2 Occipital 
JCtx (Missing) 


0.0 


AD 5 hippo 


100.0 AD 3 Occipital 


8.9 


AD 6 Hippo 


on _ IAD 4 Occipital 
8 °- 7 JCtx 


22.8 


Control 2 Hippo 


97 „ I AD 5 Occipital 
AU JCtx 


26.8 


Control 4 Hippo 


11.4 


AD 6 Occipital 
Ctx 


56.6 


Control (Path) 3 


11.3 


Control 1 Occipital 
Ctx 




5.6 


AD 1 Temporal Ctx 


18.9 


Control : Op^pilal 
Ctx 


93.3 


AD 2 Temporal Ctx 


29.5 


Control 3 Occipital 
Ctx 


15.8 


AD 3 Temporal Ctx 


7.2 


Control 4 Occipital 
Ctx 


6.1 


AD 4 Temporal Ctx 


19.3 


Control (Path) 1 
Occipital Ctx 


80.1 

• ™. - „ J 


AD 5 Inf Temporal 
Ctx 


y3 2 (Control (Path) 2 
Occipital Ctx 


7.5 


AD 5 SupTemporal 
Ctx 


49.0 


Control (Path) 3 
Occipital Ctx 


5.9 


AD 6 Inf Temporal 
Ctx 


68.8 


Control (Path) 4 
Occipital Ctx 


14.5 


AD 6 Sup Temporal 
Ctx 


A (Control 1 Parietal 
660 ]ctx 


7.6 


Control 1 Temporal 
Ctx 


. . IControl 2 Parietal 
JCtx 


29.1 


Control 2 Temporal 
Ctx 


„ 9 Control 3 Parietal 

^ JCtx 


- - - - j 

26.6 


Control 3 Temporal 
Ctx 


Control (Path) 1 
JParietal Ctx 


95.9 


Control 4 Temporal 


11.3 Control (Path) 2 


22.5 
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Ctx 


48.0 


Parietal Ctx 


_ 

6.8 


Contn i (Path) 1 
Temporal Ctx 


( :ontrol (Path) 3 
[Parietal Ctx 


Control (Path) 2 
I'cmporal Ctx 


[Control (Path) 4 
Parietal Ctx 


49.7 


Table COC. General screenine_panel vl.4 


Tiss ue Name 
Adipose 


Rel. Exp.(%) 
Ag3503, Run 
217131685 

12.9 


Tissue Name 

Ren.il ca. TK-10 


Rel. Exp.(%) 
Ag3503, Run 
217131685 

26.8 


Melanoma* 
Hs688(A).T 


7.7 


Bladder 


22.1 


Melanoma* 
Hs688(B).T 


11.0 


Gastric ca. (liver met.) 
NCI-N87 


! 

63.7 


Melanoma 1 " M14 


25.0 


(uiMric ca. KA fO III 


66.0 


Melanoma* 
LOXIMVI 


25.9 


Colon ca. SW-948 


16.4 


Melanoma* SK- 
MEL-5 


69.3 


Colon ca. SW480 


37.4 


Squamous cell 
carcinoma SCC-4 


18.9 


( olon ca.* (SW480 
met) SW620 


24.8 


Testis Pool 


13.0 ( mi ...[ II 12' 


13.7 


Prostate ca.* (bone 
met) PC-3 


51.8 jColonca.HCT-116 


60.3 


Pool 


6.9 


Colon ca. CaCo-2 


■ ■ ■ ■ - 

25.7 


Placenta 


7.1 


(. ■ 'Ion cancer ti ;sue 


34.9 


Utei is Pool 




Clon ca. SWT 1 16 


8.2 


Ovarian ca. 
OVCAR-3 


14.0 


Colon ca. Colo-205 


9.2 


Ovarian ca. SK- 
OV-3 


90.8 


Colon ca. SW-48 


5.8 


Ovarian ca. 
OVCAR-4 




12.5 


Colon Pool 


17.4 


Ovarian ca. 
OVCAR-5 


34.2 


Small Intestine Pool 


15.7 


Ovarian ca. 
IGROV-1 


33.0 


Stomach Pool 


6.6 

— — " 


Ovarian ca. 
OVCAR-8 


17.6 


Bone Marrow Pool 


6.4 


Ovary 


7.8 


Fetal Heart 


12.1 


Breast ca. MCF-7 


15.9 


Heart Pool 


17.7 


Breast ca. MDA- 
MB-231 


55.1 


Lymph Node Pool 


17.6 
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Breast ca. BT 549 


19.8 


Fetal Skeletal Muscle 


4.8 


Breast ca. T47D 


66.9 


Skeletal Muscle Pool 


100.0 


Rrpad 1V!T> A "NJ 
D I Cab I Cd. 1VJuL//\-IN 


12.2 


Spleen Pool 




Breast Pool 


17.6 


Thymus Pool 


14.2 


Trachea 


28.7 


CNS cancer 
(glio/astro) U87-MG 


42.9 


Lung 




3.0 


CNS cancer 
(glio/astro) U-118-MG 


55.1 


Fetal Lung 


18.8 


CNS cancer 
(neuro;met) SK-N-AS 


30.6 


Lung ca. NCI-N417 


9.1 


CNS cancer (astro) SF- 
539 


12.2 


Lung ca. LX-1 


41.2 


CNS cancer (astro) 
SNB-75 


22.4 


Lung ca. NCI-H146 


7.0 


CNS cancer (glio) 
SNB-19 


36.3 


Lung ca. SHP-77 


26.8 


CNS cancer (glio) SF- 
295 


50.7 


Lung ca . A549 


24.8 


Brain (Amygdala) Pool 


12.4 


Lung ca. NCI-H526 




8.0 


Brain (cerebellum) 


14.6 


Lungca. N( l-l i: - 




26.6 


Brain (fetal) 


10.2 


Lung ca. NCI-H460 


29.7 


Brain (Hippocampus) 
Pool 


13.8 


Lung ca. HOP-62 


r* 8.o 


Cerebral C < >i U x Pool 


17.2 


Lung ca. NCI-H522 




19.9 


Brain (Substantia 
nigra) Pool 


19.1 


Liver 




2.9 


Brain (Thalamus) Pool 


18.4 


Fetal Liver 


10.4 


Brain (whole) 


15.7 


Liver ca. HepG2 


14.2 


Spinal Cord Pool 


10.9 


Kidncv Pool 


23.8 


Adrenal Gland 


25.7 


I Fetal Ki dney 




10.4 


Pituitary gland Pool 


3.9 S 


Renal ca. 786-0 




12.3 


Salivary' Gland 


11.1 


Renal ca. A498 


3.7 


Tin mid (female) 


4.4 


Renal ca. ACHN 


27.9 


Pancreatic ca. 
CAPAN2 




15.0 


Renal lea. UO-3 1 




12.0 


Pa creas Pool 




17.1 


Table COD. Panel 4D 


Tissue Name 


Rel. £xp.(%) 
Ag3503,Run 
166441943 


Tissue Name 


Rel. Exp.(%) 
Ag3503, Run 
166441943 


Secondary Thl act 


54.3 


HUVEC IL-lbeta 


20.4 


Secondary Th2 act 


52.5 


HUVEC IFN gamma 


14.6 
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Secondeiry Trl act 


61 j IhUVEC TNFTlpha"+ 


16.0 


Secondary Thl rest 


HUVECTNF alpha + 
34 6 [L4 


24.8 


Secondaiy Th2 rest 


20.0 (HUVECIL-11 


6.6 


Secondary Trl rest 


2^ q Tung Microvascular EC 
none 


12.9 


Primary Thl act 


-g , Tung Microvascular EC 
[TNFalpha + IL-lbeta 


17.4 


Primary Th2 act 


^ g (Microvascular Dermal 
IX' none 


21.6 


Primary Trl act 


Q9 „ IMicrosvasular Dermal 

[EC TNFalpha + IL-lbeta 


20.2 


Primary Thl rest 

— 


94.6 


Bronchial epithelium 
TNFalpha + IL1 beta 


17.7 


Primary Th2 rest 


31.4 


Small airway epithelium 
none 


7.9 


Primary Trl rest 


„ iSmall airway epithelium 
JTNFalpha + IL-lbeta 


53.6 


CD45RA CD4 
lymphocyte act 


44. 1 ICoronery artery SMC rest 


7.7 


CD45RO CD4 
lymphocyte act 


65.5 


Coronery artery SMC 
rNFalpha + IL-lbeta 


8.3 


CD8 lymphocyte act 


68.3 


Astrocytes rest 


11.7 


Secondary CD8 
lymphocyte rest 


68.8 


Astrocytes TNFalpha + 
IL-lbeta 


21.9 


Secondary CD8 
lymphocyte act 


40.1 


KU-8 12 (Basophil) rest 


12.1 


CD4 lymphocyte none 


20.9 


KU-812 (Basophil) 
PMA/ionomycin 


42.6 


2r\ fill lit? rrl_anti- 
CD95 CHI 1 


40.9 


CCD1106 

(Keratinocytes) none 


21.0 


LAK cells rest 


22.7 


CCD1106 
(Keratinocytes) 
TNFalpha + IL-lbeta 


72.2 


LAK cells IL-2 


46.7 


Liver cirrhosis 


12.8 


LAK cells IL-2+I1 -12 


38.2 


Lupus kidney 


16.7 


LAK cells IL-2+IFN 
gamma 


68.8 


NCI-H292 none 


33.0 


LAK cells II -2 - 11- IN 


57.8 NCI-1I292IL-4 


35.1 


LAK cells 

PM \ ionomycin 


100.0 'NCI-H292 IL-9 


35.4 


NK Cells IL-2 rest 


32.8 [NCI-H292 IL-13 


24.8 


1 wo \Va> MLR 3 da> 


42.9 NCI-H292 U N gamma 


22.5 


Two Way MLR 5 da\ 


39.8 IHPAEC none 


11.2 



978 



Two Way MLR 7 day 


HPAEC TNF alpha + IL- 
55J Jl beta 


27 9 


PBMC rest 


22. ^ L un g fibroblast none 


10.6 


PBMC PWM 


Lun tibrobla i INI 
alpha -I- [JL-I beta 


25.5 


PBMC PHA-L 


20 6 Lung fibroblast IL-4 


1 1 .6 


Ramos (B cell) none 


56.3 Lung fibroblast IL-9 


5.4 


Ramos (B cell) 
ionomycin 


38.4 Lung fibroblast IL- 13 


8.5 


B lymphocytes PWM 


^ g |Lung fibroblast IFN 
gamma 


.1.7 


B lymphocytes CD40L 

anH TT A 
anu 


. _ Q iDermal fibroblast 
|CCD1 070 rest 


16.4 


EOL-1 dbcAMP 


1 8 ? | Dermal fibroblast 

1CCD1070 TNF alpha 


56.3 


JtiWL- 1 (lDCAlVlr 

PMA/ionomycin 


30.4 


Dermal fibroblast 
CCD1070IL-1 beta 


14.4 


Dendritic cells none 


31.2 


Dermal fibroblast IFN 
gamma 


14.5 


Dendritic eel K 1 I's 


40.9 


Dermal fibroblast IL-4 


- ■ ■ 

29.1 


Dendritic cells anti- 
CD40 


31.2 |lBD Colitis 2 


13 7 


Monocytes rest 


50.0 |IBD Crohn's 


11.6 


Monocytes LPS 


31.0 Colon 


70.7 


Macrop lages rest 


26.4 


Lung 


17.1 


Macropiiagcs 1 PS 


28.9 


Thymus 


29.3 


HUVEC none 


22.2 


Kidney 


40.1 


HUVEC starved 


25.3 ! 





CNS_neurodegeneration_vl.O Summary: Ag3503 This panel confirms the expression of 
this gens at low levels in the brains of an independent group of individuals. However, no 
differential expression of this gene was detected between Alzheimer's diseased postmortem 
brains and those of non-demented controls in this experiment. Please see Panel 1 .4 for a 
5 discussion of the potential utility of this gene in treatment of central nervous system 
disorders. 

General_screening_panel_vl.4 Summary: Ag3503 Expression of the CG59644-01 gene 
is highest in adult skeletal muscle (CT = 25.5). Interestingly, expression of this gene is 
much lower in fetal skeletal muscle (CT = 29.9), suggesting that expression of this gene 
10 may be used to distinguish adult and fetal skeletal muscle. 
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The CG59644-01 gene encodes a protein with homology to protein phosphatases. 
This gene is expressed at high to moderate levels in the majority of samples on this panel. 
However, expression of this gene appears to be higher in cancer cell lines when compared 
to normal adult tissues. This observation is consistent with the potential role for this gene 
5 product in cell survival and proliferation. 

In addition, this gene is expressed at high levels in all regions of the central nervous 
system examined, including amygdala, hippocampus, substantia nigra, thalamus, 
cerebellum, cerebral cortex, and spinal cord. Therefore, this gene may play a role in central 
nervous system disorders such as Alzheimer's disease, Parkinson's disease, epilepsy, 
10 multiple sclerosis, schizophrenia and depression. 

Among tissues with metabolic or endocrine function, this gene is expressed at high 
to moderate levels in pancreas, adipose, adrenal gland, thyroid, pituitary gland, skeletal 
muscle, heart, liver and the gastrointestinal tract. Therefore, therapeutic modulation of the 
activity of this gene may prove useful in the treatment of endocrine/metabolically related 
1 5 diseases, such as obesity and diabetes. 

Panel 4D Summary: Ag3503 This gene is expressed at high to moderate levels in a wide 
range of cell types of significance in the immune response in health and disease. These 
cells include T cells, B cells, endothelial cells, macrophages, monocytes, dendritic cells, 
basophils, eosinophils and peripheral blood mononuclear cells, as well as epithelial and 

20 fibroblast cell types from lung and skin, and normal tissues represented by colon, lung, 
thymus and kidney. This ubiquitous pattern of expression suggests that this gene product 
may be involved in homeostatic processes for these and other cell types and tissues. This 
pattern is in agreement with the expression profile in General_screening_panel_vl.5 and 
also suggests a role for the gene product in cell survival and proliferation. Therefore, 

25 therapeutic modulation of the activity of this gene or its protein product may lead to the 
alteration of functions associated with these cell types and lead to improvement of the 
symptoms of patients suffering from autoimmune and inflammatory diseases such as 
asthma, allergies, inflammatory bowel disease, lupus erythematosus, psoriasis, rheumatoid 
arthritis, and osteoarthritis. 

30 CP. CG 59662-01: Cyclophilin 
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Expression of gene CG59662-01 was assessed using the primer-probe set Ag3504, 
described in Table CPA. Results of the RTQ-PCR runs are shown in Tables CPB and CPC. 



Table CPA . Probe Name Ag3504 



Primers 


Sequences 


Length 


Start 
Position 


I SEQID 1 
NO: 


Forwarc 


5 1 -ggtcaaccacaccatgttctt-3 ' 


21 


22 


! 686 


Probe 


rET- 5 ' , r tg-3'- 
TAMRA 


25 


67 


1 687 


Reverse 


5 1 -tctttggaaacttttctgcaaa-3 ' 


22 


92 


| 688 



Table CPB . General_screening_panel_vl.4 



Tissue Name 


Dal V-vn 

tvei. JtLXp.^ /o ) 

Ag3504, Run 
217236170 


Tissue Name 


I> „I 17 (0/ \ 

IvCl. Ji,xp.^ /o ) 

Ag3504, Run 
217236170 


Adipose 


0.5 


Renal ca. I K- in 


0 \ 


Hs688(A). r 


1.0 


Bladder 


0.6 


^SAq l&HOHl cL* 

Hs688(B).T 


0.4 


Gastric ca. (liver met.) 
NCI-N87 


4.8 


Mclani'ina* Ml 4 


100.0 


Gastric ca. KATO III 


14.2 


Melanoma* 
LOXIMVI 


3.9 


Colon ca. S W-948 


2.3 


Melanoma* SK- 
MEL-5 


4.9 


Colon ca. SW480 


12.9 


Squamous cell 
carcinoma SCC-4 


47.6 


( olon ca * (SW480 
met) SW620 


0.3 


Testis Pool 


2.7 


Colon ca. HT29 


----- 


Prostate ca.* (bone 
met) PC-3 


3.1 


Colon ca. HCT-116 


1, 


Prostate Pool 


5.7 


( !olon ca. CaCo-2 


11.6 


Placenta 


5.0 


Colon cancer tissue 


7.6 


Uterus Pool 


2.2 


Colon ca.SWI 116 


1.5 


Ovarian ca. 
OVCAR-3 


20.6 


Colon ca. Colo-205 


11.7 


Ovarian ca. SK- 
OV-3 


6.3 


Colon ca. S W-48 


6.2 


Ovarian ca. 
OVCAR-4 


1.5 


Colon Pool 


4.7 


Ovarian ca. 
OVCAR-5 


10.2 


Small Intestine Pool 


2.5 


Ovarian ca. 


0.6 


Stomach Pool 


5.8 
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IGROV-1 






Ovarian ca. 
OVCAR-8 


0.7 


Bone Marrow Pool 


1.2 


Ovary 


2.2 


Fetal I leart 


3.3 


Breast ca. MCI -7 


2.9 


Heart Pool 


2.2 


Breast ca. MDA- 
MB-231 


4.0 


Lymph Node Pool 


11.8 


Breast ca. BT 549 


5.6 


Fetal Skeletal Muscle 


0.9 


Breast ca. T47D 


15.0 


Skeletal Muscle fd >1 


32.3 




17.7 


Spleen Pool 




Breast Pool 


0.9 


Thymus Pool 


9.3 


Trachea 


2.0 


CNS cancer 
(glio/astro) U87-MG 


1.0 


Lung 


16.3 


CNS cancer 
(glio/astro) U-118-MG 


3.3 


Fetal Lung 


42.0 


CNS cancer 
(ncum:met) SK-N-AS 


6.2 


Lung ca.NCI-N417 

- - - 


3.6 


CNS cane-i (astro) SF- 

539 


2.3 


Lung ca. LX-1 


0.7 


CNS cancer (astro) 
SNB-75 


- - - — - 

12.7 


Lung ca.NCI-H146 


ICNS cancer (glio) 
[SNB-19 


4.1 


Lung ca. SHP-77 


17.6 


CNS cancer (glio) SF- 
205 


_____ 

°- 7 


Lung ca.. A549 


19.9 


Bono ( \iir j Jala) Pool 


3.3 


Lung ca. NCI-H526 


4.3 


Brain (cerebellum i 


2 3 


Lung ca. NCI-H23 


3.3 


I! am (ki 1 


0.9 


Lung ca. NCI-H460 


3.8 


Brain (Hippocampus) 
Pool 


16.3 


Lung ca. HOP-62 


0.7 


Ccifhial Cortev Pool 


1 


Lung ca.. INCl-H dZZ 


_ __ ] 


Brain (Substantia 
nigra) Pool 


4.6 


Liver 


1.2 


Brain iThalciiuiM Pool 


9.0 


Fetal Liver 


17.2 


Brain (whole) 


1.4 


Liver a . 1 IepG2 




Spinal Cot d Pool 


11.6 


Kidney Pool 




Adrenal Gland 


0.3" " 1 


Fetal Kidney 


7.1 


Pi tu >ar) gland Pool 


3.9 ' 


Renal ca. 786-0 


1.2 


Salivary Gland 


1.2 


Renal ca. A-fQS 


2.4 Thyroid (female) 


5.5 j 


Renal ca. ACHN 


. ~ IjPancreatic ca. 
(CAPAN2 


0.6 


Filial ca W )-3 1 


2.5 (Pancreas Pool 


8.8 j 
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Table C PC. Panel 4D 



j Rel. Exp.(%) 
Tissue Name 1 Ag3504, Run 
166407168 


Tissue Name 


Rel. Exp.(%) 
Ag3504, Run 
166407168 


Secondary Th 1 act 0.0 


HUVEC IL-1 beta 


0.0 


Secondary Th2 act | 0.0 


HUVEC I FN gamma 


0.0 


Secondary Trl act 0.0 


HUVEC TNF alpha + 
U N gamma 


0.0 


Secondary Thl rest 


0.0 


HUVEC TNF alpha +1 

IL4 j 


Secondary Th2 rest 


0.0 jHUVEC IL-1 1 


0.0 


Secondary Trl rest 


q q Lung Microvascular EC 

jnone 


0.0 


Primary Thl act 


n iLung Microvascular EC 
jTNFalpha + IL-lbeta 


0.0 


Primary Th2 act 


q q (Microvascular Dermal 
LC none 


0.0 


Primary Trl act 


„ „ IMicrosvasular Dermal 1 ft ~ 
EC TNFalpha + IL- 1 beta [ 


Primary Thl rest 


0.0 


Bronchial epithelium 
TNFalpha + IL1 beta 


0.0 


Primary Th2 rest 


0.0 


Small airway epithelium 
none 


0.0 


Primary Trl rest 


0.0 


Small airway epithelium 
TNFalpha + IL-1 beta 


0.0 


CD45RA CD4 
lymphocyte act 


Coronery artery SMC rest 


0.0 


CD45RO CD4 
lymphocyte act 


0.0 


Coronery artery SMC 
TNFalpha + IL-1 beta 


0.0 


CDS lymphocyte act 


0.0 


Astroc>tes rest 


0.0 


Secondary CD8 
l\ mphocytc rest 


0.0 


Astrocytes TNFalpha + 
IL-1 beta 


0.0 


Secondary CD8 
lymphocyte act 


0.0 


KU-812 (Basophil) rest 


0.0 


CD4 lymphocyte none 


0Q |KU-8 12 (Basophil) 
jPMA/ionomycin 


0.0 


2ryThl/Th2/Trl anti- 
( \y>5 CHll 


0Q |CCD1106 

i(kcratin >c) tes) aom 


0.0 


LAK cells rest 


0.0 


CCD 1106 
(Keratinocytes) 
TNFalpha + IL-1 beta 


0.0 


LAK cells IL-2 


0.0 


Liver cirrhosis 


100.0 


LAK cells IL-2+1L-12 


0.0 


; li | lis kidney 


.1(1 


LAK cells IL-2+IFN 


0.0 


NCI-H292 none 


0.0 



983 



gamma 








LAK cells 1] -2+ IL-18 


0.0 


NCI-H292 IL-4 


0.0 


LAK cells 
PMA/ionomycin 


0.0 


NCI-H292 IL-9 


0.0 


NK Cellls IL-2 rest 


0.0 


NCI-H292 IL-13 


0.0 


Two Way MLR 3 day 


0.0 


NCLH292 I FN gamma 


0.0 


Two Way MLR 5 day 


0.0 


HPAEC none 


0.0 


Two Way MLR 7 day 


0.0 


HPAEC TNF alpha + IL- 
1 beta 


0.0 


PBMC rest 

, , ,. 


> ™ 


Lung fibroblast none 


0.0 


PBMC PWM 


0.0 




Lung fibroblast TNF 
alpha + IL-l beta 


0.0 


I Dlvll^ J. XXrV-JL/ 




Lung fibroblast IL-4 




Ramos i B cell) none 


0.0 


Lunt: fibroblast IJ -9 


0.0 


Ramos (B cell) 
ionomycin 


0.0 


Lung fibroblast IL-13 


0.0 


B lymphocytes PWM 


0.0 


Lung fibroblast IFN 
gamma 


3.5 


B lymphocytes CD40L 
and IL-4 


0.0 


Dermal fibroblast 
C( D 1070 rest 


0.0 


EOL-1 dbcAMP 


0.0 


Dermal fibroblast 
CCD 1 070 TNF alpha 


0.0 


JbUL-1 dDCAMr 
PMA/ionomycin 


0.0 


Dermal fibroblast 
CCD 1070 IL-l beta 


0.0 


i 

Dendritic cells none 


0.0 


Dermal fibroblast IFN 
gamma 


2.7 


Dendritic cells LPS 


0.0 


Dermal fibroblast IL-4 


' 0.0 " * 


Dendrit ic cells anti- 
CD40 


0.0 


IrSJLi UOlltlS z 


3-D 


Monocjtes rest 


0.0 


[BD CrohiVs 


0.0 


[ lono( \tes LPS 


0.0 


Colon 


17.0 


P lacrophi > rest 


III) 


Lung 


2.2 


M icrophages LI'S 


0.0 


Thymus 


9.0 


HUVEC none 


0.0 


Kidney 


0.0 


HUVEC starved 


H() 







CNS_neurodegeneration_vl.O Summary: Ag3504 Expression of the CG59662-01 gene 
is low/undetectable (CTs > 35) across all of the samples on this panel (data not shown). 



General_screening_panel_vl.4 Summary: Ag3504 The CG59662-01 gene is expressed 
at low levels in the majority of samples on this panel, with highest expression in a 
melanoma cell line (CT = 30). The CG59662-01 gene encodes a protein with homology to 
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cyclophilin, a specific high-affinity binding protein for the immunosuppressant agent 
cyclosporin A. 



Among tissues with metabolic or endocrine function, this gene is expressed at low 
levels in pancreas, thyroid, pituitary gland, skeletal muscle, heart, liver and the 
5 gastrointestinal tract. Therefore, therapeutic modulation of the activity of this gene may 
prove useful in the treatment of endocrine/metabolically related diseases, such as obesity 
and diabetes. Interestingly, this gene is expressed at higher levels in fetal liver (CT = 32.5) 
than in adult liver (CT = 36.4), suggesting that expression of this gene can be used to 
distinguish fetal and adult liver. 

10 In addition, this gene is expressed at low levels in some regions of the central 

nervous system, including amygdala, hippocampus, substantia nigra, thalamus, and spinal 
cord. Therefore, this gene may play a role in central nervous system disorders such as 
Alzheimer's disease, Parkinson's disease, epilepsy, multiple sclerosis, schizophrenia and 
depressiion. 

15 Panel 4D Summary: Ag3504 Significant expression of this gene is detected in a liver 
cirrhosis sample (CT = 34.4). Furthermore, expression of this gene is not detected at 
significant levels in normal adult liver in Panel 1 .4, suggesting that its expression is unique 
to liver cirrhosis. This gene encodes a putative cyclophilin; therefore, small molecule 
therapeutics could reduce or inhibit fibrosis that occurs in liver cirrhosis. In addition, 

20 expression of this putative cyclophilin could also be used for the diagnosis of liver 
cirrhosis. 

CQ. CG59773-01 : splice variant of myomegalin 

Expression of gene CG59773-01 was assessed using the primer-probe set Ag3580, 
described in Table CQA. Results of the RTQ-PCR runs are shown in Tables CQB, CQC 
25 and CQD. 

Table CPA . Probe Name Ag3580 



Primers 


Sequences 


Length 


Start 
Position 


SEQID 
NO: 


Forw ard 


5 1 -tggagttagttccggtttcc-3 1 


20 


1 647 689 


Probe 


TET-5 ' -tgaaacccctttacaagacacctgtg-3 ' - 
TAMRA 


26 


1679 


690 



985 



'Reverse ;5 ' -ctgaaagct-asgggitaact-3 1 1 22 1 1 705 \ 691 

Table COB . CNS_neurodegeneration_vl.0 



Tissue 
Name 


I£xp.(%) 

A „?ron r> .. .. 

Ag.5380, Kun 
210643837 


Rel. Exp.(%) 
Ag3580, Run 
224078543 


Tissue 
Name 


Rel. Exp.(%) 
AgJSoO, Kun 
210643837 


Rel. K\p.(%) 
Ag3580, Run 
22 4078543 


AD 1 Hippo 


28.9 


(Control 
31Q (Path) 3 
^Temporal 
|Ctx 


5.9 


4.8 


AD 2 Hippo 


1 

20.9 


[Control 

19.9 L Path)4 . 

ITemporal 

:ctx 


19.9 


19.2 


AD 3 Hippo 

| 


10.7 


9.7 


AD 1 

Occipital 

Ctx 


26.1 


20.3 


AD 4 Hippo 


5.4 


4.4 


AD 2 

Occipital 

Ctx 

(Missing) 


0.0 


0.0 


AD 5 Hippo 


100.0 


100.0 


AD 3 

Occipital 

Ctx 


9.9 


8.8 


AD 6 Hippo 


78.5 


62.4 




AD 4 

Occipital 

Ctx 

AD 5 

Occipital 

Ctx 


13.5 


13.2 


Control 2 
Hippo 


15.3 


14.6 


19.6 


16.6 


Control 4 
Hippo 


13.5 


12.1 


AD 6 

Occipital 

Ctx 


18.4 


17.9 


Control 
(Path) 3 
Hippo 


7.6 


7.5 


Control 1 
Occipital 
Ctx 


2.3 


2.1 


AD 1 

Temporal 

Ctx 


38.2 


29.7 


Control 2 
Occipital 
Ctx 


31.2 


29.3 


AD 2 " " " 
Temporal 
Ctx 


24.8 


20.7 


Control 3 
Occipital 
Ctx 


10.5 

- m ™. 


9.5 


AD 3 

Temporal 

Ctx 


13.2 


9.1 


Control 4 
Occipital 
Ctx 


7.7 


7.2 


AD 4 


16.5 


17.6 


Control 


- - - 

47.6 


4S.3 



986 



Temporal 
Ctx 




j(Path) 1 

lOccipital 

JCtx 






AD 5 Inf 
Temporal 
Ctx 


89.5 


81.2 


Control 
(Path) 2 
Occipital 
Ctx 


8.0 


„.., 


AD 5 Sup 
Temporal 
Ctx 


66.0 


63.7 


Control 
(Path) 3 
Occipital 
Ctx 


2.9 


2.3 


AD 6 Inf 
Ctx P 


47.3 


45.1 


Control 
(Path) 4 
Occipital 
Ctx 


9.9 


0 1 


AD 6 Sup 
Temporal 
Ctx 


41.8 




38.4 


Control 1 
Parietal Ctx 


5.4 


5.1 


Control 1 
Temporal 

CtX 


4.6 


4.5 


Control 2 
Parietal Ctx 


67.8 


59.0 


Control 2 
Temporal 

( L\ 


17.6 

______ 


. A ~ jControl 3 

Parietal Ctx 


9.8 


8.4 


Control 3 
Temporal 
Ctx 


9.0 


jControl 
8.5 (Path) 1 

Parietal Ctx 


40.6 


35.8 


Control 3 
Temporal 
Ctx 


6.0 


JControl 
5.9 (Path) 2 

(Parietal Ctx 


15.2 

——— 


14.1 


[Control 
(Path) 1 
Temporal 
Ctx 


29.1 




f — f~~ 1 

IControl 
27.4 (Path) 3 

^Parietal Ctx 


4.9 


3.2 


[Control 
(Path) 2 
Temporal 
Ctx 


22 1 


Control 
20.6 (Path) 4 

IParietal Ctx 


24.1 


23.8 



Table CQC . General_screening_panel_vl.4 





Rel. Exp.(%) 




Rel. Exp.(%) 


Tissue Name 


Ag3580, Run 


Tissue Name 


Ag3580, Run 




217423587 




217423587 


Adipose 


20.7 


Renal ca. 1 K-10 


20.7 


Melanoma* 
Hs688(A).T 


76.8 


Bladder 


15.7 
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Melanoma* 
Hs688(B).T 


38.7 


Gastric ca. (liver met.) 
NCI-N87 


50.0 


IVlelanoma* ^114 


7.6 


Gastric ca. KATO III 


13.1 


Melanoma* 
LOXIMVI 


81.2 


Colon ca. SW-948 


5.8 


Mfelcinoms* SK~ 
MEL-5 


14.9 


Colon ca. S W480 


12.9 


Squamous cell 
carcinoma SCC-4 


10.2 


Colon ca.* (SW480 
met) SW620 


2.6 


Testis Pool 


4.5 


Colon ca. HT29 


4.5 


Prostate ca.* (bone 
met) PC-3 


29.1 


Colon ca. HCT-116 


15.0 


Prostate Pool 


6.1 


Colon ca. CaCo-2 


29.7 


P 1 scent £i 


8 2 


Colon cancer tissue 


8 4 


l 'let a- P. »ol 


7.7 


Colon ca. SW1116 


2.5 


Ovarian ca. 
OVCAR-3 


24.0 


Colon ca. Colo-205 


2.9 


Ovarian ca. SK- 
OV-3 


15.4 


Colon ca. SW-48 


7.1 


Ovarian ca. 


17.0 


Colon Pool 


18.8 


Ovarian ca. 
OVCAR-5 


19.8 


Small Intestine Pool 


18.0 


Ovarian ca. 
IGROV-1 


15.1 


Stomach Pool 


7.4 


Ovarian ca. 
OVCAR-8 


6.7 


Bone Marrow Pool 


9.9 


Ovary 


4.4 


Fetal 1 lean 


«).! 


Breast ca. MCF-7 


28.9 


Heart Pool 


8.2 


Breast ca. MDA- 
MB-23I 


39.8 


Lymph Node Pool 


22.8 


Breast ca. BT 549 


13.9 


Fetal Skeletal Muscle 


6.8 


Breast ca. T47D 


49.0 


Skeletal Muscle Pool 


22.7 


Rrpact ra A/TDA-M 
J3I CaM Cd. ]y±Ur\ -IN 


7 6 


Ct-ili=»i=»« 

opieen roui 


12 1 


Breast Pool 


9.5 


Thxmus Pool 


19.9 


Trachea 


15.0 


CNS cancer 
(glio/astro) U87-MG 


35.1 


Lung 


4.0 


CNS cancer 
(glio/astro) U-118-MG 


28.9 


Fetal Lung 


30.6 


CNS cancer 
(neuro:met) SK-N-AS 


3.4 


Lung ca.NCI-N417 


5.4 


CNS cancer (astro) SF- 
539 


13.4 


Lung ca. LX-1 


6.6 


CNS cancer (astro) 
SNB-75 


57.4 
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Lungca.NCI-H146 


, - R CNS cancer (glio) 
JSNB-19 


9.0 


Lung ca. SHP-77 


9 Q jCNS cancer (glio) SF- 
1295 


■■ ™ 

49.7 


Lung ca. A549 


4.2 


Brain (Amygdala) Pool 


15.9 


Lung ca. NCI-H526 


7.6 


Brain (cerebellum) 


100.0 


Lung ca. NCI-H23 


50.0 jBrain (fetal) 


69.3 


Lung ca. NCI-H460 


^ g jBrain (Hippocampus) 
jPool 


16.0 


Lung ca. HOP-62 


24.3 Cerebral ( orrex Pool 


26.1 


T liner ra NPT-HS?? 


^ - (Brain (Substantia 
(nigra) Pool 


' 


Liver 


4.9 


Brain (Thalamus) Pool 


37.4 


Fetal Liver 


12.0 


Brain (whole) 


33.4 


1 v .1 .1 S k|M 1 1 


11.1 


pin 1 ( ord l> ool 


34.4 


Kidney Pool 
Fetal Kidney 


54.0 


Adrenal Gland 


5.2 


40.3 


Pituitary gland Pool 


9.7 


Renal ca. 786-0 


13.9 


Salivary Gland 


3.4 


Renal ca. A498 


15.0 


Thyroid (female) 


5.3 


Renal ca. ACHN 


1.6 


Pancreatic ca. 
CAPAN2 


16.4 


Renal ca. 1 JO-3 1 


10.2 


Pancreas Pool 


17.2 



Table COD . Panel 4. ID 



Tissue Name 


Rel. Exp.(%) 

Ag3580, Run Tissue Name 
169910382 


Rel. Exp.(%) 
Ag3580, Run 
169910382 


Secondary Thl act 


28.3 HUVEC IL-lbeta 


28.5 


second, ry Th2 act 


32.8 HUVEC I FN gamma 


8.0 


Secondstry Trl act 


;4 , HUVEC TNT alpha v 
jlFN gamma 


30.1 


Secondary Thl rest 


23.7 


HUVEC TNF alpha + 
IL4 


27.2 


Secondsiry Th2 rest 


26.4 


HUVEC IL-11 


8.8 


Secondary Trl rest 


34.2 


Lung Microvascular EC 
none 


42.9 


Primary Thl act 


11.2 


Lung Microvascular EC 
TNFalpha + IL-lbeta 


33.2 


Primary Th2 act 


j q g Microvascular Dermal 
(EC none 


24.8 


Primary Trl act 


, , n JMicrosvasular Dermal 

JEC TNFalpha + IL-lbeta 


20.7 


Primary Thl rest 


32.3 Bronchial epithelium 


18.6 
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1 NFalpha i [Libera 


~™"' ■— - 


Primary Th2 rest 


^ j 2 (Small airway epithelium 
(none 


8.2 


Primary Trl rest 


24.1 


Small airway epithelium 
TNFalpha + IL-lbeta 


10.7 


CD45RA CD4 
l> mphocyte act 


50.0 


Coronery artery SMC rest 


10.7 


CD45RO CD4 
lymphocyte act 


7 - - Coronery artery SMC 
jTNFalpha + IL-lbeta 


15.6 


CD8 lymphocyte act j 18.8 (Astrocytes rest 


14.1 


Secondary CD8 
lymphocyte rest 


16.3 


Astrocytes TNFalpha + 
IL-lbeta 


13.0 


Secondary CD8 
lymphocyte act 


16.3 


KU-812 (Basophil) rest 


4.0 


CD4 lymphocyte none 


22.4 


KU-8 12 (Basophil) 
PMA/ionomycin 


7.8 


2ry Thl/Th2/Trl_anti- 


47.6 


CCD1106 

(Keratinocytes) none 


24.3 


LAK cells rest 


24.1 


CCD1106 
(Keratinocytes) 
TNFalpha + IL-lbeta 


26.4 


LAK cells IL-2 


21.8 (Liver cirrhosis 


12.0 


LAK cells IL-2+IL-12 


23.2 INCI-H292 none 


9.6 


LAK cells IL-2+IFN 
gamma 


29.1 |NCI-H292 IL-4 


15.5 


LAK cells IL-2+ IL-18 


32.5 NCI-H292 IL-9 


26.2 


T AT<f r<=>11c 

ij/\JS. OCilS 

PMA/ionomycin 


35.8 INCI-H292IL-13 

I 


15.3 


NK Cells IL-2 rest 


26.1 NCI-H292 1FN gamma 


14.0 


Two Way MLR 3 day 


35.8 HPAECnone 


8.2 1 


Two Way MLR 5 day 


l9 - HP VEC TNF alpha + IL- 
1 beta 


31.9 


Two Way MLR 7 day 


46.0 Lung fibroblast none 


51.8 


PBMC rest 


j 5 g > Lun g fibroblast TNF 
alpha + IL-1 beta 


44.8 


PBMC PWM 


16.0 


Lung fibroblast 1 1 - 1 


32.8 


PBMC PHA-L 


13.3 


Lung fibroblast II .-9 


51.4 


Ramos (B cell) none 


16.8 


Lung fibroblast 1 L - 1 3 


29.9 


Ramos (B cell) 
ionomycin 


15.9 


Lung fibroblast IFN 
gamma 


27.7 


B lymphocytes PWM 


1 4 a JDermal fibroblast 
|CCD1070 rest 


38.4 ! 


B lymphocytes CD40L 
and IL-4 


1 , _ iDermal fibroblast 

CCD1070 TNF alpha 


. - J 

100.0 


EOL-1 dbcAMP 


5.6 IDermal fibroblast 


48.3 
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- 




CCD 1070 IL-1 beta 




EOL-1 dbcAMP 
PMA/ionomycin 


29.3 


Dermal fibroblast II N 
gamma 


11.0 


Dendritic cells none 


13.6 


Dermal fibroblast IL-4 


22.7 


Deiidrit c cells LPS 


14.4 


Dermal fibroblasts rest 


18.8 


Dendritic cells anti- 
CD40 


13.5 


Neutrophils TNFa+LPS 


1.7 


M< moevtes rest 


12.3 


Neutrophils rest 


2.6 


Monocytes LPS 


62.9 


( i>|. n 


15.8 


Macrophages rest 


18.4 


Lung 


15.6 


Macrophages LPS 


15.3 


Thymus 


21.8 


HUVEC none 


i: n 


Kidney 


46.3 


HI (VEC starved 


17.0 







CNS_neurodegeneration_vl.O Summary: Ag3580 Results from two experiments using 
the same probe/primer set are in excellent agreement. This panel confirms the expression of 
this gene at high to moderate levels in the brains of an independent group of individuals. 
This gene is found to be upregulated in the temporal cortex of Alzheimer's disease patients. 
Therefore, therapeutic modulation of this gene or its protein product may be used to 
decrease neuronal death and treat Alzheimer's disease. 



General_screening_panel_vl.4 Summary: Ag3580 The CG59773-01 gene encodes a 
splice variant of the myomegalin protein, which is a component of the golgi/centrosome 
and interacts with a cyclic nucleotide phosphodiesterase (ref. 1). Expression of the 
CG59773-01 gene is highest in the cerebellum (CT = 23.8). In addition, this gene is 
expressed at high levels in all other regions of the central nervous system examined, 
including amygdala, hippocampus, substantia nigra, thalamus, cerebral cortex, and spinal 
cord. Therefore, this gene may play a role in central nervous system disorders such as 
Alzheimer's disease, Parkinson's disease, epilepsy, multiple sclerosis, schizophrenia and 
depression. 



Among tissues with metabolic or endocrine function, this gene is expressed at high 
to moderate levels in pancreas, adipose, adrenal gland, thyroid, pituitary gland, skeletal 
muscle, heart, liver and the gastrointestinal tract. Therefore, therapeutic modulation of the 
activity of this gene may prove useful in the treatment of endocrine/metabolically related 
20 diseases, such as obesity and diabetes. 
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This gene is also expressed at very high levels in a number of melanoma cell lines. 
Therefo re, therapeutic modulation of the activity of this gene or its protein product may be 
of benefit in the treatment of melanoma. 



References: 

5 1 . Verde I, Pahlke G, Salanova M, Zhang G, Wang S, Coletti D, Onuffer J, Jin SL, 

Conti M. Myomegalin is a novel protein of the golgi/centrosome that interacts with a cyclic 
nucleotide phosphodiesterase. J Biol Chem 2001 Apr 6;276(14):1 1 189-98 

Pane! 4. ID Summary: Ag3580 This gene is expressed at high to moderate levels in a 
Q wide range of cell types of significance in the immune response in health and disease. 

1" 10 These cells include T cells, B cells, endothelial cells, macrophages, monocytes, dendritic 

iy 

0 cells, basophils, eosinophils and peripheral blood mononuclear cells, as well as epithelial 

ri 

and fibroblast cell types from lung and skin, and normal tissues represented by colon, lung, 
s thymus and kidney. This ubiquitous pattern of expression suggests that this gene product 

may be involved in homeostatic processes for these and other cell types and tissues. This 

0 1 5 pattern i s in agreement with the expression profile in General_screening_panel_vl .5 and 
rj also suggests a role for the gene product in cell survival and proliferation. Therefore, 

1 W therapeutic modulation of the activity of this gene or its protein product may lead to the 

alteration of functions associated with these cell types and lead to improvement of the 
symptoms of patients suffering from autoimmune and inflammatory diseases such as 
20 asthma, allergies, inflammatory bowel disease, lupus erythematosus, psoriasis, rheumatoid 
arthritis, and osteoarthritis. 

CR. CG57460-01: N-ACETYLTRANSFERASE CAMELLO 2 

Expression of gene CG57460-01 was assessed using the primer-probe set Ag3273, 
described in Table CRA. Results of the RTQ-PCR runs are shown in Tables CRB, CRC 
25 and CRD. 



Table CRA . Probe Name Ag3273 



Primers 


Sequences 


Length 


Start 
Position 


SEQID 
NO: 


F< >r\\ arc! 


5 ' - cgctactactacagccgcaa- 3 1 


20 


205 


692 


Probe 


TET-5 ' -gtgatccgcgcctacctggagtg-3 ' - 
TAMRA 


23 


226 


693 
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Reverse 5 ' -qggcggcttcatgtagtact-3^ , 20 j 281 694 

Table CRB . CNS neurodegeneration vl.O 



Tissue Name 


Rel. Exp.(%) Rel. Exp.(%) ; 
Ag3273, Run Ag3273, Run * ,5SUC 
210038591 230512515 mC 


Rel. Exp.(%) 
Ag3273, Run 
210038591 


Rel. Exp.(%) 
Ag3273, Run 
230512515 


AD 1 Hippo 


15.7 


Control 

18 6 (Path)3 , 
Temporal 

(t\ 


12.2 


12.7 


AD 2 Hippo 


28.9 


IControl 

23.2 L Path > 4 , 
Temporal 

Ctx 


40.3 


— — — 
35.4 


AD 3 Hippo 


11.0 


\D 1 

11.0 Occipital 

jCtx 


20.9 


18.8 


AD 4 Hippo 


10.2 


AD 2 
10 „ Occipital 
138 Ctx 

;(Missing) 


0.0 


0.0 


AD 5 hippo 


100.0 


!AD3 
100.0 Occipital 

jCtx 


15.2 


13.1 


AD 6 Hippo 


21.8 


AD 4 
22.4 Occipital 
jCtx 


22.1 


23.2 


Control 2 
Hippo 


27.0 


AD 5 
27.9 JOccipital 

iCtx 


18.4 


18.4 


Control 4 
Hippo 

L 


22.5 


21.8 JOccipital 

ICtx 


46.3 


48.3 

— — — 


Control (Path) 
3 Hippo 


8.4 


(Control 1 
9.0 [Occipital 

ICtx 


8.0 


11.2 


AD 1 Temporal 
Ctx 


16.5 


Control 2 
15.1 (Occipital 

iCtx 




80.1 


82.4 


AD 2 Temporal 
Ctx 


29.3 


Control 3 
26.2 Occipital 

JCtx 


26.2 


27.2 


AD 3 Temporal 
Ctx 


10.9 


[Control 4 
12.9 (Occipital 
Ctx 


14.8 

. _ „_ _ . 


14.1 


AD 4 Temporal 


22.5 


20.3 |Control 


70 : 


us 
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Ctx 






(Path) 1 

Occipital 

Ctx 






AD 5 Inf 
Temporal Ctx 


57.0 


59.5 


Control 
(Path) 2 
Occipital 
Ctx 


21.6 


20.6 


AD 5 

SupTemporal 
Ctx 


30 6 


26.8 


Control 
(Path) 3 
Occipital 
Ctx 






AD 6 Inf 
Temporal Ctx 


.......... 

27.2 


(Control 

aw < Path > 4 , 

[Occipital 
Ctx 


36.1 


34.2 


AD 6 Sup 
I ■ mporal Ctx 


34.2 


~ 7 . (Control 1 

jParietalCtx 


14.1 


13.7 


(\>mr<>i 1 
i :mporal Ctx 


12.2 


j j j jControl 2 

[Parietal Ctx 


33.9 


36.9 


Control 2 
remporal Ctx 


43 2 1 39 8 (Controls 

1 jParietalCtx 


33.0 


28.7 


Control 3 
Temporal Ctx 


16.7 


jControl 
18.3 (Path) 1 

■Parietal Ctx 


63.7 


67.8 


Control 4 
Temporal Ctx 


18.3 


Control 
19.8 (Path) 2 

Parietal Ctx 


30.6 


— 

30.4 


Control (Path) 
1 Temporal Ctx 


51.4 


JControl 
48.0 (Path) 3 

(Parietal Ctx 


8.1 


9.7 


Control (Path) 
2 Temporal Ctx 


39.2 


IControl 
45.7 (Path) 4 

(Parietal Ctx 


64.2 


59.5 

..„.,, 



Table CRC . General_screening_panel_vl.4 



Tissue Name 


Rel. Exp.(%) 
Ag3273, Run 
215775405 


j Tissue Name 


Rel. Exp.(%) 
Ag3273, Run 
2157-5405 


Adipose: 


7.1 


Renal ca. TK- 10 


4.9 


Melanoma* 
Hs688(A).T 


0.0 


jBladder 


0.9 


Melanoma* 
Hs6S8(B).l 


0.0 


Gastric ca. (liver met.) 
NCI-N87 


2.4 


Melanoma* Ml 4 


17.6 


Gastric ca. KAT( ) III 


0.0 


Melanoma* 


0.9 


Colon SW-948 


0.0 
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LOXIMVI 








IVtclEtn orris* SIC- 
MEL-5 


4.2 


Colon ca. SW480 


—————— 

2.7 


Squamous cell 
carcinoma SCC-4 


(Colon ca* (SW480 
(met) SW620 


1.0 


Testis Fool 


13.5 Colon ca. HT29 


0.1 


Prostate: ca.* (bone 
met) PC-3 


2.7 


Colon ca. HCT-116 


6.0 


Prostate Pool 


0.2 


Colon ca. CaCo-2 


2.0 


Placenta 


0.0 


Colon cancer tissue 


0.6 


■ Items Pool 


0.0 


Colon ca. SVV1I 16 


0.4 


Ovarian ca. 
OVCAR-3 


5.4 Colon ca. Colo-205 


0.0 


Ovarian, ca. SK- 
OV-3 


6.8 


Colon ca. SW-48 


0.0 


Ovarian ca. 

\J V l^/VIV-H 


1.6 


Colon Pool 


0.4 


Ovariarii ca. 
OVCAR-5 


2.1 


Small Intestine Pool 


0.1 


Ov&ricin csl. 
IGROV-l 


7.7 


Stomach Pool 


0.3 


Ovarian ca. 
OVCAR-8 


9.6 (Bone Marrow Pool 

_ „. . J _ „, , 


0.1 


Ovary 


0.8 iFetal Heart 


100.0 


Breast La. \1CF-7 


1.7 Heart Pool 


0.2 


Breast ca. MDA- 
MB-231 


0.8 (Lymph Node Pool 


0.0 


Breast ca. BT 549 


7.7 Fetal Skeletal Muscle 


0.3 " 


Breast ca. T47I > 


0.3 Skeletal Muscle Pool 


1.3 


Breast ca. MDA-N 


0.0 -Spleen Pool 


0 0 


Breast Pool 


0.4 [Thymus Pool 


0.4 


Trachea 


„ _ CNS cancer 

'(glio/astro) U87-MG 


0.0 


Lung 


CNS cancer 
|(glio/astro) U-118-MG 


0.0 


Fetal Lung 


n , 1CNS cancer 

|(neuro;met) SK-N-AS 


0.0 


Lung ca. NCI-N417 


CNS cancer (astro) SF- 

i 539 


0.2 


Lung ca. LX-1 


0.4 


CNS cancer (astro) 
SNB-75 


0.6 


Lung ca. NCI-H146 


12.2 


CNS cancer (glio) 
SNB-19 


7.1 


Lung ca. SHP-77 

L — „ , . 


2.4 


CNS cancel (t/lio) SF- 
295 


1.8 



I liny ca. \549 


3.4 Brain i \im gdala) Pool 


31.4 


Lung ca. NCI-H526 


12.0 jBrain (cerebellum) 


32.3 


Lung ca. NCI-H23 


5.8 jBrain (fetal) 


9.3 


Lung ca. NCI-H460 


9 ~ jBrain (Hippocampus) 
• J JPool 


22.5 


Lung ca. HOP-62 


1.0 (Cerebral Cortex Pool :1 40.3 


Lung ca. NCI-H522 


j ^ jBrain (Substantia 
inigra) Pool 


57.4 


Liver 


0.0 


Buun (Thalamus) Po A 


36.6 


Fetal Liver 


0.1 


Brain (whole) 


24.8 


Liver ca. HepG2 


0.3 


Spinal (' ord Pool 


20.2 


Kidney Pool 


0.2 


Adrenal Gland 


0.0 


Fetal Kidney 


0.7 


Pituitary gland Pool 


2.2 


Renal ca. 786-0 


0.2 


Sali\ar\ Gland 


0.2 




1.0 jThyroid (female) 


1.1 


Renal ca. ACHN 


. , Pancreatic ca. 
|CAPAN2 


0.0 


Renal ca. UO-31 


3.7 (Pancreas Pool 


0.6 



Table CRD . Panel 4D 



Tissue Name 


ReL Exp.(%) I 

Ag3273, Run I Tissue Name 
165338992 


Rel. Exp.(%) 
Ag3273, Run 
165338992 


Secondary Thl act 


1.0 [HUVEC IL-lbeta 


0.0 


Secondary Th2 act 


2.5 HUVEC IFN gamma 


0.0 


Secondary Trl act 


3 y (HUVEC TNF alpha + 
|EFN gamma 


0.0 


Secondary Thl rest 


<. ft 'HUVEC TNF alpha + 
|IL4 


0.0 


Secondary Th2 rest 


0.9 lHUVECIL-11 




Secondary Trl rest 


g g Lung Microvascular EC 
{none 


0.0 


Primary Thl act 


y. „ iLung Microvascular EC 
TNFalpha + IL-lbeta 


0.0 


Primary Th2 act 


^ g (Microvascular Dermal 
LiC none 


0.0 


Primary Trl act 


g - jMicrosvasuIar Dermal 

l ; .C TNFalpha + IL- 1 beta 


0.0 


Primary Thl rest 


, (Bronchial epithelium 
J TNFalpha + ILlbeta 


0.0 


Primary Th2 rest 


0.0 


Small airway epithelium 
none 


2.9 


Primary Trl rest 


0.0 


Small airway epithelium 


0.0 
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jTNFalpha + IL-lbeta 




CD45RA CD4 
lymphocyte act 


0.0 jCoronery artery SMC rest 


[ 


CD45RO CD4 
(lymphocyte act 


n n JCoronery artery SMC 
" JTNFalpha + IL- 1 beta 


0.0 


)CD8 lymphocyte act 


0.0 j Astrocytes rest 


16.5 


i Secondary CD 8 
lymphocyte rest 


n n (Astrocytes TNFalpha + 
jlL-lbeta 


10 4 


Secondary CD8 
lymphocyte act 


0.0 KU-812 (Basophil) rest 


0.0 


CD4 lymphocyte none 


00 |KU-8 12 (Basophil) 
jPMA/ionomycin 


0.0 


2ry Thl/Th2/Trl_anti- 
CD95 CHI 1 


191 (CCD1106 

j(Keratinocytes) none 


0.0 


LAK cells rest 


0.0 


K-K^\JY lUO 

(Keratinocytes) 
TNFalpha + IL-lbeta 


0.0 


LAK cells IL-2 


1.2 


Liver cirrhosis 


12.7 


LAK cells IL-2+IL-12 




Lupus kidney 


3.9 


LAK cells IL-2+IFN 
gamma 


4.1 


NCI-H292 none 


10.8 


LAK cells IL-2+ IL-18 




NCI-H292 IL-4 


23.2 


LAK cells 
PMA/ionomycin 


0.0 


NCI-H292 IL-9 


22.4 


NK Cells IL-2 rest 


0.0 


NCI-H292IL-13 


1 0.3 


Two Way MLR 3 day 


1.1 


NCI-H292 TFN gamma 


18.0 


Two Way MLR 5 da> 


0.0 


HPAEC none 


0.0 


Two Way MLR 7 day 


0.0 


HPAEC TNF alpha + IL- 
1 beta 


0.0 


PBMC rest 


0.0 


Lung fibroblast none 


1.9 


PBMC PWM 


1.2 


Lung fibroblast T\l 
alpha + IL-1 beta 


0.0 


PBMC PHA-L 


0.0 


Lung fibroblast IL-4 


3.8 


Ramos (B cell) none 


0.0 


Lung fibroblast IL-9 


0.5 


Ramos (B cell) 
ionomycin 


0.0 


Lung fibroblast IL-1 3 


5.7 


B lymphocytes PWM 


3.5 


Lung fibroblast IFN 
gamma 


2.9 


B lymphocytes CD40I 
and IL-4 


0.0 


Dermal fibroblast 
CCD1070 rest 


0.0 


EOL-1 dbcAMP 


100.0 


Dermal fibroblast 
CCD1070 TNF alpha 


0.0 


EOL-1 dbcAMP 
PMA/ionomycin 


25.9 


Dermal fibroblast 
CCD1070 IL-1 beta 


0.0 


Dendritic cells none 


3.8 


1 )o,,iu, hl)i .hi - II N 


2.8 
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gamma 




Dendritic cells LPS 


2.6 


Dermal fibroblast IL-4 


4.8 


Dendritic cells anti- 
CD40 


5.0 


IBD Colitis 2 


0 0 


Monocytes rest 


0.0 IBD Crohn's 


0.0 


Monocytes LPS 


0.0 jColon 


37.4 


Macrophages re st 


4.2 jLung 


8.4 


Macrophages LPS 


0.0 Thymus 


40.9 


HUVEC none 


0.0 


Kidney 




HUVEC starved 


0.0 







CNSneurodegenerationvl.O Summary: Ag3273 Two experiments with the same 
probe and primer set produce results that are in excellent agreement. This panel confirms 
the expression of this gene at low to moderate levels in the brains of an independent group 
of individuals. Expression of this gene is found to be down-regulated in the temporal cortex 
of Alzheimer's disease patients. Therefore, up-regulation of this gene or its protein product, 
or treatment with specific agonists for this protein, may be of use in reversing the 
dementia/memory loss associated with Alzheimer's disease and neuronal death. 

Genera l_screening_panel_vl .4 Summary: Ag3273 Highest expression of the CG57460- 
01 gene is seen in fetal heart (CT=28.6). In addition, this gene is expressed at much higher 
levels in fetal heart when compared to expression in the adult heart (CT=38). Thus, 
expression of this gene may be used to differentiate between the fetal and adult source of 
this tissue. In addition, the higher expression in fetal heart suggests that this protein product 
may be involved in the development of this organ. Therefore, therapeutic modulation of the 
expression or function of this gene may be useful in the treatment of heart disease. 

This gene also shows highly specific brain expression. Please see Panel 
CNSneurodegeneration for discussion of utility of this gene in the central nervous system. 

In addition, expression of this gene appears to be upregulated in a number of cancer 
cell lines when compared to the normal tissues. Specifically, expression of this gene 
appears to be higher in ovarian, breast, lung and renal cancer cell lines when compared to 
their respective normal tissues. Therefore, therapeutic modulation of the activity of this 
gene or its protein may be of benefit in the treatment of ovarian, breast, lung and renal 
cancer. The CG5 7460-01 gene encodes a transmembrane protein with homology to N- 
acetyltransferase Camello 2, a protein involved in cellular adhesion (ref. 1). 
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1 . Popsueva AE, Luchinskaya NN, Ludwig AV, Zinovjeva OY, Poteryaev DA, 
Feigelman MM, Ponomarev MB, Berekelya L, Belyavsky AV. Overexpression of camello, 
a member of a novel protein family, reduces blastomere adhesion and inhibits gastrulation 
5 in Xenopus laevis. Dev Biol 2001 Jun 15;234(2):483-96 

Panel 4!D Summary: Ag3273 Highest expression of the CG57460-01 is seen in 
eosinophils. In addition, differential expression is observed in the eosinophil cell line EOL- 
1 under resting conditions over that in EOL-1 cells stimulated by phorbol ester and 
ionomycin. Thus, this gene may be involved in eosinophil function. Therefore, therapeutic 
10 modulation of the expression or function of this gene may reduce eosinophil activation and 
be useful in the treatment of asthma and allergies. 

In addition, significant expression in normal colon and thymus suggest a role for 
this gene in the normal homeostasis of these tissues. Therefore, therapeutic modulation of 
the expression or function of this gene may modulate immune function (T cell 
1 5 development) and be important for organ transplant, AIDS treatment or post chemotherapy 
immune reconstitution. Furthermore, since expression of this gene is decreased in colon 
samples from patients with IBD colitis and Crohn's disease relative to normal colon, 
therapeutic modulation of the activity of the protein encoded by this gene may be useful in 
the treatment of inflammatory bowel disease. 

20 CS. CG57464-01 

Expression of gene CG57464-01 was assessed using the primer-probe set Ag3248, 
described in Table CSA. Results of the RTQ-PCR runs are shown in Tables CSB, CSC, 
CSD and CSE. 



Table C SA. Probe Name Ag3248 



Primers 


Sequences 


Length 


Start 
Position 


| SEQID 1 

I NO: 1 


Forwarc 


5 ' -cctccctggtagaggtcaac-3 ' 


20 


929 




Probe 


TET-5 ' -ctactcagtgcccagcagccaggt-3 ' - 

TAMRA 


24 


954 


1 696 


Reverse 


5 1 -tgtctgcatgcagcctatg-3 ' 


19 


996 


i 697 ; 
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Tissue Name 


Rel. Exp.(%) 
Ag3248, Run 
210037962 


Rel.Exp.(%)| 
Ag3248, Run i * ISSUe 
224063124 | Name 


Rel. Exp.(%) 
Ag3248, Run 
210037962 


Rel. Exp.(%) 
Ag3248, Run 
224063124 


AD 1 Hippo 


26.4 


4.8 


Control 
(Path) 3 
Temporal 
Clx 


28.5 




39.2 


AD 2 Hippo 


73.2 


71.7 


Control 
(Path) 4 
Temporal 
Ctx 


57.8 


61.1 


AD 3 Hippo 


21.6 


5.6 


AD 1 

Occipital 

Ctx 


19.1 


21.2 


A F» A 1-1 irmrv 

t\u t n ippu 


28 3 


28.9 


AD 2 

Occipital 

Ctx 

(Missing) 


n n 
0.0 


— 


AD 5 hippo 


85.3 


100.0 


AD 3 

Occipital 

Ctx 


13.0 


15.9 


AD 6 Hippo 


70.2 


72.7 


AD 4 
Occipital 
C tx 


37.4 


28.7 


Control 2 
Hippo 


64.6 


49.3 


AD 5 ~~ '™ 

Occipital 

Ctx 


33.9 


85.3 


control H 

Hippo 


55.5 


64.2 


AD 6 

Occipital 

Ctx 


58.6 


14.4 


Control (Path) 
3 Hippo 


46.3 


0.9 


Control 1 
Occipital 
Ctx 


,73 


22.7 


Ctx 


28.9 


24.1 


Control 2 
Occipital 
Ctx 


89.5 


81.2 


AD 2 Temporal 
Ctx 


57.4 


62.4 


Control 3 
Occipital 
Ctx 


68.3 


62.0 


AD 3 Temporal 
Ctx 


18.9 


16.7 


Control 4 
Occipital 
Ctx 


29.3 


34.2 


AD 4 Temporal 
Ctx 


42.3 


29.9 


Control 
(Path) 1 
Occipital 
Ctx 


100.0 


92.7 


YD 5 Inf 


77.4 


97.3 


Control 


36.3 


15.1 



1000 



Temporal Ctx 




|(Path) 2 
|Occipital 

[Ctx 






AD 5 

SupTem poral 
Ctx 


69.7 


(Control 

87.7 

(Occipital 
[Ctx 


32.5 


25.2 


AD 6 Inf 
Temporal Ctx 


39.2 


87.1 


Control 
(Path) 4 
Occipital 
Ctx 


70.2 


66.9 


AD 6 Sup 
Tempi tral ( !tx 


73.2 


70.7 


Control 1 
Parietal Ctx 


24.8 


32.3 


Control 1 
Temporal Ctx 


25.3 


26.1 


Control 2 
Parietal Ctx 


70.7 


94.0 


Control 2 
Temporal Ctx 


43.5 


77.4 


Control 3 
Parietal Ctx 


59.0 


0.0 


Temporal Ctx 


74.2 


49.3 


Control 
(Path) 1 
Parietal Ctx 


42.6 


80.7 


Control 4 
Temporal Ctx 


45.7 


68.8 


Control 
(Path) 2 
Parietal Ctx 


67.4 


59.9 


Control (Path) 
1 Temporal Ctx 


63.3 


(Control 
58.6 [(Path) 3 

IParietal Ctx 


25.3 


24.1 


Control (Path) 
2 Temporal Ctx 


55.9 


(Control 
55.1 [(Path) 4 

[Parietal Ctx 


78.5 


79.0 



Table CSC . General_screening_panel_vl.' 



Tissue Name 


Rel. Exp.(%) 
Ag3248, Run 
214693634 


Tissue Name 


Rel. Exp.(%) 
Ag3248, Run 
214693634 


Adipose 


1.5 


Renal ca. TK-10 


3.6 


Melanoma* 
Hs688(A).T 


6.2 


Bladder 


7.6 


Melanoma* 
Hs688(B).T 


5.8 


Gastric ca. (liver met.) 
NCI-N87 


16.8 


Melanoma* M14 


4.9 


( iastric ca. KATO III 


14.5 


Melanoma* 
LOXIMVI 


5.3 


Colon ca. SW-948 


6.4 


Melanoma* SK- 
MEL-5 


4.3 


Colon ca. SW480 


15.6 


Squamous cell 


3.3 


Colon ca.* (SW480 


9.1 
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carcinoma SCC-4 




met) SW620 




Testis Pool 


1.5 


Colon ca. HT29 


7.1 


Prostate ca.* (bone 
met) PC-3 


2.3 


Colon ca. HCT-116 


7.5 


Prostate Pool 


3.1 Colon ca. CaCo-2 


8.7 


Plciccntsi 


3.0 (Colon cancer tissue 




Uterus Pool 


0.9 


Colon ca. SW1116 


10.2 


Ovarian ca. 
OVCAR-3 


24.0 


Colon ca. Colo-205 


6.2 


Ovarian ca. SK- 
OV-3 


18.7 


Colon ca. SW-48 


7.5 


Ovarian ca. 

U V LAK-4 


4.1 


Colon Pool 


4.2 


Ovarian ca. 
OVCAR-5 


33.0 


Small Intestine Pool 


4.5 


Ovarian ca. 
IGROV-1 


16.0 


Stomach Pool 


2.7 


Ovarian ca. 
OVCAR-8 


20.9 


Bone Marrow Pool 


1.3 


Ovary 


3.1 


Fetal Heart 


3.2 


Breast ca. MCF-7 


4.5 jHeart Pool 


2.6 


Breast ca. MDA- 
MB-231 


12.2 iLymph Node Pool 


4.6 


Breast ca. BT 549 | 12.0 [Fetal Skeletal Muscle 


| 2 


Breast ca. T47D 1 00.0 iSkeletal Muscle Pool 


\ n 


Breast ca. MDA-N 


9.7 »pi v , | i 


4.1 


Breast Pool 


5.2 Thymus Pool 


5.3 


Trachea 


, 1 jCNS cancer 

((glio/astro) U87-MG 


12.1 


Lung 


n . |CNS cancer 

(glio/astro) U-118-MG 


10.9 


Fetal Lung 


4.4 


CNS cancer 
(neuro;met) SK-N-AS 


16.0 


Lung ca. NCI-N417 


3.1 


CNS cancer (astro) SF- 
539 


6.2 


Lung ca. LX-1 


.-^ jCNS cancer (astro) | 

l ^ 'SNB-75 1 ° J 


Lung ca. NCI-H146 


_ 1 jCNS cancer (glio) 
[SNB-10 


13.2 




Lung ca. SHP-77 


CNS cancer (slio) SF- 

|295 


25.9 


Lung ca. A549 

________ 


5.4 Biain ( \!!!\»dal D 1 ■ >>: 
7.4 JBrain (cerebellum) 


4.7 
8.5 


Lung ca. NCI-H23 


4.4 jBrain (fetal) 


2.1 
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Lung ca. NCI-H460 


4.5 


Brain (Hippocampus) 
Pool 


6.1 


Lung ca. HOP-62 


6.3 


Cerebral Cortex Pool 


6.0 




5.5 


Brain (Substantia 
nigra) Pool 


8 1 


Liver 


3.2 


Brain (Thalamus) Pool 


7.8 


Fetal Liver 


2.3 


Brain (whole) 


3.4 


1 ,iver ca. 1 [epG2 


5.4 


Spinal Cord Pool 


6.6 


Kidney Pool 


4.5 


Adiuiai Gland 


3.0 


Fetal Kidney 


3.1 


Pituitary gland Pool 


3.4 


Renal ca. 786-0 


7.4 


Salivary Gland 


4.3 


Renal ca. A498 


5.8 


Th\ roid ( female) 


6.1 


Renal ca. ACHN 


5.6 


Pancreatic ca. 
CAPAN2 


11.3 


Renal ca.UO-31 


6.1 


Pancreas Pool 


5 2 



Table CS D. Panel 2.2 



Tissue Name 


ReL Exp.(%) j 

Ag3248, Run 1 Tissue Name 
17444 12'J.X 


Rel. Exp.(%) 
Ag3248, Run 
174441298 


Normal Colon 


1 4 A jKidney Margin 
j(OD04348) 


100.0 


Colon cancer 
(OD06064) 


, _ Kidney malignant 

cancer (OD06204B) 


6.3 


Colon Margin 
(OD06064) 


Ikidney normal 
7.5 adjacent tissue 
|(OD06204E) 


9.3 


Colon cancer 
(OD06159) 


„ n jKidney Cancer 
|(OD04450-01) 


57.8 


Colon Margin 
(OD06159) 


5.3 


Kidney Margin 
(01)04450-03) 


16.2 


Colon cancer 
iOD06297-04) 


2.4 


Kidney Cancer 
8120613 


10.8 


Colon Margin 
(OD06297-05) 


6.3 " 


Kidney Margin 
8120614 


l6 l J 


CC Gr.2 ascend colon 
(OD03<>2 1 ) 


14.1 


Kidney Cancer 
9010320 


9.0 


CC Margin (OD03921) 


12.8 


Kidney Margin 
9010321 


19.5 


Colon cancer metastasis 
(( (D06104) 


4.4 


Kidney Cancer 
8120607 


34.2 


Lung Margin 
(( JD06104) 


4.8 


Kidney Margin 
8120608 


27.7 


( nl-^n mets to lung 


15.0 


Normal Uterus 


3.8 



1003 



(OD04451-01) 


J 




Lung Margin 
(OD04451-02) 


, 7 


Uterine Cancer 064011 


10.5 


Normal Prostate 


9.0 


Normal Thyroid 


4.6 


Prostate Cancer 
(OD04410) 


7.5 


Thyroid Cancer 
064010 


18.6 


Prostate Margin 
(OD04410) 


- 1 , jThyroid Cancer 
JA302152 


24.0 


Normal Ovary 


28.7 


Thyroid Margin 
A302153 




13.3 


Ovarian cancer 
(OD06283-03) 


3.1 


Normal Breast 


4.0 


Ovarian Margin 
(OD06283-07) 


1.1 


Breast Cancer 
(OD04566) 


9.3 


Ovarian Cancer 064008 


6.8 


Breast Cancer 1024 


6.9 


Ovarian cancer 
(OD06145) 


8.6 


Breast Cancer 
(( ©04590-01) 


86.5 


Ovarian Margin 
(OD06I45) 


17.8 


Breast Cancer Mets 
(OD04590-03) 


22.4 


Ovarian cancer 
(OD06455-03) 


2.0 


Breast Cancer 
Metastasis (OD04655- 
05) 


47.3 


Ovarian Margin 


2.5 


Breast Cancer 064006 


9.8 


Norma! Lung 


4.7 Breast t ancer 0 100266 


5.8 


Invasive; poor diff. lung 
adeno(ODO4945-01 


9.0 


Breast Margin 
9100265 


3.3 


Lung Margin 
(ODO4945-03) 


4.5 


Breast Cancer 
A209073 


7.3 


Lung Malignant Cancer 
(OD03126) 


10.6 


Breast Margin 
A2090734 


17.0 


Lung Margin 
(OD03126) 


8.4 


Breast cancer 
(OD06083) 


8.6 


Lung Cancer 
(OD05014A) 


8.7 


Breast cancer node 
metastasis (OD06083) 


8.2 


Lung Margin 
(OD05014B) 


8.6 


Normal Liver 


33.4 


Lung cancer 
(OD06081) 


6.3 


Liver Cancer 1026 


20.6 


Lung Margin 
(OD06081) 


4.4 Liver Cancer 1025 


29.1 


Lung Cancer 
(OD04237-01) 


2.7 


Liver Cancer 6004-T 


27.4 


Lung Margin 
(OD04237-02) 


16.7 


Liver Tissue 6004-N 


2.5 
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Onilar 1V/fplsmr»m?i 
KjKsllicH iViciallLMlla 

Metastasis 


25.0 


! 

Liver Cancer 6005-T j 


26.2 


( )cu I ar [VI e ' anoin a 
Margin (Liver) 


15.8 


Liver Tissue 6005-N j 


58.2 




3.8 


Liver Cancer 064003 i 


52.5 


Melanoma Margin 
(Lung) 


2.6 


Normal Bladder \ 


10.7 


Normal Kidney 


12.4 


Bladder Cancer 1023 


6.8 


Kidney Ca, Nuclear 
grade 2 (OD04338) 


32.3 


Bladder Cancer 1 
A302173 


3.8 


Kidney Margin 
(OD04338) 


7.1 


Normal Stomach 


16.8 


Kidney Ca Nuclear 
grade 1/2 (OD04339) 


71.7 


Gastric Cancer 
9060397 


9.2 


Kidney Margin 
(OD04339) 


14.6 


Stomach Margin 
9060396 i 


10.4 


Kidney Ca, Clear cell 
type (OD04340) 


8.4 


Gastric Cancer 
9060395 


1.7 


Kidney Margin 
(OD04340) 


31.2 


Stomach Margin 
9060394 


7.5 


Kidney Ca, Nuclear 
grade 3 (OD04348) 


7.5 

__ _.. 


Gastric Cancer 064005 


10.7 


Table CSE. Panel 4D 


r 

] Tissue Name 


Rel. Exp.(%) | 

Ag3248, Run 1 Tissue Name 
!<>43MO«>52 


\ Rel. Exp.(%) 
1 Ag3248, Run 
164390952 


^e^oiidji} Thl i:i 


15.3 IIUVEC IL-lbeta 


1 - . . 


(Secondary Th2 act 


16.5 I IUVEC li'N gamma 


16.0 

I - 


(Secondary Trl act 


155 HUVECTNF alpha + 
IlFN gamma 


7.8 


Secondary Thl rest 


IlUVECTNFalpha + 

JL4 


9.2 


{Secondary Th2 rest 


8.8 iHUVEC IL-11 


12.8 


Secondary Trl rest 


j ^ g 'Lung Microvascular EC 

jnone 


19.1 


jPrimary Thl act 

I ... 


„ . (Lung Microvascular EC 
TNFalpha+ IL-lbeta 


13.8 


jPrimary Th2 act 


o.3 ; 


Microvascular Dermal 
2C none 


18.0 


(Primary Trl act 


; 


vlicrosvasular Dermal 
X TNTulpliu-t IL-lbeia 


6.9 


(Primary Thl rest 


12.3 I 

_ JL 


Jronchial epithelium 
TMFalpha - ILlbeta 


21.3 
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Primary Th2 rest 


„ , (Small airway epithelium 
4.6 Lone 


13.5 


Primary Trl rest 


_ , j Small airway epithelium 
rFNFfllnha 4- TT -Ih^tn 

lllNJ/aipila ~ IJu- 1 UCLct 


45.4 


CD45RA CD4 
lymphocyte &ct 


12.1 Coronery artery SMC rest 


12.9 


CD45RO CD4 
lymphocyte set 


. _ ft JCoronery artery SMC 

iTWPiilnTia 4- TT 1t-»^tn 
j llNJraipna t JLJL-lDeia 


15.2 


CD8 lymphocyte act 


12.5 (Astrocytes rest 


18.6 


Secondary CD8 
lymphocyte rest 


. ~ „ ^Astrocytes TNFalpha + 
_ IlL-lbeta 


17.2 


Secondary CDS 
lymphocyte act 


9.9 IKU-812 (Basophil) rest 


8.7 


CD4 lymphocyte none 


6 8 kl -812 (Basophil) 
|P]VLA/ionomy c i n 


8.6 


2ry Thl/Th2/Trl_anti- 


95 (CCD1106 

i Ker ni noc- tes ) none 


10.9 


LAK cells rest 


ICCD1106 
1 5 .2 (Keratinocytes) 

(TNFalpha + IL-1 beta 


12.2 


LAK cells IL-2 


13.6 (Liver cirrhosis 


7.4 


LAK cells IL-2+IL-12 


9.1 ^Lupus kidney 


10.7 


LAK cells IL-2+IFN 

gamma 


13.9 [NCI-H292 none 42.6 


LAK cells IL-2+ IL-18 


9.7 NCI-H292 IL-4 


48.3 


LAK cells 
PMA/ionomycin 


1.5 InCI-H292 IL-9 


54.3 


NK Cellls IL-2 rest 


12.0 NCI-H292 IL-13 


37 1 


Two Way MLR 3 day 


19.8 NC1-H292 I FN gamma 


48.3 


Two Way Ml R 5 da\ 


6.3 jHPAEC none 


17.9 


Two Way MLR 7 day 


Q ~ HPAi ( IM sIpha + IL- 
y>/ ]l beta 




PBMC rest 


1 0.7 Lung fibroblast none 


21-5 


PBMC PWM 


27 q jLung fibroblast TNF 
alpha + II -1 beta 


27.9 


DDk \C 1514 \ I 


12.0 Lung fibroblast IL-4 


3 1 .9 


Ramos (B cell) none 


27.9 Lung fibroblast IL-9 


35.4 


Ramos (B cell) 
ionomycin 


100.0 iLung fibroblast IL-13 


18.6 


B lymphocytes PWM 


2j q (Lung fibroblast I FN 
" gamma 


37.4 


B lymphocytes CD40L 
and IL-4 


„ Q Derma! fibroblast 
CCD1070 rest 


17.7 


EOL-1 dbcAMP 


1 Q „ Dermal fibroblast 

CCD 1070 TNF alpha 


21.2 
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POT 1 flhr-A1V/fP 

PMA/ionomycin 


1.4 


Dermal fibroblast 
CCD1070 II -1 beta 


8.4 


Dendritic cells none 


18.3 


Dermal fibroblast IFN 
gamma 


16.8 


Dendritic cells LPS 


20.0 


Dermal fibroblast IL-4 


27.7 


Dendritic cells anti- 
CD40 


22 1 


TRr> Politic 9 


3 0 


Monocytes rest 


12.6 


IBD Crohn's 


4.2 


Monocytes LPS 


2.6 


Colon 


28.5 


Macrophages rest 


20.6 


Lung 


12.0 


Macrophages LPS 


10.4 


Thymus 


53.6 


HUVEC none 


11.9 


Kidney 


20.0 


HUVEC starved 


15.6 







CNSneurodegenerationvl.O Summary: Ag3248 Results from two experiments using 
the same probe/primer set gave results that are in excellent agreement. This panel confirms 
the expression of this gene at low levels in the brains of an independent group of 
individuals. However, no differential expression of this gene was detected between 
5 Alzheimer's diseased postmortem brains and those of non-demented controls in this 
experiment. Please see Panel 1.4 for a discussion of the potential utility of this gene in 
treatment of central nervous system disorders. 

General_screening_panel_vl.4 Summary: Ag3248 Expression of the CG57464-01 gene 
is highest in a breast cancer cell line (CT = 27). This also gene appears to be overexpressed 
10 in ovarian and CNS cancer cell lines when compared to the normal tissue controls. Thus, 
therapeutic modulation of the activity of this gene or its protein may be of benefit in the 
treatment of breast, ovarian and CNS cancer. 

In addition, this gene is expressed at low levels in all regions of the central nervous 
system examined, including amygdala, hippocampus, substantia nigra, thalamus, 
1 5 cerebellum, cerebral cortex, and spinal cord. Therefore, this gene may play a role in central 
nervous system disorders such as Alzheimer's disease, Parkinson's disease, epilepsy, 
multiple sclerosis, schizophrenia and depression. 

Among tissues with metabolic or endocrine function, this gene is expressed at low 
levels in pancreas, adipose, adrenal gland, thyroid, pituitary gland, skeletal muscle, heart, 
20 liver and the gastrointestinal tract. Therefore, therapeutic modulation of the activity of this 
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gene may prove useful in the treatment of endocrine/metabolically related diseases, such as 
obesity and diabetes. 

Panel 2,2 Summary: Ag3248 This gene is expressed at low to moderate levels in the 
majority of samples on this panel, with highest expression detected in a sample derived 
5 from normal kidney (CT = 28.6). Expression of the CG57464-01 gene appears to be 

upregulated in a number of breast cancer samples when compared to normal breast. Thus, 
therapeutic modulation of the activity of this gene or its protein product may be of benefit 
in the treatment of breast cancer. 

Panel 4D Summary: Ag3248 Expression of the CG57464-01 gene is highest in Ramos B 
10 cells treated with ionomycin (CT = 29). Therefore, expression of this gene may be used as 
a marker of activated B cells. In addition, this gene is expressed at relatively high levels in 
lung fibroblasts as well as in the mucoepidermoid cell line NCI-H292 independent of 
treatment (CTs = 30), suggesting that therapeutic modulation of the activity of this gene or 
its protein product may be of benefit in the treatment of asthma and emphysema. 

1 5 This gene is also expressed at low to moderate levels in a wide range of other cell 

types of significance in the immune response in health and disease. These cells include 
members of the T-cell, B-cell, endothelial cell, macrophage/monocyte, and peripheral 
blood m ononuclear cell family, as well as epithelial and fibroblast cell types from lung and 
skin, and normal tissues represented by colon, lung, thymus and kidney. This ubiquitous 

20 pattern of expression suggests that this gene product may be involved in homeostatic 
processes for these and other cell types and tissues. 

This pattern is in agreement with the expression profile in 
General_screening_panel_vl.5 and also suggests a role for the gene product in cell survival 
and proliferation. 

25 Therefore, modulation of the gene product with a functional therapeutic may lead to 

the alter ation of functions associated with these cell types and lead to improvement of the 
symptoms of patients suffering from autoimmune and inflammatory diseases such as 
asthma, allergies, inflammatory bowel disease, lupus erythematosus, psoriasis, rheumatoid 
arthritis, and osteoarthritis. 

30 CT. CG57466-01: Acetylglucosaminyltransferase 
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Expression of gene CG57466-01 was assessed using the primer-probe set Ag3249, 
described in Table CTA. Results of the RTQ-PCR runs are shown in Tables CTB, CTC and 
CTD. 



Table CT A. Probe Name Ag3249 



Primer« 


Sequences 


Lengt i Position 


SEQID 
NO: 


Forward|5 ' -accactaactgctcagccaata-3 * 


22 i 156 


698 


p„ i „ |TET-5 ' -aacttgacccaccagccctggtt-3 1 - 

rroDe Itak-^ 


23 180 


699 


Ke\er r .cr -> ' -ta je a ja n= act jc '-Tj. j a< - - ' 


22 22(1 700 



Table CTB . CNS neurodegeneration vl.O 



Tissue Name 


Rel. Exp.(%) Ag3249, 
Run 210037963 


Tissue Name 


Rel. Exp.(%) Ag3249, 
Run 210037963 


AD 1 Hippo 


8.4 


Control (Path) 3 
Temporal Ctx 


5.6 


AD 2 Hippo 


— — 

41.5 


Control (Path) 4 
Temporal Ctx 


27.2 


AD 3 Hippo 


14.0 


AD 1 Occipital Ctx 


6.6 


AD 4 Hippo 


26.1 


AD 2 Occipital Ctx 
(Missing) 


0.0 


AD 5 Hippo j 29.9 


AD 3 Occipital Ctx 


8-5 


AD 6 Hippo 


100.0 


AD 4 Occipital Ctx 


25.2 


Control 2 Hippo 


42.0 


AD 5 Occipital Ctx 


48.0 


Control 4 Hippo 


29.3 


AD 6 Occipital Ctx 


22.4 


Control (Path) 3 
Hippo 


5.6 


Control 1 Occipital 

Ctx 


6.1 


AD 1 Temporal 
Ctx 


10.7 


Control 2 Occipital 
Ctx 


51.4 


AD 2 Temporal 
Ctx 


25.5 


Control 3 Occipital 
Ctx 


22.1 


AD 3 Temporal 
Ctx 


9.9 


Control 4 Occipital 
Ctx 


15.8 


AD 4 Temporal 
Ctx 


29.9 


Control (Path) 1 
Occipital Ctx 


92.7 


AD 5 Inf Temporal 
Ctx 


25.5 


Control (Path) 2 
Occipital Ctx 


18.7 


AD 5 Sup 
Temporal Ctx 


27.4 


Control (Path) 3 
( )ccipital Ctx 


0.7 


AD 6 Inf Temporal 

Ctx 


87.1 


Control (Path) 4 
Occipital Ctx 


29.9 
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AJJ 0 bUp 

Temporal Ctx 


- n iControl 1 Parietal 
]Ctx 


" 

13.7 


Control 1 
Temporal Ctx 


c iControl 2 Parietal 
24 - 5 Ctx 


29.1 


Control 2 
Temporal Ctx 


A1 ~ IControl 3 Parietal 
41 - 2 |Ctx 




25.7 


frintrrvl "X 
V^UIIUOI O 

Temporal Ctx 


33 ? [Control (Path) 1 
Parietal Ctx 


64.2 


Control 3 
Temporal Ctx 


21.9 


Control (Path) 2 
Parietal Ctx 


25.9 


Control (Path) 1 
Temporal Ctx 


69.7 


Control (Path) 3 
Parietal Ctx 


___ 


Control (Path) 2 
Temporal Ctx 


26.4 


Control (Path) 4 
Parietal Ctx 


39.8 





Table CTC . General_screening_panel_vl.4 



Tissue Name 


Rel. Exp.(%) 
Ag3249, Run 
214693635 


Tissue Name 


Rel. Exp.(%) 
Ag3249, Run 

1 4D7JDJJ 


Adipose 


5.6 


Renal ca. TK- 10 


2.7 


Melanoma* 
Hs688(A)/l 


0.4 


Bladder 


59.5 


Melanoma* 
H ' <8(B).T 


0.5 


( iastric ca. (liver met.) 
NCI-N87 


78.5 


Melanoma* Ml 4 


8.8 


Gastric ca. KATO III 


60.3 


Melanoma* 
LOXIMVI 


0.1 


Colon ca. SW-948 


0.5 


Melanoma* SK- 
MEL-5 


15.2 


Colon ca. SW480 


4.4 


Squamous cell 
carcinoma SCC-4 


n (Colon ca.* (SW480 
met) SW620 


0.9 


1 est is P< .ol 


1.7 


Colon ca. 1 1 r29 


0.8 


Prostate ca.* (bone 
met) PC-3 


9.0 


Colon ca.HCT-1 16 


13.2 


Prostate Pool 


1.8 


Colon ca. CaCo-2 


6.6 


Placenta 


10.9 


O.knt cancer tissue 


26.6 


Uterus Pool 


0.9 


Colon ca.SWll 16 


3.6 


Ovarian ca. 
OVCAR-3 


3.8 


Colon ca. Colo-205 


0.1 


Ovarian ca. SK- 
OV-3 


6.1 


Colon ca. SW-48 


0.0 


Ovarian ca. 
OVCAR-4 


1.3 


Colon Pool 


6.2 


( >varian ca. 


22.1 


Small Intestine Pool 


11.3 
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OVCAR-5 




- — — 




Ov3.ris.ri csl 
IGROV-l 


7.5 


Stomach Pool 


6.7 


Ovarian ca. 
OVCAR-8 


1.4 


Bone Marrow Pool 


2.3 


Ovan 


24.5 


I- etal I lean 


_11.6 


Breast*ca?MCF-7 ~ 


0.7 


Heart Pool 


2.7 


Breast ca. MDA- 
MB-231 


1.4 


Lymph Node Pool 


7.1 


Breast ca. BT 549 


1.8 


Fetal Skeletal Muscle 


0.4 


Breast ca. R7D 


:^:> (Skeletal Muscle Pool 


(I s 




100.0 


spleen Po<>| 




Breast Poo! 


4.0 


Thymus Pool 


3.7 


Trachea 


65.5 


CNS cancer 
(glkvastro) I 87-MG 


0.0 


Lung 


5.8 


CNS cancer 
(alio/astro) U-l 18-MG 


0.7 


Fetal Lung 


87.7 


CNS cancer 
(neuro;met) SK-N-AS 




21.3 




Lung ca.NCI-N417 


0.0 


CNS cancer (astro) SF- 
539 


0.1 


Lung ca. LX-1 

— — — — 


?Q _ jCNS cancer (astro) 
jSNB-75 


12.2 


Lung ca. NCI-H146 


n , 1CNS cancer (glio) 
[SNB-19_ 


„ 

6.5 


Lung ca. SHP-77 


n ~ CNS cancer (glio) SF- 

U J295 


6.2 


Lung ca. A549 


12.9 jBrain (Amygdala) Pool 


5.1 


Lung ca. NC1-H526 


0.0 Brain (cerebellum) 


6.8 


Lungca.NCI-H23 


2.3 Brain (fetal) 


4.5 


Lung ca. NCI-H460 


q 2 JBrain (Hippocampus) 
jPool 


6.7 


1 ung ca. 1 K >P-62 


5.0 ICerebral Cortex Pool 


6.7 


Lung ca. iN^i-rijzz 


q 4 jBrain (Substantia 
{nigra) Pool 


7.6 


Liver 


0.8 Brain (Thalamus) Pool 


9.0 


1 etal 1 iver 


4.8 Brain (whole) 


7.5 


Liver ca. HepG2 


0.0 jSpinal Cord Poo! 


4.6 


Kidney Pool 


11.7 


Adrenal Gland 


40.6 


Fetal Kidney 


4.5 


Pituitary gland Pool 


10.5 


Renal ca. 786-0 


2.5 


Salivary Gland 


5.7 


Renal ca. A498 


1.4 


1 hyroid (female) 


5.8 


Renal ca. ACHN 


22.2 


Pancreatic ca. 
CAPAN2 


43.8 
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Renal ca. UO-31 i 


32.8 


Pancreas Pool 


n.3 


Table CTD. Panel 4D 


Tissue Name 


Rel. Exp.(%) 
Ag3249, Run 
164390953 


Tissue Name 


Rel. Exp.(%) 
Ag3249, Run 
164390953 


Secondary Thl act 


0.2 


HUVEC IL-lbeta 


0.4 


Secondary Th2 act 


0.0 


HUVEC IFN gamma 


1.0 


Secondary Trl act 


0.0 


III IVEC I NI- alpha 
IFN gamma 


0.5 


Secondary Thl rest 


0.0 


HUVEC TNF alpha + 
II 1 


0.6 


Sec 'ii Jars 1 h2 rest 


- „ 

0.1 


III \ I i II -1 1 


0.9 


Secondary Trl rest 

_ — , — — 


0.0 


Lung Microvascular EC 
none 


1.4 


Primarv Thl act 


- 

0.2 

> ' - " 


Lung Microvascular EC 
TNFalpha i IL-lbeta 


4.5 


Primary Th2 act 
Primary Trl act 


0.0 
0.0 


Microvascular Dermal 
EC none 

Microsvasiilar Dermal 
EC I NI alpha + [L-lbeta 


0.9 
7.7 


Primary Thl rest 


0.1 


Bronchial epithelium 
rNFalpha t ILlbeta 


3.6 


Primary Th2 rest 


0.0 


Small airway epithelium 
n< »ne 


0.2 


Primary Trl rest 


0.1 


Small airway epithelium 
TNFalpha + IL-lbeta 


5.8 


CD45RA CD4 
lymphocyte act 


0.5 


Coronery artery SMC rest 


0.0 


CD45RO CD4 
lymphocyte act 


0.1 


Coronery artery SMC 
TNFalpha + IL-lbeta 


0.0 


CDS 1} tnphocyte act 


, 

0.6 


Astrocytes rest 


0.8 


Secondan CDS 
Kmphocvte rest 


0.0 


Astrocytes TNFalpha + 
IL-lbeta 


3.1 


Secondary CD8 
K mphoejte act 


0.1 


KU-812 (Basophil) rest 


2.7 


CD4 lymphocyte none 


I I ^11-812 (Basophil) 
PMA/ionomycin 


"~37~~] 


2rv Thl/Th2/Trl anti- 
CD95 (JIM 


0.1 


CCD1106 

(kei alinocUes) none 


~™ 

1.6 


LAK cells rest 


1.9 


CCD 1106 — 
(Keratinocytes) 
TNFalpha + IL-lbeta 


1.3 


LAK cells IL-2 


5.0 


Liver cirrhosis 


3.2 
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LAK cells IL-2+IL- 12 


1.6 


Lupus kidney 
NCI-H292 none 


1.3 


LAK cells IL-2+IFN 
gamma 


5.0 


100.0 


LAK cells IL-2+ IL-18 


4.3 (NCI-H292 IL-4 


87.7 


LAK cells 
PMA ionomycin 


6.1 ]NCI-H292 IL-9 


96.6 


NK Cells IL-2 rest 


12.7 NCI-H292 IL-13 


54.0 


Two W iv MLR 3 day 


U.6 NCI-H292 I FN gamma 


25.2 


Two Way MLR 5 day 


1.6 HPAECnone 


0.4 


Two Way MLR 7 day 


1 IP \l ( I NI alpha + IL- 
U " y i 1 beta 


6.3 


PBMC rest 


9.4 JLung fibroblast none 


0.1 


PBMC PWM 


, _ Lung fibroblast TNF j . 
alpha + 1L-1 beta 


r diviv_- j. n/\-i_, 


0.7 Lung fibroblast IL-4 j 0.7 


Ramos (B cell) none 


0.1 Lung fibroblast IL-9 0.0 


Ramos (B cell) 
ionomycin 


2.0 Lung fibroblast IL-13 


0.3 


B lymphocytes PWM 


27 JLung fibroblast IFN 1 Q4 
gamma 


B lymphocytes CD40L 
ana 


„ iDermal fibroblast j „ 1 
CCD! 070 rest 


EOL-1 dbcAMP 


? , Dermal fibroblast 

CCD1070 TNF alpha 


0.8 


PMA/ionomycin 


R 4 iDermal fibroblast 
CCD 1070 IL-I beta 


0.0 


Dendritic cells none 


^ 2 j Dermal fibroblast IFN 
jgamma 


0.0 


Dendritic cells LPS 


2.5 Dermal fibroblast IL-4 


(1.0 


Dendritic cells anti- 
CD40 


1.3 


IBD Colitis 2 


1.4 


Monocytes rest 


0.4 


IBD Crohn's 


4.2 


Monocytes LPS 




Colon 


4X.<» 


Ma. I >p'l Igcs est 


I.I lllll.' 


19.5 


Macrophages LPS 


17.9 Thymus 


: > 


HUVEC none 


0.8 .Kidney 


3 ') 


HUVEC starved 


11 





CNS_neurodegeneration_vl.0 Summary: Ag3249 This panel confirms the expression of 
this gene at low levels in the brains of an independent group of individuals. However, no 
differential expression of this gene was detected between Alzheimer's diseased postmortem 
brains and those of non-demented controls in this experiment. Please see Panel 1 .4 for a 
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discussion of the potential utility of this gene in treatment of central nervous system 
disorders. 

General_screening_panel_vl.4 Summary: Ag3249 The CG57466-01 gene encodes a 
protein with homology to beta-l,3-galactosyltransferases, which catalyze the formation of 
5 type I oligosaccharides (ref. 1). Expression of this gene is highest in a breast cancer cell 
line (CT = 28.1). In addition, expression of this gene appears to be upregulated in 
pancreatic and gastric cancer cell lines when compared to their respective normal tissues. 
Thus, therapeutic modulation of the activity of this gene or its protein product may be of 
benefit i n the treatment of breast, pancreatic and gastric cancer. 

10 This gene also shows significant levels of expression in trachea, bladder and fetal 

lung. Interestingly, CG57466-01 gene expression is much higher in fetal lung (CT = 28.3) 
than in adult lung (CT = 32.2), suggesting that expression of this gene can be used to 
distinguish adult from fetal lung. 

In addition, this gene is expressed at low levels in all regions of the central nervous 
1 5 system examined, including amygdala, hippocampus, substantia nigra, thalamus, 

cerebellum, cerebral cortex, and spinal cord. Therefore, this gene may play a role in central 
nervous system disorders such as Alzheimer's disease, Parkinson's disease, epilepsy, 
multiple sclerosis, schizophrenia and depression. 

j\mong tissues with metabolic or endocrine function, this gene is expressed at low 
20 to moderate levels in pancreas, adipose, adrenal gland, thyroid, pituitary gland, heart, fetal 
liver and the gastrointestinal tract. Therefore, therapeutic modulation of the activity of this 
gene may prove useful in the treatment of endocrine/metabolically related diseases, such as 
obesity and diabetes. 

References: 

25 1 . Shiraishi N, Natsume A, Togayachi A, Endo T, Akashima T, Yamada Y, Imai 

N, Nakagawa S, Koizumi S, Sekine S, Narimatsu H, Sasaki K. Identification and 
characterization of three novel beta 1,3-N-acetylglucosaminyltransferases structurally 
related to the beta 1,3-galactosyltransferase family. J Biol Chem 2001 Feb 2;276(5):3498- 
507 
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Panel 4D Summary: Ag3249 This transcript is most highly expressed in a cluster of 
treated and untreated samples derived from the NCI-H292 cell line, a human airway 
epithelial cell line that produces mucins (CTs = 30-32). Mucus overproduction is an 
important feature of bronchial asthma and chronic obstructive pulmonary disease samples. 
5 The transcript is also expressed at lower but still significant levels in small airway 

epithelium treated with IL-1 beta and TNF-alpha. The expression of the transcript in this 
mucoepidermoid cell line that is often used as a model for airway epithelium (NCI-H292 
cells) suggests that this transcript may be important in the proliferation or activation of 
airway epithelium. Therefore, therapeutics designed with the protein encoded by the 
10 transcript may reduce or eliminate symptoms caused by inflammation in lung epithelia in 
chronic obstructive pulmonary disease, asthma, allergy, and emphysema. 

CU. CG57468-01: multidrug resistance protein 1 

Expression of gene CG57468-01 was assessed using the primer-probe set Ag3250, 
described in Table CUA. Results of the RTQ-PCR runs are shown in Tables CUB. 

1 5 Table CUA . Probe Name Ag3250 



Primers Sequences 


s 

{Length 


Start 
Position 


1 SEQID 
NO: 


Forward IS ' -agcaggggaaattttaacgat-3 ' 




2391 


701 


p . jTET-5 ' -agacacttggccttcaaagccatgtt-3 1 - 

l rroDe Jtamra 




2419 


j 702 


Reverse |5 ' -caaac g ■ ^ catcctgata-3 ' 


22 


2446 


1 703 



Table CUB . General_screening_panel_vl.4 



Tissue Name 


Rel. Fx p.(%) 
Ag3250, Run 
214693636 


Tissue Name 


Rel. Exp.(%) 
Ag3250, Run 
214693636 


Adipose 


0.1 


Renal ca. TK-10 


0.0 


Melanoma* 
Hs688(A).1 


0.0 


Bladder 


0.0 


Melanoma* 
Hs688(B).T 


0.0 


Gastric ca. (liver met.) 
NCI-N87 


0.0 


Melanoma* M 14 


00 


Gastric ca KATO III 


0.0 


Melanoma* 
LOXIMVI 


0.0 


Colon ca. SW-948 


11.9 


Melanoma* SK- 
MEL-5 


0.6 


Colon ca. SW480 


0.0 
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Squamous cell 
carcinoma SCC-4 


" 

0.0 


Colon ca.* (SW480 
met) SW620 


0.0 


rest is Pool 


0.0 


Colon ca. HT29 


0.0 


Prostate ca.* (bone 
met) PC-3 


0.0 


Colon ca.HCT-116 


0.0 


Prostate Pool 


0.0 


Colon ca. CaCo-2 


0.0 


Placenta 


0 0 


Colon cancer tissue 




Uterus Pool 


0.0 


Colon ca. SW1116 


0.0 


Ovarian ca. 
OVCAR-3 


0.0 


Colon ca. Colo-205 


0.0 


Ovarian ca. SK- 
OV-3 


0.0 


Colon ca. SW-48 


0.0 


Ovarian ca. 

KJ V LAK-4 


0.0 


Colon Pool 


0.1 


Ovarian ca. 
OVCAR-5 


0.0 


Small Intestine Pool 


0.0 


Ovarian ca. 
IGROV-1 


0.0 

________ 


Stomach Pool 


0.0 


Ovarian ca. 
1 >VCAR-8 


0.0 


Bone Marrow Pool 


0.0 


Ovary 


0.0 


Fetal Heart 


0.0 


Breast ca. M( I -1 


0.0 


Heart Pool 


0.7 


Breast ca. MDA- 
MB-231 


0.0 


Lymph Node Pool 


0.8 


Breast ca. BT 549 


0.0 


Felal skeletal Muscle 


0.0 


Breast ca. IT1) 


0.0 


Skeletal Muscle Pool 


0.0 




0 3 


C^lppt, T) AA 1 

opieen .rooi 




Breast 1 I 


100.0 


Thymus Pool 


0.0 


Trachea 


0.0 


CNS cancer 
(glio/astro) U87-MG 


0.0 


Lung 


0.0 


CNS cancer 
(glio/astro) U-l 1 8-MG 


0.0 


Fetal Lung 


0.1 


CNS cancer 
(neuro;met) SK-N-AS 


0.0 


Lung ca. NCI-N417 


0.0 


CNS cancer (astro) SF 
539 


0.0 


Lung ca. LX-1 


—————— 

0.0 


CNS cancer (astro) 
SNB-75 


0.0 


Lung ca. NCI-H146 


0.0 


CNS cancer (glio) 
SNB-19 


0.0 


Lung ca. SHP-77 


0.0 


CNS cancer (glio) SF- 
295 


0.0 


Lung ca. A549 


0.0 


Brain (Amygdala) Pool 


0.0 


Lung ca. NCI-H526 


0.0 


Brain (cerebellum) 


0.0 
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Lung ca. NCI-H23 


"' 0.0 " " 


Brain (fetal) 


0.0 


Lung ca. NCI-H460 


0.0 


Brain (Hippocampus) 
Pool 


0.0 


Lung ca. HOP-62 


0.0 


Cerebral Cortex Pool 


0.0 


JLung ca. JNCl-HOzZ 


0.0 


Brain (Substantia 
nigra) Pool 


r — i 

0.0 


Liver 


9.5 


Brain (Thalamus) Pool 


0.0 


Fetal Liver 


21.0 


Brain (whole) 


0.0 


1 iver ca. Hep( 12 


0.0 


Spinal ( :ord Pool 


().() 


Kidney Pool 


17.4 


Adrenal Gland 


0.0 


Fetal Kidney 


6.5 


Pituitary uhrJ Pool 


0.0 


Renal ca. 786-0 


0.0 


Salivary Gland 


0.0 


Renal ca. A 4% 


0.0 


Thyroid (female) 


0.0 


Renal ca. ACHN 


0.0 


Pancreatic ca. 
CAPAN2 


0.0 


Renal ca. UO-31 


0.0 


Pancreas Pool 


0.0 



CNS_neurodegeneration_vl.O Summary: Ag3250 Expression of this gene is 
low/undetectable (CTs > 35) across all of the samples on this panel (data not shown). 



General_screening_panel_vl.4 Summary: Ag3250 Expression of the CG57468-01 gene 
is highest in normal breast (CT = 23.8). In addition, this gene is highly expressed in 
fetal/adult kidney and fetal/adult liver (CTs = 26-27). Thus, expression of this gene may be 
used to distinguish these tissues from the other samples on this panel. Strikingly, 
expression of this gene is much lower in breast, kidney, and liver cancer cell lines. 
Therapeutic modulation of the activity of this gene or its protein product may be of benefit 
in the treatment of these types of cancers. 

10 Panel 4D Summary: Ag3250 Expression of this gene is low/undetectable (CTs > 35) 
across all of the samples on this panel (data not shown). 

CV. CG59609-01: PEPTIDYL-PROLYL CIS-TRANS ISOMERASE A 

Expression of gene CG59609-01 was assessed using the primer-probe set Ag3494, 
described in Table CVA. Results of the RTQ-PCR runs are shown in Tables CVB and 
15 CVC. 




Table CVA . Probe Name Ag3494 

(Pjrjmerl Sequences _ I Le ng th Star t SEQ ID 









Position 


NO: j 


Forward 


5 1 -ccgttctatcagccatggt-3 ' 


19 


3 


704 


Probe 


TET-5 ' -ccccaccaggttcttagacatcatcg-3 ' - 
TAMRA 


26 


25 


705 


jReverse 


5 1 -aaggagacacgtcccaagag-3 1 


20 


62 


706 1 



Table CVB . General_screening_panel_vl.4 



I Tissue Name 


Rel. Exp.(%) 
Ag3494, Run 
217215292 


Tissue Name 

_™ - .... „ _ 


Rel. Exp.(%) 
Ag3494, Run 
21 7215202 


(Adipose 


0.0 


Renal ca. TK-10 


6.0 


[Melanoma* 


0.0 


Bladder 


0.0 


(Melanoma* 


0.0 


Gastric ca. (liver met.) 

JNCl-JNo / 


0.0 


Melano na ' Ml 4 


25 s 


Gastric ca KATO III 


0.0 


Melanoma* 
LOXIMVI 


0.0 


Colon ca. SW-948 


0.0 


Melanoma* SK- 
MEL-5 


0.0 


Colon ca. SW480 


0.0 


Squamous cell 
carcinoma SCC-4 


33.4 


Colon ca.* 
met) SW620 


0.0 


1 estis Poo! 


100.0 


Colon ca. HT29 


0.0 


Prostate ca.* (bone 
imet) PC-3 


0.0 


Colon ca. HCT-116 


0.0 


Prostate Pool 


0.0 


C olon ca ( *i( o-2 


0.0 


Placenta 
1 fterus Pool 


0.0 
0.0 


Colon cancer tissue 
Colon ca. SW1 116 


0.0 
0.0 


Ovarian ca. 
OVCAR-3 


0.0 


Colon ca. Colo-205 


0.0 


Ovarian ca. SK- 
OV-3 


0.0 


Colon ca. SW-48 


0.0 

... . 


Ovarian ca. 
OVCAR-4 


0.0 


Colon Pool 


0.0 


Ovarian ca. 
OVCAR-5 


0.0 


Small Intestine Pool 


0.0 I 


Ovarian ca. 
IGROV-1 


79.0 


Stomach Pool 


0.0 


Ovarian ca. 
OVCAR-8 


0.0 


Bone Marrow Pool 


0.0 


Ovary 


0.0 


Petal Heart 


0.0 


Breast ca VKT-7 


0.0 


Heart Pool 


0.0 


Breast ca. MDA- 
MB-231 


41.8 


Lymph Node Pool 


0.0 
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Breast ca. BT 549 


0.0 


Fetal Skeletal Muscle 


0.0 


Breast ca. T47D 


0.0 


Skeletal Muscle Pool 


— — - - 
0.0 


JjrCaSi Ca. lVJULJ/\-iN 




Spleen Pool 




Breast Pool 


0.0 


Thymus Pool 


0.0 


Trachea 


0.0 


CNS cancer 
(glio/astro) U87-MG 


0.0 


Lung 


0.0 


CNS cancer 
(glio/astro) U-l 18 MG 


0.0 


, — — — ■ , 

Fetal Lu ng 


0.0 


CNS cancer 
»iwuro:meU SK-VAS 


0.0 


Lung ca.NCI-N417 


0.0 


CNS caiiLcr (astro) Sr- 
539 


0.0 


Lung ca. LX-1 


0.0 


CNS cancer (astro) 
SNB-75 


0.0 


Lung ca. NCI-H146 


0.0 


CNS cancer (glio) 
SNB-19 


0.0 


Lung ca. SHP-77 


40.6 


CNS cancer (glio) SF- 
295 


0.0 


Lung ca. A549 


0.0 


Brain (Amygdala) Pool 


0.0 


Lung ca. NCI-H526 


0.0 


Brain (cerebellum) 




Lung ca. NCI-H23 


oTcf 


Brain (fetal) 




Lung ca. NCI-H460 


0.0 


Brain (Hippocampus) 
Pool 


0.0 
~i)0 


Lung ca. HOP-62 


0.0 


Cerebral Cortex Pool 


Lung ca. NCI-H522 


0.0 


Brain (Substantia 
nigra) Pool 


0.0 


Liver 


0.0 


Brans (Thalamus) Po< >1 


0.0 


Fetal Liver 


0.0 


Brain (whole) 


0.0 


Liver ca.. HepG2 


0.0 


Spinal Cord Pool 


0.0 


Kidney Pool 


47.0 


Adrenal Gland 


0.0 


Fetal Kidney 


0.0 


Pituitary gland Pool 


0.0 


Renal ca. 7 S6-0 


0.0 


Salivary Gland 


0.0 


[RenaTca. A498 


0.0 


IV p-nd ffjn.a'Lj 


""""b.o 


Renal ca. ACHN 


0.0 


Pancreatic ca. 
CAPAN2 


0.0 




jRenalca. UO-31 


0.0 


Pancreas Pool 


°:° 



Table C VC. Panel 4D 





j Rel. Exp.(%) 




Rel. Exp.(%) I 


Tissue Name 


3 Ag3494, Run 


Tissue Name 


Ag3494, Run I 




| 166441744 




166441744 


Sei ond ii \ , h 1 act 


0.0 


HUVEC IL-lbeta 


0.0 


Secondan Th2 act 


9-0 


III A IC IFN gamma 


0.0 
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Secondary Trl act 


0.0 


HUVEC TNF alpha + 
IFN gamma 


0.0 


Secondary Thl rest 


0.0 


HU\ EC I'M- alpha + 
IL4 


0.0 


Secondary Th2 rest 


1 o.o 


HUVEC 11 1 1 


0.0 


Secondary Trl rest 


0.0 


Lung Microvascular EC 
none 


0.0 


Primary Thl act 


0.0 


Lung Microvascular EC 
1 NFalpha + IL-lbeta 


10.2 


Primary Th2 act 


A A Microvascular Dermal 
0.0 § 

IX none 


0.0 


Primary Trl act 


„ „ (Microsvasular Dermal 

EC TNFalpha + II -lbeta 


0.0 


Primary Thl rest 


0.0 


Bronchial epithelium 
TNFalpha + IL lbeta 


0.0 


Primary Th2 rest 


0.0 


Small airway epithelium 
none 


0.0 


Primary Trl rest 


0.0 


Small airway epithelium 
TNFalpha + IL-lbeta 


0.0 


CD45RA CD4 


0.0 


Coronery artery SMC rest 


0.0 


CD45RO CD4 
lymphocyte act 


„ _ iCoronery artery SMC 
TNFalpha + I L- lbeta 


0.0 


CDS lymphocyte act 


0.0 Astrocytes rest 


0.0 


Secondary CD8 
lymphocyte rest 


0.0 


Astrocytes TNFalpha + 
IL-lbeta 


0.0 


Secondary CD8 
lymphocyte act 


0.0 


KU-8 12 (Basophil) rest 


0.0 


CD4 lymphocyte none 


0Q jKU-8 12 (Basophil) 
jPMA/ionomycin 


0.0 


2ryThl/Th2/Trl_anti- 


00 (CCD1106 

! Kcratinocytes) none 


0.0 


LAK cells rest 


[CCD1106 
0.0 l(Keratinocytes) 

TNFalpha : IL-lbeta 


0.0 


LAK cells IL-2 


0.0 JLiver cirrhosis 


100.0 


LAK cells IL-2+IL-12 


5.6 Lupus kidney 


4.6 


LAK cells IL-2+IFN 
gamma 


0.0 NCI-1 1292 none 


0.0 


LAK cells IL-2+ IL-18 


0.0 .NCI-H292 1L-4 


0.0 


LAK cells 
PMA/ionomycin 


0.0 |NCI-H292 IL-9 


0.0 


NK Cells IL-2 rest 


0.0 NCI-H292 1L-13 


0.0 


Two Way MLR 3 day 


0.0 


NCI-H292 IFN gamma 


0.0 


Two Way MLR 5 day 


0.0 


HPAEC none 


0.0 
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Two Way MLR 7 day 


n IHPAEC TNF alpha + IL- 
U U [l beta 


0.0 


PBMC rest 


0.0 Lung fibroblast none 


0.0 


PBMC PWM 


^Lung fibroblast TNF 
00 (alpha + IL-1 beta 


0.0 


PBMC PHA-L 


0.0 'Luna fibroblast IL-4 


0.0 


Ramos (B cell) none 


0.0 1 ung fibroblast IL- 1 ) 


o.o 1 


Ramos (B cell) 
ionomycin 


0.0 jLung fibroblast IL-1 3 


0.0 


B lymphocytes PWM 


q q Lung fibroblast IFN 
gamma 


0.0 


B lymphocytes CD40L 
and IL-4 


„ n iDermal fibroblast 
{CCD1070 rest 


7.7 


EOL-1 dbcAMP 


0 „ (Dermal fibroblast 

CCD 1070 TNF alpha 


0.0 


EOL-1 dbcAMP 
PM A ionomycin 


0.0 


Dermal fibroblast 
CCD 1070 IL-1 beta 


1.4 


Dendritic cells none 


0.0 


Dermal fibroblast II N 
gamma 


0.0 


Dendritic cells LPS 


0.0 


Dermal fibroblast IL-4 


0.0 i 


Dendritic cells anti- 
CD40 


0.0 


IBD Colitis 2 


26.2 


Monocytes rest 


0.0 |IBD Crohn's 


13.8 


Monocytes LPS 


0.0 iColon 


17.6 


Macrophages ivsi 


0.0 Lung 


8.1 


Macrophages LPS 


0.0 


Thymus 


16.5 


HUVEC none 


0.0 


Kidney 


0.0 


HUVEC starved 


0.0 







CNS_neurodegeneration_vl.O Summary: Ag3494 Expression of this gene is 
low/undetectable (CTs > 35) across all of the samples on this panel (data not shown). 



Generall_screening_panel_vl.4 Summary: Ag3494 Expression of the CG59609-01 gene 
is highest in testis (CT = 34.3). In addition, low but significant expression of this gene is 
5 detected in a breast cancer cell line and an ovarian cancer cell line. Thus, expression of this 
gene may be used to distinguish these samples from the other samples on this panel. 
Furthermore, therapeutic modulation of the activity of this gene may be of benefit in the 
treatment of fertility, breast cancer, and ovarian cancer. 

Panel 4D Summary: Ag3494 Expression of the CG59609-01 gene is highest in a liver 
1 0 cirrhosis sample (CT = 34.3). In addition, low but significant expression of this gene is 
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detected in samples from thymus as well as from normal and IBD colon. Thus, expression 
of this gene may be used to distinguish these samples from the other samples on this panel. 
Furthermore, therapies designed with the protein encoded for by this gene may potentially 
modulate liver function and play a role in the identification and treatment of inflammatory 
or autoimmune diseases which effect the liver including liver cirrhosis and fibrosis. 

CW. CG59613-01: PROLIFERATING CELL NUCLEAR ANTIGEN 

Expression of gene CG596 13-01 was assessed using the primer-probe set Ag3496, 
described in Table CWA. Results of the RTQ-PCR runs are shown in Tables CWB and 
CWC. 



Table CWA . Probe Name Ag3496 



jPrimers 


Sequences 


Length 


Start 
Position 


SEQID 
NO: 


For\\ arc 


5 1 -cacataccactgtgaccacaac-3 1 


22 


238 


707 


|Probe 


TET-5 ' - cct caccagcatgtccaaaatgctaa- 3 ' - 
TAMRA 


26 


277 


708 


jReverse 


5 ' -tgtcttcactg :att tqta 3' 


22 


305 


709 



Table CWB . General_screening_panel_vl.4 



Tissme Name 


Rel. Exp.(%) 
Ag3496, Run 
217217871 


Tissue Name 


Rel. Exp.(%) 
Ag3496, Run 
217217871 


Adipose 


9.6 


Renal ca. TK-10 


21.8 


Melanoma* 
Hs688(A).T 


25.2 


Bladder 


11.4 


Melanoma* 
1 is688(B).T 


20.4 


Gastric ca. (liver met.) 
NCI-N87 


3.9 


Melanoma* M14 


0.0 


Gastric ca. KATO III 


1.1 


Melanoma* 
LOXIMVI 


0.0 


Colon ca. SW-948 


0.0 


Melanoma* SK- 
MEL-5 


1.5 


Colon ca. SW480 


1.7 


Squamous cell 
carcinoma SCC-4 


5.4 


Colon ca.* (SW480 
met) SW620 


2.0 


Testis Pool 


56.3 


Colon ca. HT29 


3.9 


Prostate ca.* (bone 
met) PC-3 


2.6 


Colon ca. HCT-116 


12.9 


Prostate Pool 


37.6 


Colon ca. CaCo-2 


8.4 
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Plciccntsi 


0 0 


Colon cancer tissue 


1 4 


Uterus Pool 


21.5 


Colon ca. SW1I16 


o.o 


Ovarian ca. 
OVCAR-3 


18.3 


Colon ca. Colo-205 


0.0 


Ovarian ca. SK- 
OV-3 


1.9 


Colon ca. SW-48 


0.0 


Ovarian ca. 
OVCAR-4 


8.0 


Colon Pool 


29.9 


Ovarian ca. 
OVCAR-5 


2.9 


Small Intestine Pool 


57.8 


Ovarian ca. 
IGROV-1 


0.0 


Stomach Pool 


38.7 


Ovarian ca. 
OVCAR-8 


0.0 jBone Marrow Pool 


6.3 


Ovai) 


4.9 


Fetal Heart 


6.2 


Breast ca. MCF-7 


4.3 


Heart Pool 


12.3 


Breast ca. MDA- 
MB-231 


6.4 


Lymph Node Pool 


64.6 


Breast ca. BT 549 


4.7 Fetal Skeletal Muscle 


31.6 


Breast~caT~T47D " 


\23 'Skeletal Muscle Pool 


0 0 


Breast ca. MDA-N 


0.0 


Spleen Pool 


1.7 


Breast Pool 


47.6 


1 h> mus Pool 


47.3 


Trachea 


26.1 


CNS cancer 
(glio/astro) U87-MG 


2.3 


Lung 


7.7 


CNS cancer 
(glio/astro) U-118-MG 


22.5 


Fetal Lung 


82.9 


CNS cancer 
(neuro;met) SK-N-AS 


0.9 


Lung ca.NCI-N417 


0.0 


( NS cancel (astro) Sf - 
539 


1.8 


Lung ca. LX-1 


0.0 


CNS cancer (astro) 
SNB-75 


——————— 

35.4 

I „ „ 


Lung ca. NCI-H146 


0.0 


CNS cancer (glio) 
SNB-19 


0.0 


Lung ca. SHP-77 


3.7 


CNS cancer (glio) SF- 
295 


. 

9.2 


Lung ca. A549 


0.0 


Rutin { Amvgdala) l\-ol 


0.0 


Lung ca. NCI-H526 


0.0 


Brain (c< rcbellum) 


0.0 


Lung ca. NCI-H23 


46.0 


Brain (fetal) 


23.8 


Lung ca. NCI-H460 


0.5 


Brain (Hippocampus) 
Pool 


" 0.0 


Lung ca. HOP-62 


7.5 Cerebral C ortex Pool 


r~ ' - 1.6 


Lung ca. NCI-H522 


2.1 


Brain (Substantia 
nigra) Pool 


3.8 
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Liver 


0.0 


Brain (Thalamus) Pool 


3.2 


Fetal Liver 


7.9 


Brain (whole) 
Spinal Cord Pool 


~" 4.2 
1.9 


Liver ca. HepG2 


0.0 


Kidney Pool 


80.7 (Adrenal Gland 


1.7 


Fetal Kidney 


1 00.0 {Pituitary gland Pool 


0.7 


Renal ca. 786-0 


13.3 


Salivary Gland 


6-2 


Renal ca. A498 


0.0 


Th> roid \ female) 


_ 2.6 


Renal ca. ACHN 


23.8 


Pancreatic ca. 
CAPAN2 


5.5 


Renal ca. UO-31 


13.7 


Pancreas Pool 


48.6 



Table CWC . Panel 4D 



Tissue Name 


Rel. Exp.(%) 

Ag3496, Run f Tissue Name 
166441888 


Rel. Exp.(%) 
Ag3496, Run 
l(»(»441SSS 


Secondary Thl act 


0.0 |HUVEC IL-lbeta 


0.7 


Secondary I'h2 act 


0.7 1 11 IVECIFN gamma 


1 0 


Secondary Trl act 


Q0 HUVEC TNF alpha + 
JIFN gamma 


0.0 


Secondary Thl rest 


0.7 


HUVEC TNF alpha + 
IL4 


0.6 


Secondary Th2 rest 


1.2 


HUVEC I L- 11 


04 


Secondary Trl rest 


q 4 Lung Microvascular EC 

Inone 


a—. 


Primary Th 1 act 


0 , iLung Microvascular EC 
|TNFalpha + IL-lbeta 


1.0 


Primary Th2 act 


^ 2 JMicrovascular Dermal 
[EC none 


0.4 


Primary Trl act 


Miciosvasular Dermal 
IEC TNFalpha + IL-lbeta 


0.2 


Primary Thl rest 


1.5 


Bronchial epithelium 
I NI alpha - ILlbeta 


5.2 


Primary Th2 rest 


1.0 


Small airway epithelium 
none 


6.6 


Primary Trl rest 


1.5 


Small airway epithelium 
TNFalpha + IL-1 beta 


100.0 


CD45RACD4 
lymphocyte act 


2.1 


Coronery artery SMC rest 


2.6 

I 


CD45RG CD4 
lymphocyte act 


0.9 


Coronery artery SMC 
f\l alpha + IL-1 beta 


1.5 


CDS h t lphoo te aa 


2 . 1 (Astrocytes rest 


13.1 


Secondary CD8 
lymphocyte rest 


n ~ Astrocytes TNFalpha + 
00 [IL-lbeta 


7.4 
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Secondary CD8 
lymphocyte act 


0.0 


1 " " ' 

KU-812 (Basophil) rest 4.2 


CD4 lymphocyte none 


1.8 


KU-8 12 (Basophil) 
PMA/ionomycin 


5.2 


2ry Thl/Th2/Trl_anti- 
CD95 CHI 1 


0? |CCD1 106 

j(Keratinocytes) none 


17.9 


LAK cells rest 


ccbi 106 

0.0 l(Keratinocytes) 

TNFalpha i IL-lbeta 


82.4 


L \K cells IL-2 


2.3 {Liver cirrhosis 


17.2 


LAK cells IL-2+IL-12 


1 .7 jLupus kidney 


9.3 


LAK cells IL-2+1FN 
gamma 


2.3 [NCI-H292 none 


0.3 


LAK cells IL-2+IL-18 


3.1 : NCI-H292 IL-4 0.8 


LAK cells 
PMA. ionomycin 


0.2 JNCI-H292 IL-9 3.0 


NK Cells IL-2 rest 


2.7 ]NCI-H292 IL-13 


1.9 


Two Way MLR 3 day 


3.3 .NCI-H292 IFN gamma 


0.5 


Two Way MLR 5 day 


0.0 HPAEC none 


0.3 
0.6 


Two Way MLR 7 day 


n ., MP \EC TNFalpha + IL- 
U " 1 beta 


F'BMC rest 


1.0 Lun z i > 'ot none 


12 ? 


PBMC PWM 


j 7 Lung fibroblast TNF 
jalpha + IL-1 beta 


2.9 


PRMr PT-TA T 

iDML/ J. n/\-L 


0.7 Lung fibroblast IL-4 


11.0 


Ramos ( B cell) none 


0.0 Lung fibroblast IL-9 


5.3 


Ramos (B cell) 
ionomycin 


0.0 [Lung fibroblast IL-13 


10.4 


B lymphocytes PWM 


q p (Lnng fibroblast IFN 
Jgamma 


11.9 


B lymphocytes CD40L 
and IL-4 


? „ Dermal fibroblast 
CCD1 070 rest 


11.7 


EOL-1 dbcAMP 


^ „ ;Dermal fibroblast 

CCD 1070 TNF alpha 


5.8 


(- ( )| . | ( |[-, c \fvfp 

PM Vionomycin 


Q (Dermal fibroblast 
,CCD1070 IL-1 beta 


2.3 


Dendritic cells none 


0.1 


Dermal fibroblast IFN 
gamma 


8.8 


Dendritic colls 1 PS 


0.7 


Dermal fibroblast IL-4 


11.3 


Dendritic cells anti- 
CD40 


0.6 


IBD Colitis 2 


2.6 


\lon< ic) tes rest 


0.7 


IBD ( rohn's 


1.3 


Monocytes LPS 


0.0 


Colon 


, 114 1 


Macrophages rest 


0.0 


Lung 


19 1 
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Macrophages LPS 


0.0 


(Thymus 


T~ 




III MC none 


1.0 


Kidne) 




7.2 


iHUVEC starved 


1.4 









CNS_neurodegeneration_vl.O Summary: Ag3496 Expression of this gene is 
low/undetectable (CTs > 35) across all of the samples on this panel (data not shown). 



Genera l_screening_panel_vl. 4 Summary: Ag3496 Expression of the CG59613-01 gene 
is highest in fetal and adult kidney (CTs = 31). This gene is also expressed at higher levels 
5 in fetal lung (CT = 3 1 .4) than in adult lung (CT = 34.8), suggesting that expression of this 
gene can be used to distinguish adult and fetal lung and that this gene may play a role in 
lung development and regeneration. Differentially higher expression in fetal tissues also 
occurs in brain and skeletal muscle. 

In general, expression of this gene is associated with normal tissues rather than 
10 cancer cell lines. Specifically, CG596 13-01 gene expression is downregulated in 

pancreatic, colon, gastric, renal, lung, breast and prostate cancer cell lines when compared 
to their respective normal tissues. Therefore, therapeutic modulation of the activity of this 
gene may be of benefit in the treatment of these cancers. 

Among tissues with metabolic or endocrine function, this gene is expressed at low 
15 levels in pancreas, adipose, adrenal gland, fetal skeletal muscle, and the gastrointestinal 
tract. Therefore, therapeutic modulation of the activity of this gene may prove useful in the 
treatment of endocrine/metabolically related diseases, such as obesity and diabetes. 

Panel 4D Summary: Ag3496 Expression of the CG59613-01 gene is highest in small 
airway epithelium treated with TNF alpha and IL-1 beta (CT = 29.4). In addition, this gene 
20 is substantially upregulated in keratinocytes treated with TNF alpha and IL-1 beta. Low 
expression of this gene is also seen in lung and dermal fibroblasts independent of 
treatment. Therefore, therapeutics designed with the protein encoded by the transcript may 
reduce or eliminate symptoms caused by inflammation of the lung and skin in chronic 
obstructive pulmonary disease, asthma, allergy, emphysema, and psoriasis. 

25 CX. CG59619-01 : ACTIN, CYTOPLASMIC 2 
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Expression of gene CG596 19-01 was assessed using the primer-probe set Ag3498, 
described in Table CXA. Results of the RTQ-PCR runs are shown in Tables CXB and 
CXC. 



Table CXA . Probe Name Ag3498 



Primers 


Sequences 


I 

(Length 

i 


Start 
Position 


SEQ ID 

NO: 


Foru arci 


5 1 -tgatatggacatccccaaag-3 1 


; 20 


860 


710 


Probe 


TET-5 1 -acctgtacgccaacacagtgctgtct-3 1 - 
TAMRA 


I 26 


880 


711 


Reverse 


5 ' -atctccttctgcatcctattgg-3 ' 


■ 22 


934 


712 



Table CXB . General_screening_panel_vl.4 



' ™ 1 

Tissue Name 


Rel. Exp.(%) 
Ag3498, Run 
217217873 


Tissue Name 


Rel. Exp.(%) 
Ag3498, Run 
217217873 


Adipose 


4.7 


Renal ca. TK- 10 


0.0 


Melanoma* 
Hs688(A).T 


0.0 


Bladder 


3.3 


Melanoma* 
Hs688(B).T 


0.0 


Gastric ca. (liver met.) 
NCI-N87 


0.0 


Melanoma* Ml 4 


0.0 


Gastric ca. kATO III 


0.0 


Melanoma* 
LOXIMVI 


0.0 


Colon ca. SW-948 


0.0 


Melanoma* SK- 
MEL-5 


_ _ 


Colon ca. SW480 


0.0 


Squamous cell 
carcinoma SC( -4 


0.0 


Colon ca.* (SW480 
met) SW620 


0.0 


Testis Pool 


- 1 


Colon ca. HT29 


0.0 


Prostate ca.* (bone 
met) PC-3 


0.0 


Colon ca.HCT-116 


8.7 


Prostate Pool 


0.0 


Colon ca. CaCo-2 


4.9 


Placenta 


0.0 


C olen cancer tissue 


0.0 | 


Uterus Pool 


0.0 


Colon ca. SW1116 


0.0 


Ovarian ca. 
OVCAR-3 


6.6 


Colon ca. Colo-205 


0.0 


Ovarian ca. SK- 
OV-3 


9.5 


Colon ca. SW-48 


0.0 


Ovarian ca. 
OVCAR-4 


0.0 


Colon Pool 


18.3 


Ovarian ca. 
OVCAR-5 


17.9 


Small Intestine Pool 


26.2 
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Ovarian ca. 
IGROV-1 


0.0 


Stomach Pool 


7.2 


Ovarian ca. 
OVCAR-8 


0.0 


Bone Marrow Pool 


0.0 


Ovary 


2.9 


Fetal Heart 


6.4 


Breast ca. MCF-7 


0.0 


Heart Pool 


11.2 


Breast ca. MDA- 
MB-231 


0.0 


Lymph Node Pool 


34.2 


Breast ca. BT 549 


8.4 






Breast ca. T47D 


2.5 


skeletal Muscle Pool 


10.8 


Breast ca. MDA-N 


0.0 


Spleen Pool 


4.7 


Breast Pool 


13 7 


Thymus Pool 


27.0 


Trachea 


5.4 


CNS cancer 
(glio/astro) U87-MG 


0.0 


Lung 


7.0 


CNS cancer 
(glio/astro) U-118-MG 


2.2 


Fetal Lung 


7.9 


CNS cancer 
(neuro:met) SK-N-AS 


0.0 


Lung ca. NCI-N417 


0.0 


CNS cancer (astro) SF- 
539 


0.0 


, ~ 

Lung ca. LX-1 


- 

0.0 


CNS cancer (astro) 
SNB-75 


0.0 




Lung ca. NCI-H146 

________ 


3.1 


CNS cancer (glio) 
SNB-19 


0.0 


Lung ca. SHP-77 


0.0 


CNS can er ( 2,\u>) sF 
295 


0.0 


Lung ca. A549 




Brain (Amygdala) Pool 




Lung ca. NCI-H526 


0.0 


Brain (cerebellum) 


3.2 


Lung ca. NCI-H23 


0.0 


Brain (fetal) 


12.9 


Lung ca. NCI-H460 


0.0 


Brain (Hippocampus) 
Pool 


6.0 


Lung ca. HOP-62 


0.0 


Cerebral Cortex Pool 


4.9 


Lung ca. NCI-H522 


0.0 


Brain (Substantia 
nigra) Pool 


26.6 


Liver 


0.0 


Bruin (Thalamus) Pool 


37.9 


Fetal Li ver 


0.0 


Brain (whole) 


0.0 


Liver ce.. HepG2 


on 


Spinal Cord Pool 


0.0 


kidne\ Pool 


100.0 


Adrenal Gland 


o.o - 


Fetal Kidney 


4.4 


Pltllitan gl 11 1.1 Pool 


0.0 


Renal ca. 786-0 


0.0 


Salivary Gland 


00 


Renal ca. A498 


0.0 


I hyiou. 'temale) 


"o.o 


Renal ca. ACHN 


0.0 


Pancreatic ca. 
CAPAN2 




Renal ca. UO-3 1 


0.0 


Pancreas Pool 
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Table C XC. Panel 4. ID 



Tissue Name 


Rel. Exp.(%) S 

Ag3498, Run Tissue Name 
169839576 


Rel. Exp.(%) 
Ag3498, Run 
169839576 


Secondsiry Th 1 act 


78.5 HUVEC IL-lbeta 


11.3 


Second: ry Hi2 act 


17.0 HUVEC IFN gaiiinu. 


12.4 


Secondary Trl act 


^ y 9 [HUVEC TNF alpha + 
([IFN gamma 


15.7 


Secondary Thl rest 


5.2 


HUVEC TNF alpha + 
IL4 


12.5 


Secondary Th2 rest 


7.0 


HI IVEC II -1 1 


5.4 


Secondary Trl rest 


5.5 


Lung Microvascular EC 
none 


17.2 


Primary Thl act 


14.8 


Lung Microvascular EC 
TNFalpha + IL-lbeta 


7.6 


jPrimary Th2 act 


16.0 


Microvascular Dermal 
EC none 


8.1 


(Primary Trl act 


, t - IMicrosvasuIar Dermal 

EC 1 ilph i i [I -Ibel i 


8.0 


Primary Th 1 rest 


8.2 


Bronchial epithelium 
TNFalpha + IL1 beta 


5.3 


primary Th2 rest 


7.1 


Small airway epithelium 
none 


4.3 


(Primary Trl rest 


8.1 


Small airway epithelium 
TNFalpha + IL-lbeta 


6.4 


CD45RA CD4 
lymphocyte act 


15.8 


Coronery artery SMC rest 


5.8 


CD45RO CD4 
j lymphocyte act 


1 - q iCoronery artery SMC 
I NI alpha + IL-lbeta 


5.9 


CD8 lymphoc>lc act 


19.6 \ trocytcs rest 


6.2 


[Secondary CD8 
.lymphocyte rest 


1 . n jAstrocytes TNFalpha + 
lIL-lbeta 


4.6 


(Secondary CD8 
[lymphocyte act 


9.7 iKU-812 (Basophil) rest 


34.4 


|CD4 lymphocyte none 


43 KU-81 2 (Basophil) 
VM A/ionomycin 


31.9 


2ry ThlOWTrl anti- 
Cl)95 CH11 


61 CCD 1106 

ij(Keratinocytes) none 


28.9 


LAK cells rest 


|CCD1106 
12.6 j(Keratinocytes) 

jTNFalpha + IL-lbeta 


34.2 


LAK cells IL-2 


9.9 iLiver cirrhosis 


1.4 


LAK cells IL-2+IL-12 


12.6 


NCI-H292 none 


9.3 


LAK cells IL-2+IFN 


9.4 


NCI-H292 IL-4 


13.1 
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gamma j 




LAK cells IL-2+ IL-18 


12.5 


|NCI-H292 IL-9 


17.7 


PMA/ionomycin 


13.9 


|NCI-H292 IL-13 


13.6 


NK Cel s [L-2 rest 


L 5 .:L 


NCI-H292 U N gamma 


22.2 


Two Way MLR 3 day 


13.0 


(HPAEC none 


4.4 


Two Way MLR 5 day 


14.4 


IHPAEC TNF alpha + IL- 
Jl beta 


12.4 


Two Way MLR 7 day 


8.8 Lun ib ^..st none 


5.7 


PBMC rest 


6.6 


Lung fibroblast TNF 
[alpha + 1L-1 beta 


14.6 


PBMC PWM 


16.8 


1 ung Slbt oblast !L-4 


8.8 


PBMC PHA-L 


L_ IH 


Lung fibroblast IL 9 




Ramos (B cell) none 


13.0 

r-ii-i,.-.,,,.-., , m ,n, m . 


iLung fibroblast IL-13 




Ramos (B cell) 
lionomycin 


11.4 


! Lung fibroblast IFN 
earn ma 


° — 

17 X 


B lymphocytes PWM 


13.7 


Dermal fibroblast 
CCD 1070 rest 


9.8 


B lymphocytes CD40L 
and IL-4 


13.8 


[bermal fibroblast 
CCD 1070 TNF alpha 


10.8 


EOL-1 dbcAMP 




(Dermal fibroblast 
CCD 1 070 II 1 ! ,i i 


11.2 

- - ~-~ 


EOL-1 dl • YMP 

iPMA/ionomycin 


100.0 


Dermal fibroblast IFN 
[gamma 


19.8 


Dendritic cells none 


14.4 


Dcrnruu filx.oblasi 11 -4 


9.7 


Dendritic cells LPS 


9 : 8 


Dennal Fibroblasts rest 


7.7 


Dendritic cells anti- 
ICD40 


13.2 


[Neutrophils TNFa+LPS 


0.7 


Monocytes rest 


8.9 


•Neutrophils rest 


1.1 


[Monocytes LPS 


32.5 


[Colon 


4.4 


Macrophages rest 


12.6 


iLung 


_ 5 J 


Macrophages LPS 


17.3 


Thymus 


73 


III VIC none 


6.2 


iKidney 




jHUVEC starved 


100 


— 



CNS_n(eurodegeneration_vl.0 Summary: Ag3498 Expression of this gene is 
low/undetectable (CTs > 35) across all of the samples on this panel (data not shown). 



General_screening_panel_vl.4 Summary: Ag3498 The CG596 19-01 gene is only 
expressed at detectable levels in the adult kidney (CT = 34.2). Thus, expression of this gene 
can be used to distinguish kidney from the other samples on this panel. In addition, 
expression of this gene is much lower in fetal kidney (CT = 38.7), suggesting that this gene 
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can be used to distinguish between the fetal and adult source of this tissue. Furthermore, 
this gene is not expressed at detectable levels in renal cancer cell lines. Therefore, 
therapeutic modulation of this gene may be of use in the treatment of renal cell carcinoma. 

Panel 4.1D Summary: Ag3498 Expression of the CG59619-01 gene is highest in 
5 activated eosinophils (CT = 25.7), displaying 10-fold upregulation when compared to the 
control eosinophils. Therefore, therapies designed with the protein encoded for by this gene 
could block or inhibit inflammation or tissue damage due to eosinophil activation in 
response to asthma, ulcerative colitis and parasitic diseases. 

The CG59619-01 gene is expressed at moderate levels in the majority of samples 
on this panel, including T cells, B cells, endothelial cells, macrophages, monocytes, 
dendritic cells, basophils and peripheral blood mononuclear cells, as well as epithelial and 
fibroblast cell types from lung and skin, and normal tissues represented by colon, lung, 
thymus and kidney. This ubiquitous pattern of expression suggests that this gene product 
may be involved in homeostatic processes for these and other cell types and tissues. 
Therefore, modulation of the gene product with a functional therapeutic may lead to the 
alteration of functions associated with these cell types and lead to improvement of the 
symptoms of patients suffering from autoimmune and inflammatory diseases such as 
asthma, allergies, inflammatory bowel disease, lupus erythematosus, psoriasis, rheumatoid 
arthritis, and osteoarthritis. 

20 CY. CG5962 1-0 1 : SELENIDE, WATER DIKINASE 1 

Expression of gene CG59621-01 was assessed using the primer-probe set Ag3764, 
described in Table CYA. 



Table CYA . Probe Name Ag3764 



(Primers 

1 


Sequences 


Length 


Start 
Position 


SEQID 
NO: 


[Forward 


5 ' -catgttcagcctcatgcat-3 1 


__19_ 


925 


713 


IProbe 


TET-5 1 -agacctcaggcggccttctgatct-3 ' - 

TAMRA 


24 


957 


~ 714 


[Reverse" 


5 ' -ctgcttgctgacatggtaaac-3 ' 


21 


981 


715 



o 

5 io 



W 15 
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General_screening_panel_vl.4 Summary: Ag3764 Results from one experiment with 
the CG59621-01 gene are not included. The amp plot indicates that there were 
experimental difficulties with this run. 

Panel 4.1D Summary: Ag3764 Expression of this gene is low/undetectable (CTs > 35) 
5 across all of the samples on this panel (data not shown). 

CZ. CG59625-01: GLUCOSE TRANSPORTER TYPE 3 

Expression of gene CG59625-01 was assessed using the primer-probe set Ag3499, 
described in Table CZA. Results of the RTQ-PCR runs are shown in Tables CZB and CZC. 



Table CZA . Probe Name Ag3499 



Primers 


Sequences 


Length 


Start 
Position 


SEQID 
NO: 


[Forward 


5 ' -cttcccctctgctgcttactat-3 1 


22 


1298 


716 


jp , !tet-5 1 -ttttattatcttcaccggcttcctca-3 ' - 

in [TAMRA 


26 


1334 


717 


Re^ erf e 5 aaggtaaaqg j qqf a 3' 


22 j 1361 


718 



1 0 Table CZB . CNS_neurodegeneration_v 1 .0 



Tiss ue Name 


Rel.Exp.(%)Ag3499,| 

Ru.»2lO«>35S(,4 l,SSU£ [>am€ 


Rel. Exp.(%) Ag3499, 
Run 


AD 1 Hippo 


- s Control (Path) 3 
[Temporal Ctx 


4.9 


AD 2 Hippo 


25 9 Control (Path) 4 
[Temporal Ctx 


32.3 


AD 3 Hippo 


0 AD 1 Occipital 
58 tctx 


11.6 


AD 4 Hippo 


j 5 AD 2 Occipital 
fCtx (Missing) 


0.0 


AD 5 hippo 


100.0 


AD 3 Occipital 
Ctx 


5.1 


AD 6 Hippo 


29.7 


AD 4 Occipital 

Ctx 


4.5 


Control 2 Hippo 


31.9 


AD 5 Occipital 
Ctx 


39.2 


Control 4 Hippo 


q q AD 6 Occipital 

sCtx 


72.2 


Control (Path) 3 
Hippo 


, , Control 1 Occipital 

66 |ctx 


13.5 
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AD 1 Temporal Ctx 


7.0 




Control 2 Occipital 
Ctx 


82.4 


AD 2 Temporal Ctx 


49.3 


(Control 3 Occipital 

jCtx 


8.8 


AD 3 Temporal Ctx 


2.9 






Control 4 Occipital 
Ctx 


0.0 


AD 4 Temporal Ctx 


5.6 


[Control (Path) 1 
Occipital Ctx 


64.6 


AD 5 In f Temporal 
Ctx 


83.5 


Control (Path) 2 
(Occipital Ctx 


4.0 


AD 5 SupTemporal 
Ctx 


37.6 


[Control (Path) 3 
(Occipital Ctx 


4.0 


AD 6 In f Temporal 
Ctx_ 


23.0 


(Control (Path) 4 
JOccipital Ctx 


_____ 
13.2 


AD 6 Sup Temporal 
Ctx 


24.8 




Control 1 Parietal 
Ctx 


9.9 


Control 1 Temporal 
Ctx 


11.0 




Control 2 Parietal 

Ctx 


33.2 


Control 2 Temporal 
Ctx 


50.7 


IControl 3 Parietal 
Ctx 


12.1 


Control 3 Temporal 
Ctx 


6.0 




Control (Path) 1 
Parietal Ctx 


57.8 


Control 4 Temporal 
Ctx 


0.1 


Control (Path) 2 
Parietal Ctx 


25.2 


Control (Path) 1 
Temporal Ctx 


28.9 


Control (Path) 3 
Parietal Ctx 


5.6 


Control (Path) 2 
Temporal Ctx 


13.9 


Control (Path) 4 
Parietal Ctx 


60.3 



Table CZC . Panel 4D 



Tissue Name 


Rel. F.\p.<%> 
Ag3499, Run 
166441940 


Tissue Name 


Rel. Exp.(%) 
Ag3499, Run 
166441940 


Secondary Thl act 


12.1 


HUVEC IL-lbeta 


0. 1 


Secondary I'li2 act 


1 3.2 ■ 


HUVEC I FN gamma 




Secondary Trl act 


3.6 


HUVEC TNF alpha + 
U N namma 


0.1 


Secondary Thl rest 


7.5 


HUVEC TNF alpha I 
IL4 


0.0 


Secondary Th2 rest 


5.3 


HUVEC 11 -1 1 


0.9 


Secondary Trl rest 


6.0 


Lung Microvascular EC 
none 


2.0 


Primary Thl act 


2.4 


Lung Microvascular EC 
TNFalpha + IL-lbeta 


1.2 



1033 



Primary Th2 act 


rj 5 (Microvascular Dermal 
{EC none 


0.1 


Primary Trl act 


in 1 iMicrosvasular Dermal 

(EC TNFalpha + IL-lbeta 


0.1 


Primary Thl rest 


14.9 


Bronchial epithelium 
TNFalpha + IL1 beta 


0.3 


Primary Th2 rest 


5.1 


Small airway epithelium 
none 


0.3 


Primary Trl rest 


5.8 


Small airway epithelium 
TNFalpha + IL-lbeta 


........ 

1.9 


CD45RA CD4 
lymphocyte act 


2.9 


Coronery artery SMC rest 


2.4 


CD45RO CD4 
lymphocyte act 


12.3 


Coronery artery SMC 
TNFalpha + IL-lbeta 


1.3 


CD8 lymphocyte act 


4.1 


Astrocytes rest 


0.3 


Secondary CD8 
lymphocyte rest 


4.3 


Astrocytes TNFalpha + 
IL-lbeta 


0.5 


Secondary CD8 
lymphocyte act 


2.9 


KU-812 (Basophil) rest 


0.0 


CD4 lymphocyte none 


3.2 


KU-812 (Basophil) 
PMA/ionomycin 


0.3 


2ry Thl/Th2/Trl_anti- 

JrTiQS PHI 1 


6.4 


CCD1106 

t keialToo te , none 


1.9 


LAK cells rest 


8.9 


CCD1106 
(Keratinocytes) 
TNFalpha + IL-1 beta 


30.6 


LAK cells IL-2 


14.1 Liver cirrhosis 


19 2 


LAK cells IL-2+IL- 12 


7.1 Lupus kidney 


^ L5 | 


LAK cells IL-2+IFN 
gamma 


8.2 


NCI-H292 none 


0.5 


LAK cells IL-2+ IL-18 


_ ,,, 6 ; 3 


NCI-H292 IL-4 


0.8 


LAK cells 
PMA/ionomycin 


100.0 


NCI-H292 IL-9 


0.2 | 


NK Cells IL-2 rest 


4.8 


NCI-H292 IL-13 


0.3 


Two Way MLR 3 day- 


7.4 


NCI-H292 1FN gamma 


0.2 


Two Weiy MLR 5 day 


7.6 


HPAEC none 


1.7 


Two Way MLR 7 day 


3.8 


HPAEC TNF alpha + IL- 
1 beta 


1.5 


PBMC rest 


0.1 (Lung fibroblast none 


3.2 


PBMC PWM 


10.9 


Lung fibroblast TNF 
alpha + IL-I beta 


2.7 


PBMC PHA-L 


9.2 


Lung fibroblast IL-4 


4.7 


Ramos (B cell) none 


0.3 


Lung fibroblast IL-9 


2.4 


Ramos (B cell) 
ionomycin 


0.2 


Lung fibroblast IL-1 3 


3.0 
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B lymphocytes PWM 


3.8 


iLung fibroblast IFN 
Jgamma 


5.5 




B lymphocytes CD40L 
and IL-4 


2.6 


(Dermal fibroblast 
1CCD1070 rest 


3.7 


EOL-1 dbcAMP 


0.1 


iDermal fibroblast 
iCCT>]C)7() TNF nlnVifl 


15.1 




EOL-1 dbcAMP 


0.6 




2.5 




PMA/ionomycin 


|cCD1070 IL-1 beta 




Dendritic cells none 


30.4 


jDermal fibroblast IFN 
Jgamma 


0.1 


Dendritic cells LPS 


14.2 


Dermal fibroblast IL-4 


0.1 


Dendritic cells anti- 
CD40 


15.1 


jjlBD Colitis 2 


0.0 


Monoc3tes rest 


5.8 


jlBD Crohn's 


1.1 


Monocytes LPS 


12.1 


(Colon 


3.3 


Macrophages rest 


0.3 


jLung 


7.9 


Macrophages LPS 


12.9 


((Thymus 


0.6 


III Vl.( none 




Kidne\ 


9.4 


HUVEC starved 


0.1 | 





CNS_ni2urodegeneration_vl.O Summary: Ag3499 This panel confirms the expression of 
this gene at moderate levels in the brains of an independent group of individuals. However, 
no differential expression of this gene was detected between Alzheimer's diseased 
postmoitem brains and those of non-demented controls in this experiment. Please see Panel 
5 1 .4 for a discussion of the potential utility of this gene in treatment of central nervous 
system disorders. 

Panel 4D Summary: Ag3499 Expression of the CG59625-01 gene is highest in 
PMA/ionomycin-treated lymphokine activated killer (LAK) cells (CT = 24.3). Since these 
cells are involved in tumor immunology and tumor cell clearance, as well as virally and 
1 0 bacterial infected cells, therapeutic modulation of this gene product may alter the functions 
of these cells and lead to improvement in cancer cell killing as well as host immunity to 
microbial and viral infections. 

This gene is also expressed at high levels in stimulated keratinocytes, dendritic 
cells, monocytes and macrophages, suggesting that small molecule therapeutics designed 
1 5 against the CG59625-01 protein could reduce or inhibit inflammation in asthma, 
emphysema, allergy, psoriasis, arthritis, or any other condition in which 
localization/activation of these cell types is important. 
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This gene is also expressed at moderate levels in a number of other cell types of 
significance in the immune response in health and disease. 

DA. CG59887-01 and CG59887-02: Amino Acid/Metabolite Permease 

5 Expression of gene CG598 87-01 and full length clone CG59887-02 was assessed 

using the primer-probe set Ag4715, described in Table DAA. Please note that CG59887-02 
represents a full-length physical clone of the CG59887-02 gene, validating the prediction 
of the gene sequence. 



Table DAA . Probe Name Ag4715 



Primers 


Sequences 


•Length 


Start 
Position 


SEQBD 
NO: 


Forward 


5 ' -cgtgttgtggtggttgtttt-3 ' 


1 20 


1362 


719 


Probe 


TET-5 ' - actgcgcacgcgccttaacaatg- 3 1 - 
TAMRA 


I 23 


1383 


720 


Reverse 


5 1 -ggctagtggtcgagcaattt-3 1 


20 


1426 


721 



10 General_screening_panel_vl.4 Summary: Ag4715 Expression of the CG59887-01 gene 
is low/undetectable in all samples on this panel (CTs>35). (Data not shown.) The amp plot 
indicates that there is a high probability of a probe failure. 

DB. CG59857-01: RHOTEKIN 

Expression of gene CG59857-01 was assessed using the primer-probe set Ag3622, 
1 5 described in Table DBA. Results of the RTQ-PCR runs are shown in Tables DBB, DBC 
and DBD. 



Table DBA . Probe Name Ag3622 



(Primers Sequences 


Length 


Start 
Position 


SEQID 
NO: 


Forward 5 ' -acaccctggaggacctgaatat-3 ' 


22 


84 


722 


L . „ ItET-5 1 -ctctacattcggcagatggcactcag-3 ' - 

jrrooe | TAMRA 


26 


107 


723 


(Reverse js ' -ggatctcatggtctagcttcct-3 ' 


22 


155 


724 



Table DBB . CNS neurodegeneration vl.O 
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Tissue Name 


Rel. Exp.(%) Ag36227| Tissue Name ~~" 
Run 211005293 i issue Name 


Rel. Exp.(%) Ag3622, 
Run 211005293 


AD 1 Hippo 


iControl (Path) 3 
[Temporal Ctx 


3.9 


AD 2 Hippo 


42 9 [Control (Path) 4 
(Temporal Ctx 


9.2 


AD 3 Hippo 


6.9 [AD 1 Occipital Ctx 


10.1 


AD 4 Hippo 


13 2 IAD 2 Occipital Ctx 
'(Missing) 


0.0 


AD 5 Hippo 


40.6 AD 3 Occipital Ctx 


7.1 


AD 6 Hippo 


38.4 [AD 4 Occipital Ctx 


23.3 


control l oippo 


29.9 


AD 5 Occipital Ctx 




Control 4 Hippo 


13.6 


AD 6 Occipital Ctx 


0.1 


Control (Path) 3 
Hippo 


0.8 


Control 1 Occipital 
Ctx 


4.9 


AD 1 Temporal 
Ctx 


17.8 


Control 2 Occipital 
Ctx 


51.4 


AD 2 Temporal 
Ctx 


27.7 


Control 3 Occipital 
Ctx 


12.2 


AD 3 Temporal 
( t\ 


6.3 


Control 4 Occipital 
i i\ 


8.2 


AD 4 Temporal 
Ctx 


19.5 


Control (Path) 1 
Occipital Ctx 


58.6 


AD 5 Inf Temporal 
Ctx 


100.0 


Control (Path) 2 
Occipital Ctx 


10.2 


AD 5 Sup 
'temporal Ctx 


47.3 


Control (Path) 3 
Occipital Ctx 


5.7 


AD 6 Inf Temporal 
Ctx 


4Q . (Control (Path) 4 
Occipital Ctx 


7.9 


|AD 6 Sup 
1 emporal ( 'tx 


2g j iControl 1 Parietal 
[Ctx 


7.6 


Control 1 
Temporal Ctx 


c , Control 2 Parietal 
56 Ctx 


44.4 


Control 2 
Temporal Ctx 


~. q iControl 3 Parietal 
i4 - y JCtx 


16.2 

- _ ,™ _™_ 


Control 3 
Temporal Ctx 


13 7 IControl (Path) 1 
(Parietal Ctx 


32.1 


Control 3 
Temporal Ctx 


8.2 


Control (Path) 2 
Parietal Ctx 


15.6 


Control (Path) 1 
Temporal Ctx 


18.8 


Control (Path) 3 
Parietal Ctx 


2.4 


Control (Path) 2 
Tempor al Ctx 


16.0 


Control (Path) 4 
Parietal Ctx 


17.6 



Table E>BC . General_screening_panel_vl.4 
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Tissue Name 


Rel. Exp.(%) 
Ag3622, Run 
218211380 


Rel. Exp.(%) 
Ag3622, Run 
218307304 


Tissue Name 


Rel. Exp.(%) 
Ag3622, Run 
218211380 


Rel. Exp.(%) 
Ag3622, Run 
218307304 


Adipose 


0.7 1.1 lRenalca.TK-10 


4.7 


' 4.0 


Melanoma* 
Hs688(A).T 


4.6 7.6 (Bladder 


10.2 


3.5 


Melanoma* 
Hs688(B).T 


7.2 


, . jGastric ca. (liver 
jmet.) NCI-N87 


6.0 


9.0 


Melanoma* 
M14 


29.9 


37.9 


Gastric ca. 
KATO III 


7.8 


7.7 


Melanoma* 
LOXIMVI 


15.1 


17.2 


( ol >n ca. SW- 
948 


1.9 


1.3 


Melanoma* 
SK-MEL-5 


19.3 


22.7 


Colon ca. 
SW480 


11.0 


15.1 


Squamous 
cell 

carcinoma 
SCC-4 


0.4 


1.5 


Colon ca.* 
(SW480 met) 
SW620 


5.9 


10.4 


l e -i is Pool 


2.0 


1.7 


Colon ca. HT29 


5.4 


— — 
6.8 


Prostate ca.* 
(bone met) 
PC-3 


3.3 


3.4 


Colon ca. HCT- 
116 


5.6 


6.7 


Prostate Pool 


0.8 


2.3 


Colon ca. CaCo- 
2 


4.2 


3.2 


Placenta 


2.2 


3.0 


Colon cancer 
tissue 


2.3 


3.5 


Uterus Pool 


0.8 


0.7 


Colon ca. 
SWI 1 16 


1.1 


2.9 


Ovarian ca. 
OVCAR-3 


19.5 


18.7 


Colon ca. Colo- 
205 


1.0 


0.3 


Ovarian ca. 
SK-OV-3 


9.0 


8.0 


Colon ca. SW-48 


1.2 


1.2 


Ovarian ca. 

fiVPAR A 
V L,/\K-4 


1.1 


2.2 


Colon Pool 


16.0 


5.8 


Ovarian ca. 
nvrAB s 

\J V \_//\IV.- J 


6.4 


9.0 


Small Intestine 
Pool 


1.7 


3.3 


Ovarian ca. 
IGROV-1 


10.7 


10.7 Stomach Pool 


2.7 


3.8 


Ovarian ca. 
OVCAR-8 


7.1 


2Q ^ Bone Marrow 
Pool 


1.2 


1.8 


(Ovary 


3.2 


4.3 Fetal Heart 


5.1 


0 5 


i Breast ca. 
MCF-7 


1.1 


1.4 [Heart Pool 


1.5 


3.0 


Breast ca. 

MDA-MB- 

231 


8.8 


i a q Lymph Node 
1U " y pool 


4.9 


6.5 
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Breast ca. BT 
549 


4.7 


8 4 (Fetal Skeletal™ 
Muscle 


69.7 


11.3 


Breast ca. 
T47D 


17.0 


15.2 


Skeletal Muscle 
Pool 


2.7 


1.5 


Breast ca. 

MJJA-JN 


23.8 


21.0 


Spleen Pool 


2.8 


1.8 


Breast Pool 


4.4 


5.0 


Thymus Pool 


2.8 


5.9 


Trachea 


3.8 


5.8 


CNS cancer 
(glio/astro) U87- 
MG 


17.7 


25.0 


Lung 


0.7 


0.8 


CNS cancer 
(glio/astro) U- 
118-MG 


46.0 


57.4 


Fetal Lung 


8.2 


jCNS cancer 
9.3 l(neuro;met) SK- 
N-AS 


26.8 


35.6 


L ung ca. 
NCI-N417 


0.4 


n . ICNS cancer 

j(astro) SF-539 


9.7 


9.8 


Lung ca. LX- 
1 


9.3 


. „ _ |CNS cancer 

((astro) SNB-75 


22.5 


34.4 


Lung ca. 
NCI-H146 


8.4 


, _ CNS cancer 
(glio)SNB-lO 


9.0 


13.0 


Lung ca. 
SHP-77 


7.4 


r . ICNS cancer 
(glio) SF-295 


33.2 


37.1 


Lung ca 
A549 


13.3 


2 2 3 JBrain 

(Anr id -1 Pool 


86.5 


82.9 


Lung ca. 
NCI-H526 


1.3 


1 4 Brain 

l(cerebellum) 


22.5 


30.8 


L uiV2 ca. 
NCI-H23 


5.1 


j — 
6.0 JBrain (fetal) 

.J....... 


5.1 


6.5 


Lung ca. 
NCI-H460 


5.2 


4.9 

_____ 


Brain 

(Hippocampus) 
Pool 


45.4 


52.1 


Lung ca. 
HOP-62 


5.1 


7.4 


Cerebral ( ortex 
Pool 




27.4 


40.3 


Lung ca. 
W1-H522 


6.7 


9.8 


Brain (Substantia 
nigra) Pool 


46.7 


54.7 


Liver 


0.5 


0.9 


Brain 

. Ih ilamus) Pool 


51.4 


90.1 


Fetal Liver 


1.7 


1.5 


Brain (whole) 


23.8 




Liver ca. 
HepG2 


2.9 


4.5 jSpinal Cord Pool 


100.0 


100.0 


Kidney Pool 
Fetal Kidney 


5.3 
7.0 


6.4 lAdrenal Gland 
Pituitary gland 
1 Pool 


1.5 
0.3 


, 3__ J 


Ru\il ca. 




3.0 Salivarv Gland 




K 2 
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786-0 












Renal ca. 
A498 


4.9 


5.6 


Thyroid (female) 


2.8 


2.1 


Renal ca. 
AC I IN 


4.2 


4.0 


Pancreatic ca. 
CARAN2 






Renal ca. 
UO-3 1 


10.4 


7.7 


Pancreas Pool 


- 


6.2 



Table DBD . Panel 4.1D 



Tissue Name 


Rel. Exp.(%) 
Ag3622, Run 
169944131 


Tissue Name 


Rel. Exp.(%) 
Ag3622, Run 
169944131 


Secondary Oil act 


0.0 


HUVLC IL-lbeta 


14.7 


Secondary Th2 act 


2 9 HUVEC 1FN gamma 




Secondary Trl act 

_ 


? 3 HI 1 S, FA INF ilpha 
jlFN gamma 


2.5 


Secondary Thl rest 

. 


IHUVLC INF alpha - 
U - U II 4 


58 i 


Secondary Th2 rest 


0.0 JhUVEC IL-1 1 


19.2 


Secondary Trl rest 




q q jLung Microvascular EC 
jnone 


49.0 




Primary Thl act 


, _ JLung Microvascular EC 
TNI alpha - IL-1 beta 


11.3 


Primary Th2 act 


q q ^Microvascular Dermal 
[EC none 


7.7 


Primary Trl act 


n „ (Microsvasular Dermal 

EC 1 NFalpha i IL-lbeta 


10.9 


Primary Thl rest 


n n jBronchial epithelium 
TNFalpha + ILlbeta 


21.5 


_ 

Primary Th2 rest 


q q jSmall airway epithelium 

jnone 


7.2 


Primary Trl rest 


0.0 


Small airway epithelium 
TNFalpha IT - 1 beta 


27.9 


CD45RA CD4 " 
lymphocyte act 


0.8 


Coronery artery SMC rest 


22.5 


CD45ROCD4 
lymphocyte act 


3.5 


Coronery artery SMC 
TNFalpha + IL-lbeta 


31.4 


CD 8 hrnphocyte act 


1.8 


Astrocytes rest 


17.0 


Secondary CD8 
lymphocyte rest 


1.6 


Astrocytes TNFalpha + 
IL-lbeta 


25.3 


Secondary CD8 
lymphocyte act 


0.2 


KU-812 (Basophil) rest 


8.0 


CD4 lymphocyte none 


0.0 


KU-812 (Basophil) 
PMA/ionomycin 


3.4 
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2ry Thl/Th2/Trl anti- 
CD05 CH11 


4.2 


CCD 1106 

(Keratinocytes) none 


44.8 


LAK cells rest 


0.4 


CCD 1106 

(Keratinocytes) 

I NI alpha + [L-lbeta 


61.6 


LAK cells 11-2 


(.«» 


Liver cirrhosis 


0.0 


LAK cells IL-2+IL-12 


0.0 


NCI-H292 none 


5.3 


LAK cells IL-2+IFN 

igamma 


1.5 


NCI-H292 IL-4 


9.9 


JlAK cells IL-2+ IL-18 


0.0 


NCI-H292 IL-9 


14.4 


LAK cells 
[PMA/ionomycin 


0.0 


NCI-H292 IL-13 


1.4 


NK Cells 1L-2i est 


1 3.9 


NC1-H292 1FN gamma 


11 4 


Two \Yu> MLR > da\ 


3.6 


HPAEC none 


20.9 


Two Way MLR 5 day 


3.8 


HPAEC TNF alpha + IL- 
1 beta 


10.2 


r\vo Way MLR 7 day 


0.0 


Lung fibroblast none 


38.2 


PBMC rest 


1.7 


Lung fibroblast TNF 
alpha + IL-1 beta 


25.0 


PBMC PWM 


3.6 " 


Lung fibroblast IL-4 


35.8 


PRM C 1 I \ I 




Lung fibroblast IL-9 


100.0 


Ramos i B cell ) none 


ol 


Lung fibroblast IL-13 


69.7 


Ramos (B cell) 
ionomycin 


1.1 


Lung fibroblast IFN 
gamma 


58.2 


B lymphocytes PWM 


0.5 


Dermal fibroblast 

PPD1 fl7fl r^ct 


39.2 


B lymphocytes CD40L 
and IL-4 


7.9 


Dermal fibroblast 
CCD1070 TNF alpha 


5.7 


EOL-1 dbcAMP 




0.0 


Dcnrn.3.1 fibroblast 
CCD 1070 II -1 beta 


17.4 


EOL-1 dbcAMP 
PMA/ionomycin 


0.0 


Dermal fibroblast IFN 
gamma 


31.2 


Dendrit c cells none 


9.9 


Dermal fibroblast 11. -4 


44.8 


Dendritic cells LPS 


2 4 


Dermal Fibroblasts rest 


19.1 


Dendritic cells anti- 
CD40 


24.7 


Neutrophils TNFa+LPS ! 


0.0 


Monocytes rest 


0.0 


Neutrophils rest 


0.0 


Monocytes LPS 


63 


Colon 




Mac i op iages lest 


8.8 


Lung 


13.6 


Macrophages LPS 


"3.4 


Thymus 


2.4 


HUVEC none 


err 


Kidne\ 


23.8 


HUVEC starved 


28.5 1 
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CNS_neurodegeneration_vl.O Summary: Ag3622 This panel confirms the expression of 
the CG59857-01 gene at significant levels in the brains of an independent group of 
individuals. However, no differential expression of this gene was detected between 
Alzheimer's diseased postmortem brains and those of non-demented controls in this 
5 experiment. Please see Panel 1 .4 for a discussion of the potential utility of this gene in 
treatment of central nervous system disorders. 

General_screening panel_vl.4 Summary: Ag3622 Two experiments with the same 
probe and primer set show highest expression of the CG59857-01 gene in spinal cord 
samples (CTs=26-28). In addition, high levels of expression of this gene are seen in brain 
derived tissue, including samples from amygdala, hippocampus, substantia nigra, thalamus, 
cerebellum, cerebral cortex, and CNS cancer cell lines. Therefore, expression of this gene 
could be used to distinguish between brain derived samples and other samples used in this 
panel. Furthermore, this gene may play a role in central nervous system disorders such as 
Alzheimer's disease, Parkinson's disease, epilepsy, multiple sclerosis, schizophrenia and 
depression. 

Significant expression is also detected in fetal skeletal muscle (CTs=27-31). 
Interestingly, this gene is expressed at much higher levels in fetal when compared to adult 
skeletal muscle (CTs=32-34). This observation suggests that expression of this gene can be 
used to distinguish fetal from adult skeletal muscle. In addition, the relative overexpression 
of this gene in fetal skeletal muscle suggests that the protein product may enhance 
muscular growth or development in the fetus and thus may also act in a regenerative 
capacity in the adult. Therefore, therapeutic modulation of the protein encoded by this gene 
could be useful in treatment of muscle related diseases. More specifically, treatment of 
weak or dystrophic muscle with the protein encoded by this gene could restore muscle 
mass or function. 

Among tissues with metabolic function, this gene is expressed at moderate to low 
levels in pituitary, adipose, adrenal gland, pancreas, thyroid, and adult and fetal skeletal 
muscle, heart, and liver. This widespread expression among these tissues suggests that this 
gene product may play a role in normal neuroendocrine and metabolic and that disregulated 
30 expression of this gene may contribute to neuroendocrine disorders or metabolic diseases, 
such as obesity and diabetes. 
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Panel 4JLD Summary: Ag3622 Highest expression of the CG59857-01 gene is seen in IL- 
9/IL-13 treated lung fibroblasts (CT=31). In addition, significant expression is seen in 
clusters of treated and untreated lung and dermal fibroblasts, epithelium and endothelium. 
Therefore, modulation of the gene product with a functional therapeutic may lead to the 
5 alteration of functions associated with these cell types and lead to improvement of the 
symptoms of patients suffering from autoimmune and inflammatory diseases such as 
asthma, allergies, and psoriasis. 

DC. CG59855-01 and CG59855-02: ATP SYNTHASE SUBUNIT C 

Expression of gene CG59855-01 and full length clone CG59855-02 was assessed 
10 using the primer-probe set Ag3621, described in Table DC A. Results of the RTQ-PCR runs 
are shown in Tables DCB and DCC. Please note that CG59855-02 represents a full-length 
physical clone of the CG59855-02 gene, validating the prediction of the gene sequence. 



Table DCA . Probe Name Ag3621 



PrimersJ Sequences 


Length 


Start 
Position 


SEQID 1 

NO: | 


Forwardi 5 -gggt taat igg t ,t gt- 


20 




725 ! 


p , TET-5 ' -tqccttctccttgaatagcecaqaga-3 ' - 

1 ,ODC TAM !<! A 


2L 


94 


726 


Reverse |5 ' -ctgctgtaggaaggctgtttag-3 ' 


22 


126 


~~ 727 | 



Table DCB . General_screening_panel_vl.4 



i — n 

Tissue Name 


Rel.Exp.(%) 
Ag3621,Run 
217702346 


Tissue Name 


Rel. Exp.(%) 
Ag3621,Run 
217702346 


Adipose 


0.0 


Renal ca. TK-10 


0.0 


Melanoma* 
Hs688(A).T 


0.0 


Bladder 


7.5 


Melanoma* 
Hs688(B).T 


0.0 


Gastric ca. (liver met.) 
NCI-N87 


0.0 


Melanoma* Ml 4 


0.0 


Gastric ca. KATO III 


0.0 


Melanoma* 
LOX1MVI 


0.0 


Colon ca. SW-948 


0.0 


Melanoma* SK- 
MEL-5 


0.0 


Colon ca. SW480 


0.0 


Squamous cell 
carcinoma SCC-4 


0.0 


Colon ca.* (SW480 
met) SW620 
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Testis Pool 


35.1 Colon ca. HT2'> 


0.0 


Prostate ca.* (bone 
met) PC -3 


0.0 


Colon ca.HCT-1 16 


0.0 


Prostate Pool 


3.5 


Colon ca. CaCo-2 


0.0 


Placenta 


2.9 


Colon cancer tissue 


0.0 


Uterus Pool 


5.9 


Colon ca. SW1116 


0.0 


_ _„.»,,,,„„ 
Ovarian ca. 
OVCAR-3 


0.0 


Colon ca. Colo-205 


0.0 


Ovarian ca. SK- 
IOV-3 


47.6 


Colon ca. SW-48 ! 


0.0 


'Ovarian ca. 

Iaa/p AX) A 


0.0 


Colon Pool 


57.0 


Ovarian ca. 
OVCAK.-5 


0.0 


Small Intestine Pool 


42.9 


Ovarian ca. 
IGROV-1 


0.0 


Stomach Pool 


7.5 


Ovarian ca. 
OVCAR-8 


0.0 


Bone Marrow Pool 


7.0 


Ovary 


12.9 


Fetal Heart 


0.0 


Breast ca. MCF-7 


0.0 


Heart Pool 


0.0 


Breast ca. MDA- 
lMB-231 


0.0 


Lymph Node Pool 


1 

8.1 


fereast ca. BT 549 
jBreast ca. T47D 


0.0 |Fetal Skeletal Muscle 
* 0.0 ^Skeletal Muscle Pool 


13.2 
0.0 


jBreast ca. MDA-N | 0.0 Spleen Pool 


27.4 


Breast Pool 


49.0 


Sis .n us Pool 


25.9 1 


(Trachea 


46.3 


CNS cancer 
(glkvaslro) I 87-V1G 


0.0 


Lung 


38.4 


CNS cancer 
(glio/astro) U-118-MG 


0.0 


jFetal Lung 


100 0 jCNS cancer 

luu u |(neuro;met) SK-N-AS 


0.0 


[Lung ca. NCI-N417 


|CNS cancer (astro) SF- 
J539 


0.0 


I , v , j n n !CNS cancer (astro) 
jLungca.LX-1 J 0.0 JsNB-75 


0.0 


t xt^t u,aA ( \S. iku 
Lung ca. NCI-H 146 0.0 JSNB-19 


j 0.0 


Lung ca. SHP-77 


1 sCNS cancer (glio) SF- 
U -° J295 


0.0 


pLungca. A549 


0.0 [Brain (Amygdala) Poo 


0.0 


Lung ca. NCI-H526 
Lung c£.NCI-H23^ 


0.0 JBrain (cerebellum) 
6.7 Brain (fetal) 


0.0 
0.0 


(Lung ca. NCI-H460 


6.0 jBrain (Hippocampus) 


0 0 
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Pool 




Lung ca. HOP-62 




0.0 


Cerebral Cortex Pool 


0.0 


T nnn i-a "MPT Vf^OJ 

L.Ung Ca. INL-l-rl3Zz 


0.0 


Brain (Substantia 
nigra) Pool 


0 0 


Liver 


0.0 


Brain (Thalamus) Pool 


— 

0.0 


Fetal Liver 


0.0 


Brain (whole) 


0.0 


Liver ca. HepG2 


0.0 


Spinal ( ord Pool 


6.1 


Kidney Pool 


57.8 


Adrenal Gland 


0.0 


Fetal Kidney 


10.7 


Pituitary gland Pool 


12.4 






0.0 


Salivary Gland 


" oT6 " 


[Renal ca. A498 




o!o 


I hyroid ( Vmulc) 


0.0 


i 

jRenal ca. ACHN 




0.0 


Pancreatic ca. 
CAPAN2 


... 


0.0 


Renal ca. UO-31 


0.0 


Pancreas Pool 




63.3 j 


Table DCC. Panel 4. ID 


1 Tissue Name 


Rel. Exp.(%) 
Ag3621, Run 
169944096 


Tissue Name 


Rel. Exp.(%) | 
Ag3621, Run 
|6<).>44<><>(» 


'Secondary Thl act 




1.0 


IHUVEC IL-lbeta 


0.0 


(SecondEiry Th2 act 




0.0 


IHUVEC IFN gamma 




1.0 


r 

Secondary Trl act 


16.5 


[HUVEC TNF alpha + 
IFN gamma 




0.0 


Secondary Thl rest 


7.5 


[HUVEC TNF alpha + 
[IL4 




0.0 


Stv-utiijaj} "I Ii2 test 


2.6 


III VEC II -1 1 




_ — ! 


Secondary Trl rest 


0.0 


Lung Microvascular EC 
none 


1.0 


Primary Thl act 


0.0 


iLung Microvascular EC 
[TNFalpha 4 11,-1 beta 


6.7 


Primary Th2 act 


0.0 


Microvascular Dermal 
|EC none 




0.0 


Primary Trl act 


— - 
0.0 


[Microsvasular Dermal 
EC TNFalpha + IL-lbeta 


0.0 


Primary Thl rest 


0.0 


(Bronchial epithelium 
TNFalpha + lLlbeta 




0.0 


Primary Th2 rest 


1.5 


j Small airway epithelium 
jnone 


0.0 


Primary Trl rest 


0.0 


(Small airway epithelium 
1 NFalpha + II - 1 beta 


0.0 


( 1)4 5 RA CD4 
lymphocyte act 


0.0 


Coronery artery SMC rest 


0.0 


CD45RO CD4 


0.0 


Coroners arleiy SMC 


0.0 
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lymphocyte act 


TNFalpha + IL-lbeta 




CD8 lymphocyte act 0.0 


Astrocytes rest 


1.1 


Secondary CDS 
lymphocyte rest 


0.0 


Astrocytes TNFalpha + 
IL-lbeta 


0.9 


Secondary CD8 
lymphocyte act 


0.0 


KU-812 (Basophil) rest 


0.0 


CD4 lymphocyte none i 


8.6 


KU-812 (Basophil) 
PMA/ionom\cin 


0.0 


2ry Thl/Th2/Trl anti- 
CD95 CH11 


0Q CCD 11 06 

| (Keratinocytes) none | 


0.0 


LAK cells rest 


|CCD1106 ! 
0.0 j(Keratinocytes) 

JTNFalpha + IL-lbeta 


j 

0.0 


LAK cells IL-2 


0.0 sLiver cirrhosis 


0.0 


LAK cells IL-2+IL-12 


0.0 


NCI-H292 none 


4.5 


LAK cells IL-2+IFN 
gamma 


0.0 


NCI-H292 IL-4 


0.0 


LAK cells IL-2+ IL- 1 8 


0.0 


n< -1-11:02 ii -o 


1.5 


LAK cells 
PMA/ion omycin 


2.3 


NCI-H292 IL-13 


0.0 


in jtv ^en t> ii_,-z real 


0.0 


NCI-H292 IFN gamma 


0.0 


Two Way MLR 3 day 


1.3 


HPAEC none 


2.9 


Two Way MLR 5 day 


0.0 


HPAEC TNF alpha + IL- 
1 beta 


2.3 


Two Way MLR 7 day 


0.0 


I Ling iibroblasl n i 


— 5.1 — 


PBMC rest 


1.0 


1 ung fibroblast I NI- 
alpha + IL-1 beta 


0.0 


PBMC PWM 


0.0 'I ungfibrobl isl II -4 


__0.0 


|PBMC PHA-L 


0.0 Lung fibroblast IL-9 


0 0 


iRamos (B cell) none 


0.0 Lung fibroblast IL- 13 


0.0 


(Ramos (B cell) 
ion< unycin 


q q |Lung fibroblast IFN 
jgamma 


0.0 


IB lymphocytes PWM 


„ „ Dermal fibroblast 
CCD 1070 rest 


9.0 


B lymphoevtes CD40L 
(and IL-4 


n n jDermal fibroblast 
°' U ICCD1070TNF alpha 


1.1 


EOL-1 dbcAMP 


0.0 


Dermal fibroblast 
CCD 1070 IL-1 beta 


0.0 


|EOL-l dbcAMP 
jPMA/ionomycin 


0.0 


Dermal fibroblast IFN 
gamma 


0.0 


Dendritic cells none 


2.0 


Dermal fibroblast IL-4 


2.8 


IDendritic cells LPS 


17.6 


Derm;U Fibroblasts rest 


0.0 


jDendritic cells anti- 

}CD40 


2.7 

1 


Neutrophils TNFa+LPS 


0.0 
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Monocytes rest 


100.0 


Neutrophils rest 


:im> 


Monocytes LPS 


6.1 


.Colon 


0.8 


MdvioplkUv rest 


2.8 


Lung 


0.0 


Macrophages LPS 


0.0 


Thymus 


0.0 


HUVEC none 


0.0 


[Kidney 


9.3 


Ml '\ 1 ( sti, \ eJ 


0.0 





CNSneurodegenerationvl.O Summary: Ag3621 Expression of the CG59855-01 gene 
is low/undetectable in all samples on this panel (CTs>35). (Data not shown.) 



Generall_screening_panel_vl.4 Summary: Ag3621 Expression of the CG59855-01 gene 
is restricted to samples from fetal lung and adult pancrease(CTs=34.5-35). Thus, 
5 expression of this gene can be used to distinguish this sample from other samples in the 
panel. 

The CG59855-01 gene encodes a homologue of ATP synthase subunit c, 
mitochondrial precursor. Subunit c is an intrinsic membrane component of ATP synthase, 
and in mammals it is encoded by two expressed nuclear genes, PI and P2. Both genes 

10 encode the same mature c subunit, but the mitochondrial import pre-sequences in the 
precursors of subunit c are different (ref. 1). Each ATP synthase complex has multiple 
copies of subunit C. The mitochondrial ATP synthase uses energy derived from a proton 
gradient to synthesize ATP. The structure of this complex has been referred to as a 
'lollipop,' as the soluble Fl catalytic unit is attached to the mitochondrial inner membrane 

15 via the F0 unit containing subunit c. F0 subunit C transports protons across the 
mitochondrial inner membrane to the Fl-ATPase (ref. 2). 

Subunit C of the Fo region of the ATP synthase complex of the inner mitochondrial 
membrane is found in high concentrations in lysosomes in late infantile neuronal ceroid 
lipofuscinosis (Batten's disease). Kominami et al. (1995, Ref 3) found marked delay of 

20 degradation of subunit C in patient fibroblasts with no significant differences between 
control and patient cells with regard to degradation of cytochrome oxidase subunit IV. 
Furthermore, accumulation of labeled subunit C in the mitochondrial fraction was detected 
before lysosomal appearance of the radiolabeled subunit, suggesting to the authors a 
specific failure in the degradation of subunit C after its normal inclusion in mitochondria 

25 and its consequent accumulation in lysosomes. Jolly (1995, ref 4) reported that subunit C 
represents more than 50% of the accumulated metabolites in the ovine form of the disease 
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and also accumulates significantly in late infantile and juvenile forms of the human disease 
and several other animal forms. The author suggested that the extreme hydrophobicity and 
lipophilicity of subunit C may be in part responsible. 

References: 

5 1 . Dyer MR, Walker JE. (1 993) Sequences of members of the human gene family 

for the c subunit of mitochondrial ATP synthase. Biochem J 293 ( Pt l):51-64 

2. OMIM 603192 

3. Kominami E, Ezaki J, Wolfe LS. (1995) New insight into lysosomal protein 
storage disease: delayed catabolism of ATP synthase subunit c in Batten disease. 

10 Neurochem Res 20(1 1): 1305-9 

4. Jolly RD. (1995) Batten disease (ceroid-lipofuscinosis): the enigma of subunit c 
of mitochondrial ATP synthase accumulation. Neurochem Res 20(1 1): 1301 -4 

Panel 4..1D Summary: Ag3621 Expression of the CG59855-01 gene is exclusively seen 
in resting monocytes (CT=32). Thus, expression of this gene can be used to distinguish this 
15 sample from other samples in the panel. In addition, expression of this gene in monocytes 
suggests a role for the gene product in their function as antigen-presenting cells. This 
suggests that antibodies or small molecule therapeutics that block the function of this 
protein may be useful as anti-inflammatory therapeutics for the treatment of autoimmune 
and inflammatory diseases and for the treatment of immunosupressed individuals. 

20 DD. CG59807-01: Nuclear Hormone Receptor/Zinc Finger 

Expression of gene CG59807-01 was assessed using the primer-probe set Ag3591, 
described in Table DDA. Results of the RTQ-PCR runs are shown in Tables DDB and 
DDC. 

Table PDA . Probe Name Ag3591 



Primers) Sequences 


Length 


Start 
Position 


SEQID 
NO: 


For\\ lid ca gctccacttttc rrj 3' 


21 


1195 


728 


p . JTET-5 1 -cataaaaggacccacacaggagaaaa-3 1 - 
rr0D£ jTAMRA 


26 


1216 


729 
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[Reverse Is •-cttttccacattctctgcattc-3- j 22 ! 1249 , 730 j 

Table DDB . General_screening_panel_vl.4 



Tissue Name 

Adipose 


Rel. Exp.(%) 
Ag3591,Run 
217479278 


Tissue Name 


Rel. Exp.(%) 
Ag3591, Run 
217479278 


Til*) 


Renal ca. TK-10 


22.2 


Melanoma* 
Hs688(A).T 


18.6 


Bladder 


27.5 


Melanoma* 
Hs68N.C) T 


17.6 




Gastric ca. (liver met.) 
NCI-N87 


100.0 




■ ' • n.i MI4 


22.8 


Gastric ca. KATO 111 


63.7 




Melanoma * 
LOXIMVI 


12.5 


Colon ca. SW-948 


4.9 


Melanoma* SK- 
MEL-5 


10.4 


Colon ca. SW480 


41.2 


Squamous cell 
carcinoma SCC-4 


27.9 


Colon ca.* (SW4S0 
met) SW620 


17.1 




Testis Pool 


14.9 


Colcn La IIP" 


19.3 ! 


Prostate ca.* (bone 
met) PC-3 


51.4 


Colon ca. HCT-116 


52.9 


Prostate Pooi 


12.7 


Colon ca. CaCo-2 


13.5 


Placenta 


8.7 


Colon cancer tissue 


7.4 " "~ 


Uterus Pool 


5.0 


Colon ca. SWT 116 


11.3 




Ovarian ca. 
OVCAR-3 


29 9 








Ovarian ca. SK- 

oy-3 


49.7 


Colon ca. SW-48 


3.2 




Ovarian ca. 
i >VCAR-4 


9.1 


Colon Pool 


18.8 


Ovarian ca. 
nvr a p ^ 


14.6 


Small Intestine Pool 


19.2 


jOvarian ca. 
[lGROV-1 


26.2 


Stomach Pool 


15.1 


Ovarian ca. 
OVCAR-8 


14.4 


Bone Marrow Pool 


7.5 




Ovary 


12.7 


Fetal Heart 


_____ 




Breast ca. MCF-7 


9.5 


Heart Pool 


10.9 


Breast ca.MDA- 
(MB-231 


42.0 




Lymph Node Pool 


33.9 


^Breast ca. BT 549 


66.0 


1 cwl Nkek'tnl Muscle 


24.0 




Breast ca. T47D 


40.3 


Skeletal Muscle Pool 






[Breast ca. MDA-N 


13.7 


Spleen Pool 


30.4 
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Breast Pool 


25.5 fThyrnusPool 


30.6 


Trachea 


14.5 


CNS cancer 
(glio/astro) U87-MG 


19.2 


Lung 


14.6 


CNS cancer 
(glio/astro) U-118-MG 


51.1 


Fetal Lung 


72.7 


CNS cancer 
(neuro;met) SK-N-AS 


49.0 


Lung ca. NCI-N417 


2.6 


CNS cancer (astro) SF- 
539 


18.7 


Lung ca. LX-1 


21.2 


CNS cancer (astro) 1 . „ , 
SNB-75 ' 


— „ - — 

Lung ca. NCI-H146 


. jCNS cancer (glio) 
SNB-19 


26.6 


Lung ca. SHP-77 


iCNS cancer (glio) SF- 

33./ | 295 


80.1 


iLung ca. A549 


16.6 JBrain (Amygdala) Pool 


10.6 


Lung ca. NCI-H526 


14.7 Brain (cerebellum) 


65.5 


Lung ca. N< :i-H23 


35.4 (Brain (fetal) 50.0 


Lung ca. NCI-H460 


_ iBrain (Hippocampus) 
ii-U jPool 


15.3 


Lungca i lOP-62 


1 0.4 jCerebral Cortex Pool 


18.6 


iT — « \TPT T TOO 

(Lung ca. JNC1-H5Z2 


g JBrain (Substantia 
nigra) Pool 


19.9 


jLiver 


2.0 jBrain (Thalamus) Pool 


26.6 


Fetal Liver 


21.6 Brain (whole) 


25.0 


Liver ca. 1 IcpG2 


16.4 Spinal Cord Pool 


16.3 


Kidney Pool 


27.7 Adrenal Gland 


25.0 


Fttai KiJnc} 


28.1 Pituitary gland Pool 


8.2 


Renal ca. 78o-0 


25.2 Salivaiy Gland 


7.0 


Renal c.) W«»X 


12.9 Ihyro id (female) 


9.2 


jRenal ca. ACHN 


_ jPancreatic ca. 
1CAPAN2 


12.5 


Renal ..i I 0-31 


32.1 Pancreas Pool 


16.8 



Table DDC . Panel 4. ID 



" 

Tissue Name 


Rel. Exp.(%) 
Ag3591, Run 
169908857 


Tissue Name 


Rel. Exp.(%) 
Ag3591, Run 
169908857 


Secondary Thl act 


38.4 


HUVEC IL-lbeta 


24.7 


Secondary ! h2 act 


37.4 


HUVEC IFN gamma 


23.5 


Secondary Trl act 


48.6 


HUVEC TNF alpha + 
i FN ^amma 


17.3 


St eondary Thl rest 


22.7 


HUVEC TNF alpha I 


20.6 
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1 JIL4 




Secondary Th2 rest 34.9 'HUVECIL-11 


19.6 


Secondary Trl rest 


^ 5 j jLung Microvascular EC 
Inone 


36.6 


I 

Primary Thl act 


. „ _ iLung Microvascular EC 
[TNFalpha + IL-lbeta 


39.5 


Primary Th2 act 


^ ^ (Microvascular Dermal 
jEC none 


22.1 


Primary Trl act 


_ n „ (Microsvasular Dermal 
5U '* !EC TNFalpha + IL- 1 beta 


21.9 


Primary Thl rest 


. jjBronchial epithelium 
TNFalpha + IL1 beta 


35.4 


iPrimarv Th2 rest 


4Q ~ ISmall airway epithelium 
jnone 


14.0 


jPrimary Trl rest 


jSmall airway epithelium 
|TNFalpha + IL-lbeta 


31.0 


|CD45RA CD4 
lymphocyte act 


22.2 iCoronery artery SMC rest 


14.5 

1 


CD45RO CD4 
lymphoey te act 


. Coronery artery SMC 
TOFalpha + IL-lbeta 


12.0 


CD8 lymphocyte act 


39.8 Astrocytes rest 


24.1 


Secondary CD8 
lymphocyte rest 


22.5 | Astrocytes TNFalpha + J ^ 
IL-I beta 


Secondary CD8 
lymphocyte act 


3 1 .0 KU-812 (Basophil) rest 


57.8 


CD4 lymphocyte none 


158 jKU -8 12 (Basophil) 
iPMA/iononiycm 


98.6 


2ry Thl/Th2/Trl anti- 
CD95CH11 


57 8 CCD 1106 

f(Keratinocytes) none 


28.7 


1 

)LAK cells rest 


ICCD1106 
28.9 |(Keratinocytes) 

iTNFalpha + IL- 1 beta 


24.1 


jLAK cells IL-2 


44.4 


Liver cirrhosis 


8.6 


i AK ceils 11.-2+IL-12 


35.4 


NCI-H292 none 


40.1 


jLAK cells IL-2+1FN 
! gamma 


44.8 


NCI-H292 IL-4 


80.7 


jLAK cells IL-2+IL-18 


52.5 


NCI-H292 IL-9 


79.0 


ILAK cells 
PMA/icnomycin 


18.3 


NCI-H292 IL-13 


100.0 


NK Cells 1L-2 rest 


44.1 


NCI-H292 IFN gamma J 100.0 


Two \Yi\ MLR 3 day- 


42.0 


HPAEC none 


21.6 


Two Way MLR 5 day 


25.7 


HPAEC TNF alpha + IL- 
1 beta 


32.8 


Two Way MLR 7 day 


20.4 


Lung fibroblast none 


28.5 


PBMC rest 


1 1.2 


Lung fibroblast TN I 


13.9 
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alpha + IL-1 beta 




PBMC PWM 


21.3 


Lung fibroblast IL-4 


28.1 


PBMC PHA-L 


24.7 


Lung fibroblast IL-9 


49.3 


Ramos (B cell) none 


61.1 


Lung fibroblast IL- 1 3 


37.9 


Ramos (B cell) 
ionomycin 


66.9 


Lung fibroblast IFN 
gamma 


29.5 


B lymphocytes PWM j 24.1 


Dermal fibroblast 

VsV^ULVjIK) Teal 


29.9 


B lymphocytes CD40L 
and II- 


56.6 


Dermal fibroblast 
CCD1070 TNF alpha 


45.1 


EOL-1 dbcAMP 


69.7 


Dermal fibroblast 
CCD 1070 IL-1 beta 


11.0 


EOL-1 dbcAMP 
PMA/ionomycin 


54.0 


Dermal fibroblast IFN 
gamma 


8.7 


Dendritic cells none 


26.6 


Dermal fibroblast IL-4 


29.7 


Dendritic cells LPS 


10.2 


Dermal Fibroblasts rest 


12.2 


Dendritic cells anti- 

CD40 

Monocvtes rest 


23.2 
12.2 


Neutrophils TNFa+LPS 
Neutrophils rest 


3.3 


Monocytes LPS 


16.5 


Colon 


8.8 


Macropiages rest 


18.6 


Lung 


33.1 


Macrophages LPS 


8.1 


Thymus 


82.9 


III \ K n,»ne 


17.4 


Kidney 


23.8 


IHUVEC starved I 27.2 J 





General_screening_panel_vl.4 Summary: Ag3591 Highest expression of the CG59807- 
01 gene is detected in the gastric cancer cell line(CT=28). In addition, high expression of 
this gene is seen in samples derived from CNS cancer, colon cancer, breast cancer, ovarian 
cancer, prostate cancer cell lines (CTs=28-31). Therefore, therapeutic modulation of the 
activity of this gene or its protein product, through the use of small molecule drugs, protein 
therapeutics or antibodies, might be beneficial in the treatment of these cancers. 

In addition, expression of this gene is higher in fetal liver (CT=31) as compared to 
the corresponding adult tissues (CTs=34). Thus, expression of this gene can be used to 
distinguish between the fetal and adults source of this tissue. 

Among tissues with metabolic or endocrine function, this gene is expressed at high 
to moderate levels in pancreas, adipose, adrenal gland, thyroid, pituitary gland, skeletal 
muscle, heart, liver and the gastrointestinal tract. Therefore, therapeutic modulation of the 
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activity of this gene may prove useful in the treatment of endocrine/metabolically related 
diseases, such as obesity and diabetes. 



This gene is also expressed at high levels in all regions of the central nervous 
system examined, including amygdala, hippocampus, substantia nigra, thalamus, 
5 cerebellum, cerebral cortex, and spinal cord. Therefore, this gene may play a role in central 
nervous system disorders such as Alzheimer's disease, Parkinson's disease, epilepsy, 
multiple sclerosis, schizophrenia and depression. 

if 

O Panel 4.1D Summary: Ag3591 Highest expression of the CG59807-01 gene is detected in 

treated mucoepidermoid NCI-H292 cells. In addition, this gene is expressed at high to 
10 moderate levels in a wide range of cell types of significance in the immune response in 
p health and disease. These cells include members of the T-cell, B-cell, endothelial cell, 

macrophage/monocyte, and peripheral blood mononuclear cell family, as well as epithelial 
O and fibroblast cell types from lung and skin, and normal tissues represented by colon, lung, 

Sf thymus and kidney. This ubiquitous pattern of expression suggests that this gene product 

''■■-4 1 5 may be involved in homeostatic processes for these and other cell types and tissues. This 

™" pattern is in agreement with the expression profile in General screening panel vl .5 and 

also suggests a role for the gene product in cell survival and proliferation. Therefore, 
modulation of the gene product with a functional therapeutic may lead to the alteration of 
functions associated with these cell types and lead to improvement of the symptoms of 
20 patients suffering from autoimmune and inflammatory diseases such as asthma, allergies, 
inflammatory bowel disease, lupus erythematosus, psoriasis, rheumatoid arthritis, and 
osteoarthritis. 

DE. CG 59805-01: Nuclear Hormone Receptor/Zinc Finger 

Expression of gene CG59805-01 was assessed using the primer-probe set Ag3590, 
25 described in Table DEA. Results of the RTQ-PCR runs are shown in Tables DEB, DEC 
and DED. 

Table DEA . Probe Name Ag3590 



Primers 


Sequences 


1 

Length 


Start 
Position 


SEQID [ 

NO: 1 


forward 


5 ' -atgagtgcagtgaatgtggaa-3 ' 


-' 


1620 


i 731 


Probe 


TET-5 ' -cttcagtcgcagctcgtccctcact-3 ' - 


25 


1645 


732 ] 
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Table DE B. CNS_neurodegeneration_vl.O 



Tissue Name 


Rel. Exp.(%) Ag3590, 
Run 211006692 


Tissue Name 


Kei. iLxp.( /o ) Agjsyu, 
Run 211006692 


AD 1 Hippo 


16.7 


Control (Path) 3 
Temporal Ctx 


5.8 


AD 2 Hippo 


22 l (Control (Path) 4 
Temporal Ctx 


42.6 


AD 3 Hippo 


8.2 AD 1 Occipital Ctx 


17.7 


AD 4 Hippo 


7.9 


AD 2 Occipital Ctx 
(Missing) 


0.0 


AD 5 Hippo 


73.2 


AD 3 Occipital Ctx 


3.9 


AD 6 Hippo 


87.7 


AD 4 Occipital Ctx 


23.3 


Control 2 Hippo 


25.7 


\D 5 Occipital Ctx 


32.5 


Control 4 Hippo 


17.0 A\) o (\upital Ci\ 


31.2 


Control (Path) 3 
Hippo 


. (Control 1 Occipital 
jCtx 


5.2 


AD 1 Temporal 
Ctx 


_ . n jControl 2 Occipital 

252 JCtx 


39.2 


AD 2 Temporal 
Ctx 


[Control 3 Occipital 
i4 - 2 Ctx 


20.0 


AD 3 Temporal 
( tx 


iControl 4 Occipital 
1U1 fCtx 


8.1 


AD 4 Temporal 
Ctx 


Control (Path) 1 
Occipital Ctx 


80 7 


AD 5 Inf Temporal 
Ctx 


Control (Path) 2 
[Occipital Ctx 


11.1 


AD 5 Sup 
? imp >ral Ctx 


Control (Path) 3 
6 * jOccipital Ctx 


1 

6 1 1 

■ — 1 


AD 6 Inf Temporal 
Ctx 


Control (Path) 4 
y (Occipital Ctx 


17.9 


AD 6 Sup 
Temporal Ctx 


■Control 1 Parietal 
1000 [Ctx 


8.8 


Control 1 
Temporal Ctx 


_ 'Control 2 Parietal 
8 - 2 jCtx 


39.0 


Control 2 
Temporal Ctx 


Q jControl 3 Parietal 


18.3 


Control 3 
Temporal Ctx 


jControl (Path) 1 
(Parietal Ctx 


55.5 


Control 3 
Temporal Ctx 


Control (Path) 2 
lU - Z [Parietal Ctx 


26.8 


Control (Path) 1 


50.7 Control (Path) 3 


1 7-A 
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Tempore 




'Parietal Ctx 




t "tuilinl I l\ulil 2 
Temporal Ctx 


34.6 


Control (Path) 4 
Parietal Ctx 


46.0 


Table DEC. General screening panel vl.4 


Tissue Name 


Rel. Exp.(%) 
Ag3590, Run 
217474417 


Tissue Name 


Rel. Exp.(%) 
Ag3590, Run 
217474417 


Adipose 


12.8 


Renal ea. Ik-!l» 


28.9 


Melanoma* 
Hs688(A).T 


33.7 


Bladder 


28.5 


Melanoma* 
IIs688(l_i).T 


25.3 


Gastric ca. (liver met.) 
NCI-N87 


82.4 


Melanoma* Ml 4 


16.3 


Gastric ca. KAT< > 111 


24.5 


Melanoma* 
LOXIMVI 


19.6 


Colon ca. SW-948 


5.8 


Melanoma* SK- 
MEL-5 


.6.5 


Colon ca. SW480 


28.1 


Squamous cell 
carcinoma SCC-4 


29.5 


Colon ca.* (SW480 
met) Stt 62.1 


10.7 


Testis Pool 


16.0 


( (Aon ia 111 


13.6 


Prostate ca.* (bone 
met) PC-3 


57.4 


Colon ca. HCT-116 


26.2 


Prostate Po< 1 


14.6 


< olon ca. ( !aCo-2 


28.5 


Placenta. 


7.7 


Colon cancer tissue 


13.0 


Uterus r'ool 


6.0 


Colon ca. SW1I 16 


4.8 


Ovarian ca. 
OVCAR-3 


18.9 


Colon ca. Colo-205 


I 

3.4 


Ovarian ca. SK- 
OV-3 


38.2 


Colon ca. SW-48 


2.3 


Ovarian ca. 
OVCAR-4 


15.4 


Colon Pool 


27.0 


Ovarian ca. 
OVCAR-5 


17.4 


Small Intestine Pool 


24.1 


(Ovarian ca. 
IGROV-1 


12.2 


Stomach Pool 


14.3 


jOvarian ca. 
jOVCAR-8 


12.5 


Bone Marrow Pool 


11.7 


lOvary 


14.2 


Fetal Heart 


21.8 


Breast co MCT-7 


29.1 


Heart Pool 


10.7 


iBreast ca. MDA- 
JMB-231 


| 28.3 


Lymph Node Pool 


29.5 


Breast ca. BT 549 


100.0 


Fetal Skeletal Muscle 


10.7 
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Breast ca. T47D j 


34.4 


Skeletal Muscle Pool 


14.7 


Breast ca. MDA-N 1 


12.5 


Spleen Pool 


26.2 


lireast Pool 


40.6 Thymus Pool 


26.8 


Trachea 




CNS cancer 
(glio/astro) U87-MG 


32.3 


i - 

Lung 


4 6 


CNS cancer 
(glio/astro) U-118-MG 


57.0 


Fetal Lung 


66.4 


CNS cancer 
(neuro;met) SK-N-AS 


33.2 


Lung ca. NCI-N417 


2.0 


CNS cancer (astro) SF- 
539 


14.4 


(Lung ca. LX-1 


15.9 


CNS cancer (astro) 
SNB-75 


46.7 


Lung ca. NCI-H146 


3.3 


CNS cancer (glio) 
SNB-19 


14.9 


Lung ca. SHP-77 


24.5 




CNS cancer (glio) SF- 
295 


82.9 


Lung ca. A549 


I 1 .L 


Brain (Amygdala) Pool 


7.2 


Lung ca.. NCI-H526 


5 4 




13.2 


Lung ca. NCI-H23 


26.4 


Brain (fetal) 


26.8 


Lung ca. NCI-H460 


22.5 


Pool ^ HPP 


10.3 


Lung ca. HOP-62 


10.6 


Cerebral Cortex Pool 


14.6 


.Lung ca. JNCi-rijzz 


18.9 


Brain (Substantia 
nigra) Pool 


13.9 


Liver 


1.6 


Brain (Thalamus) Pool 


18.2 


Fetal Liver 


26 i 


Brain (whole) 


16.5 


Liver ca. HepG2 


19.8 


Spinal Cord Pool 


9.1 


Kidney Pool 


43.5 


Adrenal Gland 


30.8 


Fetal Kidney 


""' 26.2 


Pituitary gland Pool 


7.3 


RcruKa 786-0 


1 26.1 


Salivary Gland 




Renal ca. A498 


19.8 


Thyroid (female) 




Renal ca. ACHN 


i 

| 14.6 


Pancreatic ca. 
CAPAN2 


13.3 


Renal ca. 1 IO-3 1 


27.0 


Punueas Pool 


; 25 2 



Table DEP . Panel 4. ID 



Tissue Name 


Rel. Exp.(%) 
Ag3590, Run 
169908854 


I Tissue Name 

i 


Rel. Exp.(%) 
Ag3590, Run 
16990SS54 


Secondary Th 1 act 


33.9 


HUVEC IL-lbeta 


31.0 


Secondary Th2 act 


42.6 


1HUVEC IFN gamma 


30.o 


Secondary Trl act 


52.1 


IHUVEC TNF alpha -- 


18.9 
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jlFN gamma 




Secondary Thl rest 


20.3 


HUVEC TNF alpha + | 
IL4 j 


18.2 


Secondary Th2 rest 


28.1 


HUVEC IL- 11 


17.4 


Secondary Trl rest 


26.6 


Lung Microvascular EC j 
none 


39.5 


Primary Thl act 


44.1 


Lung Microvascular EC 
TNFalpha + IL-1 beta 


54.0 


Primary Th2 act 


38.7 


Microvascular Dermal 
EC none 


25.9 


Primary Trl act 


45.7 


Microsvasular Dermal 
EC TNFalpha + IL-lbeta 


36.6 


Primary Thl rest 


23.3 


Bronchial epithelium 
TNFalpha + ILlbeta 


51.1 


Primary Th2 rest 


27.7 


Small airway epithelium 
none 


22.1 


Primary Trl rest 


- n . 1 Small airway epithelium 
jTNFalpha + IL-lbeia 


33.0 


,„, .„ 

CD45RA CD4 
lymphocyte act 


24.8 iCoronery artery SMC rest 


38.4 


CD45RO CD4 
lymphocyte act 


iCoronery artery SMC 
|TNFalpha + IL-lbeta 


29.1 

j 


CDS lynphocyte act 


39.2 Astrocytes rest 


3 1.9 


Secondary CD8 
lymphocyte rest 


Q 1 Astrocytes TNFalpha + 
32 - 8 IlL-lbeta 


26.2 


Secondary CD8 
lymphocyte act 


19.5 lKU-812 (Basophil) rest 


56.6 


CD4 lymphocyte none 


37.4 


KU-8 12 (Basophil) 
PMA/ionomycin 


100.0 


2ryThl/Th2/Trl_anti- 


39.8 


CCD1106 

(Keratinocytes) none 


29.3 


LAK cells rest 


43.8 


CCD 1106 
(Keratinocytes) 
TNFalpha + IL-lbeta 


34.9 


|LAK cells IL-2 


38.2 jl.iver cirrhosis 


12.9 


LAK cells IL-2+IL-12 


39.5 \C l-1 1292 none 


22.2 


1 \K cells IL-2+IFN 
gamma 


50.3 |NCI-H292 1 1. -4 


38.7 


fLAK cells IL-2+IL-18 


51.1 


)NCI-H292 IL-9 


41.2 


lLAK cells 
PY1 \ ionomycin 


50.0 


NCI-H292 IL-13 


36.1 


|NK Cells IL-2 rest 


41.5 


NCI-H292 IFN gamma 


56.3 


jTwo Way MLR 3 day 


50.7 


HPAEC none 


19.3 


Two Way MLR 5 day 


28.5 


HPAEC TNF alpha + IL- 
1 beta 


50.3 
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Two Way MLR 7 day 


1^.5 Lung fibroblast none 


37.6 


PBMC rest 


Lung fibroblast TN V 
55 ' 1 (alpha + IL-1 beta 


24.0 


jPBMC PWM 


25.7 Lung fibroblast IL-4 


49.3 


PBMC PHA-L 


16.3 Lun« fibroblast IL-9 


56.6 


Ramos < B cell) none 


42.3 iLung fibroblast IL- 1 3 


43.5 


Ramos (B cell) 
ionomycin 


33 4 jLung fibroblast IFN 
jgamma 


42.9 


B lymphocytes PWM 


24.3 


Dermal fibroblast 
CCD 1070 rest 


38.7 

, — _ — .»„ CT ,,„„„— . 


d lyrnpnocyics ^u^kjlj 
and IL-4 


36.9 


Dermal fibroblast 
CCD 1070 TNF alpha 


43.8 


EOL-1 dbcAMP 

EOl -1 < bcAMP 
PMA/ionomycin 


40.3 


Dermal fibroblast 
CCD1070 IL-1 beta 


22.8 
12.3 


37.9 


Dermal fibroblast IFN 
gamma 


Dendritic cells none 


33.7 


Dermal fibroblast 1 L-4 


55.1 


1 )endritic cells 1 PS 


18.9 


Dermal Fibroblasts rest 


23.2 


(Dendritic cells anti- 
[CD40 


26.8 


Neutrophils TNFa+LPS 


1.4 


(Monocytes rest 


33.7 


Neutrophils rest 


16 5 


jMonocytes LPS 


31.4 


Colon 


8.8 


(Macrophages rest 


19.8 


Lung 


41.2 1 


MauiuphaL^. LPS 


6.b 


Thymus 


82.9 


HUVEC none 


16 0 kidne> 


30.8 


HUVEC starved 


20.0 





CNS_neurodegeneration_vl.O Summary: Ag3590 This panel confirms the expression of 
the CG5 9805-01 gene at low levels in the brain in an independent group of individuals. 
However, no differential expression of this gene was detected between Alzheimer's 
diseased postmortem brains and those of non-demented controls in this experiment. Please 
5 see Panel 1 .4 for a discussion of the potential utility of this gene in treatment of central 
nervous system disorders. 

General_screening_panel_vl.4 Summary: Ag3590 Highest expression of the CG59805- 
01 gene is detected in one of the breast cancer cell line BT 549 (CT=26). In addition, 
expression of this gene is high in CNS cancer, gastric cancer, and prostate cancer cell lines. 
10 Therefore, expression of this gene can be used to distinguish these samples from other 
samples in this panel and it can be used as marker for detection of these cancers. 
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Furthermore, therapeutic modulation of the activity of the protein encoded by this gene 
may be beneficial in the treatment of these cancers. 

Among tissues with metabolic or endocrine function, this gene is expressed at high 
to moderate levels in pancreas, adipose, adrenal gland, thyroid, pituitary gland, skeletal 
muscle, heart, liver and the gastrointestinal tract. Therefore, therapeutic modulation of the 
activity of this gene may prove useful in the treatment of endocrine/metabolically related 
diseases, such as obesity and diabetes. 

In addtion, this gene is expressed at high levels in all regions of the central nervous 
system examined, including amygdala, hippocampus, substantia nigra, thalamus, 
cerebellum, cerebral cortex, and spinal cord. Therefore, this gene may play a role in central 
nervous system disorders such as Alzheimer's disease, Parkinson's disease, epilepsy, 
multiple sclerosis, schizophrenia and depression. 

Panel 4.1D Summary: Ag3590 Highest expression of the CG59805-01 gene is detected in 
PMA/ionomycin treated Ku-812 (basophil) cells (CT=29). In addition, this gene is 
expressed at high to moderate levels in a wide range of cell types of significance in the 
immune response in health and disease. These cells include members of the T-cell, B-cell, 
endothelial cell, macrophage/monocyte, and peripheral blood mononuclear cell family, as 
well as epithelial and fibroblast cell types from lung and skin, and normal tissues 
represented by colon, lung, thymus and kidney. This ubiquitous pattern of expression 
suggests that this gene product may be involved in homeostatic processes for these and 
other cell types and tissues. This pattern is in agreement with the expression profile in 
General screen ing_panel_v 1.4 and also suggests a role for the gene product in cell survival 
and proliferation. Therefore, modulation of the gene product with a functional therapeutic 
may lead to the alteration of functions associated with these cell types and lead to 
improvement of the symptoms of patients suffering from autoimmune and inflammatory 
diseases such as asthma, allergies, inflammatory bowel disease, lupus erythematosus, 
psoriasis, rheumatoid arthritis, and osteoarthritis. 

DF. CG59928-01: Novel Universal Stress (USP) Domain Containg Protein 

Expression of gene CG59928-01 was assessed using the primer-probe set Ag3636, 
described in Table DFA. Please note that this sequence is represented by a full length 
clone. 
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Table DF A. Probe Name Ag3636 



Primers 




Sequences 




I Length 


Start 
Position 


" SEQ ID i 
NO: 


Forward 


5 1 


gaagccttcgacaagctgat-3 1 


20 


1268 


734 I 


Probe 


TET-5 ' -atcgatagagcacaggcccacctgtt- 
3 ' -TAMRA 


26 


1301 


r 735 ! 




Reverse 


5 1 


gatgacttcctcggcaaaac-3 1 


| 20 


1332 


736 j 



CNSneurodegenerationvl.O Summary: Ag3636 Expression of the CG59928-01 gene 
is low/undetectable in all samples on this panel (CTs>35). (Data not shown.) The amp plot 
indicates that there is a high probability of a probe failure. 



5 General_screening_panel_vl.4 Summary: Ag3636 Expression of the CG59928-01 gene 
is low/undetectable in all samples on this panel (CTs>35). (Data not shown.) The amp plot 
indicates that there is a high probability of a probe failure. 

Panel 4. ID Summary: Ag3636 Expression of the CG59928-01 gene is low/undetectable 
in all samples on this panel (CTs>35). (Data not shown.) The amp plot indicates that there 
10 is a high probability of a probe failure. 

DG. CG59947-01: VOLTAGE-GATED POTASSIUM CHANNEL PROTEIN 
KV3.3 

Expression of gene CG59947-01 was assessed using the primer-probe set Ag3635, 
described in Table DGA. Results of the RTQ-PCR runs are shown in Tables DGB, DGC, 
15 DGDandDGE. 



Table DGA . Probe Name Ag3635 



i 

jPrimers 


Sequences 


Length 


Start j 
Position 


SEQ ID 
NO: 


'Forward 


5 1 -tacttcaagaacatccccattg-3 ' 


22 


1 326 


737 


Iprobe 

1 


TET-5 ' -ctgtggtcaccatgacgaccctg-3 ' - 
TAMRA 


23 


1360 j 


738 


[Reverse 


5 1 -tcttggggtacatgtctccata-3 1 


22 


1386 


739 



Table DGB . CNS_neurodegeneration_vl.0 



Tissue Name 


Rel. Exp.(%) Ag3635, 
Run 211020704 


Tissue Name 

™, „„w 


Rel. Exp.(%) \g3635. 
Run 211020704 
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AD 1 Hippo 


8.7 


Control (ram ) 5 
Temporal Ctx 


2.8 


AD 2 Hippo 


14.3 


Control (Path) 4 
Temporal Ctx 


23.8 


AD 3 Hippo 


5.1 


AD 1 Occipital 
Ctx 


0.9 


AD 4 Hippo 


4.8 


AD 2 Occipital 
Ctx (Missing) 


0.0 


AD 5 hippo 


100.0 


AD 3 Occipital 
Ctx 


4.2 


AD 6 Hippo 


23.7 


AD 4 Occipital 
Ctx 


13.0 


Control 2 Hippo 


22.4 


AD 5 Occipital 
Ctx 


18.4 


Control 4 Hippo 


4.9 


AD 6 Occipital 
Ctx 


47.3 


Control (Path) 3 
Hippo 


2.7 


Control 1 Occipital 
Ctx 


2.0 


AD 1 Temporal Ctx 


6.8 


Control 2 Occipital 
Ctx 


99.3 


AD 2 Temporal Ctx 


18.3 


Control 3 Occipital 
Ctx 


11.3 


AD 3 Temporal Ctx 


4.6 


Control 4 Occipital 
Ctx 


2.6 


AD 4 Temporal Ctx 


11.5 


Control (Path) 1 
Occipital Ctx 


80.7 


AD 5 Inf Temporal 
Ctx 


68.8 


Control (Path) 2 
( < . ital Ctx 


10.4 


AD 5 SupTemporal 
Ctx 


23.5 


Control (Path) 3 
Occipital Ctx 


2.5 


AD 6 Inf Temporal 

Ctx 


21.6 


Control (Path) 4 
( )ccipitai Ctx 


18.4 


AD 6 Sup Temporal 
Ctx 


31.0 


Control 1 Parietal 
Ctx 


3.9 


Control 1 Temporal 
Ct\ 


3.8 


Control 2 Parietal 
Ctx 


20.2 


Control 2 Temporal 
Ctx 


30.8 


Control 3 Parietal 
Ctx 


20.3 


Control 3 Temporal 
Ctx 


8.3 


v_.oniroi tjr ain ) i 
Parietal ( 'tx 


66.4 


Control 4 Temporal 

Ctx 


4.9 


Control (Path) 2 
Parietal Ctx 


21.2 


Control (Path) 1 
lcinpoi.il Ctx 


46.7 


Control (Path) 3 
Parietal Ctx 


1.8 


Control (Path) 2 
1 emporal Ctx 


16.8 


Control (Path) 4 
Parietal Ctx 


51.4 
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Table DGC . Panel 2.2 



i 

Tissue Name j 


Rel. Exp.(%) 
Ag3635, Run 
173764364 


Tissue Name 


Rel. Exp.(%) 
Ag3635, Run 
173764364 


Normal L^olon 


5.4 


Kidney Margin 
(OD04348) 


100.0 


Colon cancer 
(OD06064) 


5 0 


Kidney malignant 
cancer (OD06204B) 


5.0 


Colon Margin 
(OD06064) 


3.0 


Kidney normal 
adjacent tissue 
(OD06204E) 


19.2 


Colon cancer 
(01)06159) 


1.0 


Kidney Cancer 
K >D04450-01) 


5.7 


Colon Margin 
(OD06159) 


2.8 


Kidney Margin 
(OD04450-03) 


23.3 


Colon cancer 
(OD06297-04) 


2.0 


Kidney Cancer 
8120613 


1.2 


Colon Margin 
<< >D06297-05) 


5.0 


Kidney Margin 
8120614 


23.5 


CC Gr.2 ascend colon 
(OD03921) 


4.4 


Kidney Cancer 
9010320 


3.6 


CC Margin (OD03921) 


jKidney Margin 
|9010321 


12.9 


Colon cancer metastasis 
(OD06104) 


. _ IKidney Cancer 
18120607 


13.3 


Lung Margin 
(OD06104) 


Kidney Margin , 
06 j8120608 J 163 | 


Colon mets to lung 
(OD04451-01 ) 


1.6 [Normal Uterus 


6.3 


:Lung Margin 
(OD04451-02) 


6. 1 luterine Cancer 0640 1 1 

1 __ _ _ 


20.3 

1 - 


Normal Prostate 


8.2 jNormal Thyroid 


6.7 


Prostate Cancer 
(OD04410) 


jThyroid Cancer 
064010 


4.8 


Prostate Margin 
(ODU4410) 


3.4 


Thyroid Cancer 
A302152 


33.7 


Normal Ovary 


5.4 


Thyroid Margin 
A302153 


7.0 


Ovarian cancer 
(01)0628 u>:>) 
Ovarian Margin 
(OD06283-07) 


9.9 


Normal Breast 


23.7 


, JBreast Cancer 
~ V OD04566) 


3.2 


Ovarian C ancer 064008 


8.0 Breast Cancer 1024 


65 3 


Ovarian cancer 


13.1 Breast Cancer 8.7 
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(OD06145) 
Ovarian Margin 
(OD06145) 


(OD04590-01) 
jBreast Cancer Mets 
n J(OD04590-03) 


— - • • 

4.0 


Ovarian cancer 
(OD06455-03) 


jBreast Cancer 
24.7 Metastasis (OD04655- 

|05) 


78.5 
16.3 


1 , 

Ovarian Margin 

(UIJ0O4.53-U/) 


3.5 


Breast Cancer 064006 


Normal Lung 


8.2 


Breast Cancer 01 00266 


5.2 


Invasive poor diff. lung I 
adeno(ODO4945-01 


5.4 


Breast Margin 
9100265 


5.4 


jLung Margin 
(ODO4945-03) 


7.7 


Breast Cancer 
A209073 


2.2 


Lung Malignant Cancer 
(OD03126) 


3.7 


Breast Margin 
A2090734 


20.3 


Lung Margin 
(OD03126) 


3.1 


Breast cancer 
(OD06083) 


9.1 


Lung Cancer 
(OD05014A) 


8.8 

* — ^— 


Breast cancer node 
metastasis (OD06083) 


7.6 


Lung Margin 
(OD05014B) 


12.6 


Normal Liver 


7.4 


Lung cancer 
(0006081) 


,,_,,,..,,,,„„„„„, , ,„ 
11.0 


Liver Cancer 1026 


4.5 


Lung Margin 
(OD06081) 


6.7 


Liver Cancer 1025 


9.7 


Lung Cancer 

(OUU4Zj /-UJ ) 


1.0 


Liver Cancer 6004-T ) 7.4 


Lung Margin 


10.4 


Liver Tissue 6004-N 7.4 


Ocular Melanoma 
Metastasis 


5.1 

„ 


Liver Cancer 6005-T 


5.2 


Ocular Melanoma 
Margin (Liver) 


6.1 


Liver Tissue 6005-N 


8.8 


Melanoma Metastasis 


1.6 


Liver Cancer 064003 


5.9 


Melanoma Margin 
(Lung) 


5.1 


Normal Bladder 


6.1 


Normal Kidne> 


11.3 


Bladder Cancer 1023 


4.6 


Kidney Ca, Nuclear 
grade 2 (OD04338) 


51.8 


Bladder Cancer 
A302173 


7.9 


Kidney Margin 
(OD04338) " 


4.7 


Normal Stomach 


8.6 


IKidney Ca Nuclear 
grade l/2(< >D04339) 


24.5 


Gastric Cancer 
9060397 


1.2 


(Kidney Margin 
[(OD04339) 


21.5 


Stomach Margin 
9060396 


4.3 
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Kidney Ca, Clear cell 
type (OD04340) 


2.3 


Gastric Cancer 
9060395 


1.9 


Kidney Margin 
(OD04340) 


14.8 


Stomach Margin 
9060394 


6.8 


Kidney Ca. Nuclear 
grade 3 (OD04348) 


i.i 


Gastric Cancer 064005 


3.1 



Table DG D. Panel 4. ID 



Tissue Name 


Rel. Exp.(%) 
Ag3635, Run 
169960385 


i 

Tissue Name 


Rel. Exp.(%) 
Ag3635, Run 
169960385 


Secondary Thl act 


20.1 


HUVEC IL-lbeta 


12.5 


Secondary Th2 act 


19.1 


HUVEC I FN gamma 


34.6 


Secondan, Trl act 


12.7 


HUVEC TNF alpha I 
IFN gamma 


3.2 


Secondary Thl rest 


2.0 


HUVEC TNF alpha + 
III 


5.3 


Secondary Th2 rest 


8.2 


nuvEc n - i i 


29.3 


Secondary Trl rest 


4.8 


Lung Microvascular EC 
none 


12.7 


Primary Thl act 


16.0 


Lung Microvascular EC 
I MF alpha - IL-lbeta 


3.0 


Primary Th2 act 


17.8 


Microvascular Dermal 
EC none 


6.0 


Primary Trl act 


13.1 


Microsvasular Dermal 
EC TNFalpha + IL-lbeta 


4.3 


Primary Thl rest 


3.4 


Bronchial epithelium 
TNFalpha + ILlbeta 


12.5 


Primary Th2 rest 


6.3 


Small airway epithelium 
none 


4.5 


Primary Trl rest 


5.4 


Small airway epithelium 
TNFalpha + IL-lbeta 


11.0 


CD45RA CD4 
lymphocyte act 


17.1 


Coronery artery SMC rest 


20.4 


(. D45ROCD4 
lymphocyte act 


44.1 


Coronery artery SMC 
TNFalpha + IL-lbeta 


21.8 


;CD8 lymphocyte act 


23.3 


! Astrocytes rest 


6.4 


Secondary CDS 
lymphocyte rest 


42.9 


Astrocytes TNFalpha + 
IL-lbeta 


9.9 


Secondary CD8 
lymphocyte act 


7.4 


iKU-8 12 (Basophil) rest 


| 0.2 


JcD4 lymphocyte none 


11.5 


KU-8 12 (Basophil) 
jPMA/ionomycin 


0.8 


|2ry Thl/Th2/Trl_anti- 


2.4 


ICCD1106 


11.2 
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pnos phi i 


(Kciatinoeytes) none 




1 

LAK cells rest 


36.3 


JCCD1106 
j(Keratinocytes) 
jTNFalpha + IL-lbeta 


22.8 


LAK cells IL-2 


9.1 


(Liver cirrhosis 


7.4 


LAK cel ls IL-2+IL- 12 


22.8 


1NCI-H292 none 


14.3 


LAK cells IL-2+1 FN 
gamma 


21.8 


|NCI-H292 IL-4 


20.9 


LAK cells IL-2+ IL-18 


21.2 


[NCI-H292 IL-9 


26.1 


LAK cells 

PM A/ionomyc in 


21.0 


(NCI-H292 IL-13 


25.5 


NK Cells IL-2 rest 


8.7 


1NCT-H292 IFN gamma 


13.2 


Two Way MLR 3 day 


24! 


jHPAEC none 


32.5 J 


Two Way MLR 5 day 


18.7 


jHPAEC TNF alpha + IL- 
[l beta 


14.3 


Two Way MLR 7 day 


21.6 


ILung fibroblast none 


1.3 ' ' 


PBMC rest 


23.0 


[Lung fibroblast TNF 
alpha + IL-1 beta 


0.8 


PBMC PWM 


71.2 


Lung fibroblast IL-4 




l>t> \ /f/' 1 nu A T 

rBML rHA-L 


46.7 


Lung tibiohla^t II - u 




Ramos (B cell) none 


1.8 Lung fibroblast 1L- 13 


1.4 


Ramos (B cell) 
ionomyein 


1.5 


Lung fibroblast IFN 
'gamma 


1.2 


B lymphocytes PWM 


49.7 


jDermal fibroblast 
JCCD1G70 rest 


33.7 


|B lymphocytes CD40L 
and IL-4 


68.8 


IDermal fibroblast 
JCCD1070 TNF alpha 


22.5 


EOL-1 dbcAMP 


22.7 


IDermal fibroblast 
CCD 1070 IL-1 beta 


10.1 


E< >L-1 dbcAMP 
PMA/ionomycin 


40.3 


Dermal fibroblast IFN 
I gamma 


6.1 


DviiJiit e cells none 


54.0 


jDermal fibroblast IL-4 




10.4 


Dendril c cells LPS 


73.7 


Dermal Fibroblasts rest 


9.9 


Dendritic cells anti- 
CD40 


100.0 


(Neutrophils TNFa+LPS 


0.8 

\ 


V1onoc\ tes rest 


6676 


Neutrophils rest 




2.8 


IMonocytes LPS 


2 2,1 


.Colon 


~~ "- 7 - 


(Macrophages rest 


So. 5 


jLung 


34.6 


Macrophages LPS 


13.4 


.Thymus 


| 51.1 


HUVEC none 


8.6 


sKidney 


! " 90.8 


Hi'VEC stars ed 


n - 





Table DGE . Panel CNS_1 



Tissue Name 


Rel. Exp.(%) Ag3635, j 
Run 171648701 


Tissue Name 


Rel. Exp.(%) Ag3635, 
Run 171648701 


BA4 Control 


16.7 


BA17PSP 


27.0 


R A A. f/Mitrr\l9 
Jj/Vt v^UlIU U1Z, 


39.2 


BA17 PSP2 


8.9 


BA4 

Alzheimer' s2 


5.0 


Sub Nigra Control 


1.1 


BA4 Parkinson's 


28.5 


Sub Nigra Control2 


16.5 


BA4 

Parkinson's2 


100.0 


Sub Nigra 
Alzhe imer 's2 


4.1 


BA4 

Huntingiton's 


24.7 


Sub Nigra 
Parkinson's2 


26.1 


BA4 

lliinting;on's2 


9.7 


Sub Nigra 
Huntington's 


25.3 


BA4 PS P 


11.7 


Sub Nigra 
Huntington's2 


9.3 


BA4 psp: 


36.6 


Sub Nigra PSP2 


2.4 


BA4 Depression 


14.9 


Sub Nigra 
Depression 


2.0 


BA4 

Depress ion2 


8.8 


Sub Nigra 
Depression2 


4.5 


BA7 Control 


25.3 


Glob Palladus 
Control 


3.3 

_____ 


BA7 Control2 


42.6 


Glob Palladus 
Control2 


4.9 


RA7 

Alzheimer's2 


4.5 


Glob Palladus 
Alzheimer's 


4.5 


BA7 Parkinson's 


10.4 


Glob Palladus 
Alzheimer's2 


1.2 


BA7 

Parkinson's2 


29.5 


Glob Palladus 
Parkinson's 


21.6 


jHuntington's 


25.3 


Parkinson's2 


2.0 


[baT" 

!Huntington's2 


12.9 


Glob Palladus PSP 


r ..a 


BA7 PSP 


_ 306 


Glob Palladus PSP2 


2.1 


BA7 PSP2 


12.1 

— _ 


VJIOD r dlldUUS 

Depression 


1.4 




10.1 


Temp Pole Control 


7.3 


BA9 Ci ntrol 


15.0 


Temp Pole Control2 


27.4 


BA9 Control2 


47.0 


Temp Pole 
Alzheimer's 


4.2 


BA9 Alzheimer's 


4.7 


Temp Pole 
Alzheimer's2 


2.5 


BA9 

Alzheimer's2 


9.2 


Temp Pole 
Parkinson's 


15.1 
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BA9 Parkinson's 


28.9 


Temp Pole 
Parkinson's2 


14.2 


BA9 

Parkinson's2 


34.4 


Temp Pole 
Huntington's 


19.2 


B A 9 

1 luntington's 


24.1 


Temp Pole PSP 


3.4 


Huntingiton's2 


9.5 


Temp Pole PSP2 


3.0 


BA9 PSP 


1Q2 jTempPole 
Depression2 


4.8 


BA9 PSP2 


6.2 


Cing Gyr Control 




BA9 Depression 


5.6 


Cing Gyr Control2 


25.9 


BA9 

Depression2 


8.3 


Cing Gyr 
Alzheimer's 


5.6 


BA17 Control 


33.2 


Cing Gyr 
Alzheimer's2 


6.3 


|BA17Control2 


64.6 


Cing Gyr 
Pari mi son's 


9.1 


BA17 

IAlzheimer's2 


72 jCing Gyr 

]Parkinson's2 


13.1 


BA17 
Parkinson's 


18.3 


Cing Gyr 
Huntington's 


22.1 

— — — — 


BA17 

Parkinson's2 


35.1 


Cing Gyr 
Huntington's2 


6.0 




BA17 

1 [untington's 


32.1 


Cing Gyr PSP 


6, 


BA17 

Huntington's2 


8.8 


Cing Gyr PSP2 


2.6 


BA17 
Depression 


7.3 

L _ _ „ . 


Cing Gyr 
Depression 


2.6 


BA17 J 25 ? 
Depression2 | 


Cing Gyr 
Depression2 


6.1 



CNS_neurodegeneration_vl.O Summary: Ag3635 This panel confirms the expression of 
CG59947-01 gene at low levels in the brain in an independent group of individuals. 
However, no differential expression of this gene was detected between Alzheimer's 
diseased postmortem brains and those of non-demented controls in this experiment. This 
gene encodes a potassium channel protein homolog. The significant levels of expression in 
the brain may indicate a role for this protein in signal processing in the central nervous 
system. 

References: 
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1 . Rudy B, Chow A, Lau D, Amarillo Y, Ozaita A, Saganich M, Moreno H, Nadal 
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E. 
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General_screening_panel_vl.4 Summary: Ag3635 Results from one experiment with 
the CG59947-01 gene are not included. The amp plot indicates that there were 
experimental difficulties with this run. 

Panel 2.2 Summary: Ag3635 Highest expression of the CG59447-01 gene is seen in 
1 0 normal kidney tissue adjacent to a tumor (CT=28). In addition, expression appears to be 
higher in normal kidney tissue than in the adjacent tumor in six out of nine matched pairs. 
Conversely expression appears to be higher in breast cancer than in matched normal breast 
tissue. T hus, expression of this gene could be used to differentiate between these samples 
and other samples on this panel and as a marker for kidney and breast cancers. 
1 5 Furthermore, therapeutic modulation of the expression or function of this protein may be 
effective in the treatment of breast and kidney cancer. 

Panel 4.1D Summary: Ag3635 This gene is expressed at high to moderate levels in a 
wide range of cell types of significance in the immune response in health and disease, with 
highest expression in anti CD40 dendritic cells (CT=28.1). Other cells that express this 
protein include members of the T-cell, B-cell, endothelial cell, macrophage/monocyte, and 
peripheral blood mononuclear cell family, as well as epithelial and fibroblast cell types 
from lung and skin, and normal tissues represented by colon, lung, thymus and kidney. 
This ubiquitous pattern of expression suggests that this gene product may be involved in 
homeostatic processes for these and other cell types and tissues. Therefore, modulation of 
the gene product with a functional therapeutic may lead to the alteration of functions 
associated with these cell types and lead to improvement of the symptoms of patients 
suffering from autoimmune and inflammatory diseases such as asthma, allergies, 
inflammatory bowel disease, lupus erythematosus, psoriasis, rheumatoid arthritis, and 
osteoarthritis. 
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Panel CNS_1 Summary: Ag3635 Expression in this panel confirms expression of the 
CG59947-01 gene in the brain. Please see Panel CNS_neurodegeneration_vl.O for 
discussion of utility of this gene in the central nervous system. 

DH. CG59938-01: arylsulfatase 

Expression of gene CG59938-01 was assessed using the primer-probe set Ag3634, 
described in Table DHA. 



Table DH A. Probe Name Ag3634 



f— — 
Primers 


Sequences 


Length 


Start 
Position 


SEQID 
NO: 


Forward 


5 ' -agccaatgaaagaggagaaagt-3 1 


22 


870 


740 


Probe 


TET-5 ' -cttccctcatgctgaaggaggcactt-3 ' - 
TAMRA 


26- 


894 


741 


Reverse 


5 ' -cccttttgtacctttcaatgaa-3 ' 


22 


923 


742 



CNS_neurodegeneration_vl.O Summary: Ag3634 Expression of the CG55938-01 gene 
is low/undetectable in all samples on this panel (CTs>35). (Data not shown.) 



Generall_screening_panel_vl.4 Summary: Ag3634 Expression of the CG55938-01 gene 
is low/undetectable in all samples on this panel (CTs>35). (Data not shown.) 

Panel 2.2 Summary: Ag3634 Expression of the CG55938-01 gene is low/undetectable in 
all samples on this panel (CTs>35). (Data not shown.) 

Panel 4.1D Summary: Ag3634 Expression of the CG55938-01 gene is low/undetectable 
in all samples on this panel (CTs>35). (Data not shown.) 

DI. CG59746-01: Ubiquitin Carboxyl-terminal Hydrolase 

Expression of gene CG59746-01 was assessed using the primer-probe set Ag3574, 
described in Table DIA. 



Table DIA . Probe Name Ag3574 



Primers 


Sequences 


jLength 


Start 
Position 


SEQID 
NO: 


Forw uru 


5 ' -agcacaacaca.gaaggaaatca-3 ' 




461 


743 


Probe 


TET-5 ' -tcattccacaaagttgatgagaaatca-3 1 - 


i 27 


491 


744 
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TAMRA 






Reverse 


5 ' -gtcccacttccttttgctatct-3 * 


22 


534 i 745 



CNSneurodegenerationvl.O Summary: Ag3574 Expression of the CG59746-01 gene 
is low/undetectable in all samples on this panel (CTs>35). (Data not shown.) 



Generali_screening_panel_vl.4 Summary: Ag3574 Expression of the CG59746-01 gene 
is low/undetectable in all samples on this panel (CTs>35). (Data not shown.) 

Panel 2.2 Summary: Ag3574 Expression of the CG59746-01 gene is low/undetectable in 
all samples on this panel (CTs>35). (Data not shown.) 

Panel 4. ID Summary: Ag3574 Expression of the CG59746-01 gene is low/undetectable 
in all samples on this panel (CTs>35). (Data not shown.) 

Panel CNS_1 Summary: Ag3574 Expression of the CG59746-01 gene is 
low/undetectable in all samples on this panel (CTs>35). (Data not shown.) 

DJ. CG88613-01: INOSITOL 1,4,5-TRISPHOSPHATE 3-KINASE 
ISOENZYME 

Expression of gene CG88613-01 was assessed using the primer-probe set Ag3647, 
described in Table DJA. Results of the RTQ-PCR runs are shown in Tables DJB, DJC and 
DJD. 



Table D JA. Probe Name Ag3647 



jprimers 


Sequences 


Length 


Start 
Position 


SEQID 
NO: 


Forward 


5 1 -actggagc-.j it- jac la - 


21 


1731 


766 j 


jProbe 


TET-5 1 - accaogtcat cctgcaaaagtacgtg- 3 1 - 
TAMRA 


26 


1775 


767 


jReverse 


5 ' -cagagcttcacgaagttcttct-3 ' 


22 


1809 


768 



Table DJB . CNS neurodegeneration v 1 .0 



Tissue Name 


Rcl. F.\p.<%> Ag3647, 
Rim 211019283 


Tissue Name 


Rel. Exp.(%) Ag3647. 
Rim 211019283 


AD 1 Hippo 


44.8 


Control (Path) 3 
Temporal Ctx 


16.3 


AD 2 Hippo 


28.1 


Control (Path) 4 


20.7 
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j Temporal Ctx j 


AD 3 Hippo 


10.2 


AD 1 Occipital 
Ctx 


30.1 


AD 4 Hippo 


10.7 


AD 2 Occipital 
Ctx (Missing) 


0.0 


AD 5 hippo 


98.6 


AD 3 Occipital 

Ctx 


17.1 


AD 6 Hippo 


75.3 


AD 4 Occipital 
Ctx 


17.9 




33.4 


AD 5 Occipital 
Ctx 


39.8 


Control 4- Hippo 


29.1 


AD 6 Occipital 
Ctx 


42.6 


Control (Path) 3 
Hippo 


14.9 


Control 1 Occipital! 
Ctx 


8.0 


AD 1 Temporal Ctx 


19.5 


Control 2 Occipital 
Ctx 


67.4 


AD 2 Temporal Ctx 


29 1 

' 


Control 3 Occipital] 
Ctx 


29.3 


AD 3 Temporal Ctx 


14.2 


Control 4 Occipital 
Ctx 


16.6 




23.8 


Control (Path) 1 
Occipital Ctx 


57.4 


AD 5 Inf Temporal 
Ctx 


100.0 


Control (Path) 2 
( )ccipital Ctx 


12.9 


AD 5 SupTemporal 
Ctx 


69.7 


Control (Path) 3 
Occipital Ctx 


14.6 


AD 6 Inf Temporal 
Ctx 


73.2 


Control (Path) 4 
Occipital Ctx 


12.5 


AD 6 Sup Temporal 
Ctx 


57.8 


Control 1 Parietal 
Ctx 


11.7 


Control 1 Temporal 
Ctx 


17.3 


Control 2 Parietal 
Ctx 


50.0 


Control 2 Temporal 
Ctx 


47.6 


Control 3 Parietal 
Ctx 


39.0 


Control 3 Temporal 
Ctx 


32.5 


Control (Path) 1 
Parietal Ctx 


41.8 


Control 4 Temporal 
Ctx 


20.9 


Control (Path) 2 
Parietal Ctx 


18.4 




Control (Path) 1 
temporal Ctx 


35.6 


Control (Path) 3 
Parietal Ctx 


16.0 

„.,„,,.„.„„„,„.,„„,,„„„,,„ 


Control (Path) 2 
fen i^oral Ctx 


28.3 


Control (Path) 4 
Parietal Ctx 


20.4 
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Tissue Name 


Rel. Exp.(%) 

AgOt>4 /, KUI1 

218342103 


Tissue Name 


Rel. Exp.(%) 
Agoo4 / , Kun 
218342103 


Adipose 




K.enai ca. i js.- i u 






Melanoma* 
Hs688(A).T 


15.7 


Bladder 


24.1 




Melanoma* 

rii>Ooo{E>). l 


20.2 


Gastric ca. (liver met.) 
JNCl-iNo/ 


100.0 




Melanoma* Ml 4 


1 1.3 


Uastnc ca. KA 1 ij Hi 


25.2 


OOT 


Melanoma* 
LOXIMV1 


8.5 


Colon ca. SW-948 


11.2 


Mclcinomci* SK.~ 
MEL-5 


13.0 


Colon ca. SW480 


i 

38.7 




Squamous cell 
carcinoma SCC-4 


35.6 


Colon ca.* (SW480 
met) SW620 


11.7 




Tpctic T>nn1 
1 cSUS r UU1 


6.4 


Colon ca. HT29 


17.2 


0 


Prostate ca.* (bone 
met) PC -3 


11.7 


Colon ca. HCT-116 


24.1 


p 


Prostate Pool 


12.7 


Colon ca. CaCo-2 


78.5 


hi 


Placenta 


31.6 


Colon cancer tissue 


37.1 




1 Hems Pool 


4.5 


Colon ca. SW1116 


8.0 


O 


Ovarian ca. 
OVCAR-3 


21.2 


Colon ca. Colo-205 


4.6 




Ovarian ca. SK- 
OV-3 


- — - 

22.2 


Colon ca. SW-48 


9 ; 3 . 




Ovarian ca. 

OVf 1 ATI' A 
(J V LAK-'l 


22.2 


Colon Pool 


13.5 




Ovarian ca. 
OVCAR-5 


32.5 


Small Intestine Pool 


13.0 




Ovarian ca. 
IGROV1 


32.8 


Stomach Pool 


11.7 




Ovarian ca. 
OVCAR-8 


14.3 


Bone Marrow Pool 


5.7 




Ovary 


7.9 


Fetal Heart 


5.0 




Breast ca. MCF-7 


50.0 


Heart Pool 


6.4 




Breast ca. MDA- 
MB-231 


25.7 


Lymph Node Pool 


11.2 




Breast ca. BT 549 


61.1 


Fetal Skeletal Muscle 


4.4 




Breast ca. T47D 


87.7 


Skeletal Muscle Pool 


11.6 




Breast ca. MDA-N 


7.4 


■ He -li Pool 


6.0 




Breast Pool 


10.6 


Thymus Pool 


8.3 




Trachea 


24.5 


CNS cancer 
(glio/astro) U87-MG 


14.9 




Lung 


3.5 


CNS cancer 
(glio/astro) U-I1S-MG 


16.6 
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Fetal Lung 


jCNS cancer 
|(neuro;met) SK-N-AS 


15.7 


Lung ca. NCI-N417 


_ ]CNS cancer (astro) SF- 
|539 


11.3 


Lungca. LX-1 


16.4 


CNS c&nccr (tistro) 
SNB-75 


34.4 


Lung ca. NCI-H146 


6.0 


CNS cancer (gllo) 
SNB-19 


48.0 


Lung ca. SHP-77 


23.3 


CNS cancer (glio) SF- 
295 


45.4 


Lung ca A549 


25.0 


Brain (Amygdala) Pool 


12.6 


Lung ca. NCI-H526 


6.4 


Brain (cerebellum) 


37.1 


Lung ca. NCI-H23 


10.2 


Brain (fetal) 


13.6 


Lung ca. NCI-H460 


8.7 


Brain (Hippocampus) 


15.3 


Lungca HOP-tO 


11.3 


Cerebral Cortex Pool 


15.6 


Lung ca. NCI-H522 




Brain (Substantia 
nigra) Pool 


27.0 


Liver 


2.5 


Brain { Thalamus) Pool 


17.0 


Fetal Liver 


6.6 


Brain (whole) 


8.9 


Liver ca. HepG2 


16.3 


Spinal Cord Pool 


19.9 


Kidney Pool 


24.0 


Adrenal Gland 


9.9 


Fetal kidnej 


8.0 


Pituitan gland Poo! 


6.3 


Renal ca. 786-0 


11.4 


Salivary Gland 


7.2 


Kci il ca. \ !"N 


21.2 


Thyroid (female) 


12.1 


Renal ca. ACHN 


10.2 


Pancreatic ca. 
CAPAN2 


35.1 


Renal ca. 1 '( »-?l 


18.3 


Pancreas Pool 





Table DJD . Panel 4. ID 



Tissue Name 


Rel. 

Exp.(%) 
Ag3647, 

Run 
169975750 


Rel. 
Exp.(%) 
Ag3647, 

Run 
197444046 


Tissue Name 


Rel. 
Exp.(%) 
Ag3647, 

Run 
169975750 


Rel. 

Exp.(%) 
Ag3647, 

Run 
197444046 


Secondary 1 h 1 act 


77 


7 1 


III ATC IL-lbeta 


20 4 


22.1 


Secondary Th2 act 


10.7 


8.7 


HUVEC IFN 
gamma 


21.5 


21.6 


Secondary Trl act 


9.1 


7.9 


IIUVLCTNF 
alpha + IFN 
gamma 


39.2 


26.2 


Secondary Thl rest 


4.2 


2.7 


IIUVECTNF 
alpha + IL4 


20.9 


16.6 
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Secondary Th2 rest 


6.0 


6.2 


HUVEC II - ' 1 


8.5 ; 


8.8 


Secondary Trl rest 


5.0 


4.0 


Lung 

Microvascular EC! 
none 


29.9 

i 


31.0 I 


Primary Thl act 


8.1 


6.3 


Lung 

Microvascular EC 
TNFalpha + IL- 
lbeta 


37.4 


1 

24.1 | 


Primary Th2 act 


9.0 


g ^ (Microvascular 

Dermal EC none 


17.9 


12.4 


Primary Tr 1 act 


10 1 


jMicrosvasular 
1ft IDermalEC 

TNFalpha + IL- 
Ibeta 


20.0 


15.6 
34.6 




6.0 


Bronchial 
_ _ (epithelium 
3 TNFalpha + 
JILlbeta 


41.5 


Primary Th2 rest 


4.3 


1.1 


Small airway 
epithelium none 


37.4 


33.2 


Primary Trl rest 


7.4 


5.4 


Small airway 
epithelium 
TNFalpha + IL- 
lbeta 


58.2 


43.8 I 


CD45RA CD4 
lymphocyte act 


4.2 4.2 jCoroncry artery ^ 
SMC rest i 


8.0 1 

___J 


CD45RO CD4 
lymphocyte act 


Coronery artery 1 
7.1 | 5.7 (SMC TNFalpha + | 9.7 
1 HL-lbeta J 


7.8 


CD8 lymphocyte 
act 


i » 


4.5 Astrocytes rest 


11.7 


4.2 | 


Secondary CD8 
lymphocyte rest 


9.7 


jAstrocytes 
8.5 (TNFalpha + IL- 
jlbeta 


11.1 


7.2 


Secondary CD8 
lymphocyte act 


6.5 


3 ? |KU-812 

(Rasophil) rest 


10.0 


7.9 


CD4 lymphocyte 
none 


1.8 


1.7 


KU-812 

(Basophil) 

PMA/ionomycin 


14.4 


13.3 


2ry 

Thl/Th2/Trl anti- 
CD95 CH11 


3.1 


2.9 


CCD 1106 

(Keratinocytes) 

none 


42.0 


46.7 


LAK cells rest 


9.9 


5.9 


CCD 1106 
(Keratinocytes) 
TNFalpha + IL- 
lbeta 


100.0 


100.0 


LAK cells IL-2 


5.3 


4.5 


! iver cirrhosis 


14.7 


16.2 
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LAK cells IL- j q? 
2+IL-12 j 


""" f " 1 

3.2 1NCI-H292 none ! 

i 


17.1 


S 

16.2 1 

] 


LAK cells IL- | b{ 
2+IFN gamma ]__ 


3.8 NC1-H2921L-4 


19.1 


25.0 j 


LAK cells IJL-2- , , 
IL-18 j Ui 


5.5 (NCI-H292 IL-9 i 


25.7 


23.8 


LAK cells 
PMA'ionomvcin 


20.9 


16.2 NCI-H292IL-13 


20.6 


18.8 

j 


NK Cells IL-2 rest 


8.2 


12.1 


NCI-H292 IFN 
gamma 


42.6 


28.3 


Two Way MLR 3 
day 


9.2 


11.4 


HPAEC none 


13.6 


11.7 


Two Waj MLR 5 
day 


8.4 


4.9 


HPAEC TNF 
alpha + IL-1 beta 


20.3 


22.1 


Two Way MLR 7 
day 


8.7 


, „ ILung fibroblast j 
none 


16.4 


17.3 


PBMC rest 


3.0 


jLung fibroblast 
2.2 |TNF alpha + IL-1 
jbeta 


11.6 


13.6 


PBMC PWM 


11.0 


7 , Lung fibroblast 
JIL-4 


14.6 


28.3 1 

| 


PBMC PHA-L 


j a r iLung fibroblast 
91 j 46 |lL-9 


16.3 


23.8 


Ramos (B cell) 
none 


5.2 


3.6 


Lung fibroblast 
IL-13 


14.7 


' ! 

12.9 

— 


Ramos (B cell) 
ionomycin 


5.1 


4.9 


Lung fibroblast 
IFN gamma 


22.7 


23.8 


B lymphocytes 
PWM 


4.1 


3.8 


Dermal fibroblast 
CCD 1070 rest 


12.0 


13.6 


B lymphocytes 
CD40L and IL-4 


8.4 


4.9 


Dermal fibroblast 
CCD 1070 TNF 
alpha 


10.9 


14.7 


EOL-I dbcAMP 


7.3 




Dermal fibroblast 
CCD 1070 IL-1 
beta 


10.0 




LOI -1 Jhc 
PMA/ionomycin 


8.4 


7 g iDermal fibroblast 
U N gamma 


9.2 

— — 


7.7 


Dendritic cells 
none 


7.5 


iDermal fibroblast 
8 -° IL-4 


9.7 


7.5 


jDendritic cells LPS 


5.7 


^ g .Dermal 

IFibroblasts rest 


10.1 


8.5 

— " 


Dendritic cells 
.niii-< 1 1" 


7.7 


, Q Neutrophils 
ITNFa+LPS 


11.4 


5.8 


Monocytes rest 


8.5 


7.5 


Mt uhophils i ;st 


3.0 


3.0 " 


Monocytes LPS 


24.8 


25.9 


Colon 


13.0 


9.6 


Macrophages rest 


8.4 


5.6 


Lung 


16.4 


84 
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Macrophages LPS 


12.2 


V> 


|Thymus 


10.9 


8.8 


HUVEC none 


~" 8.7 


12.8 


jKidney 


6.3 




HUVEC starved 


11.8 


11.7 









CNSneurodegenerationvl.O Summary: Ag3647 This panel confirms the expression of 
this gene at moderate levels in the brains of an independent group of individuals. However, 
no differential expression of this gene was detected between Alzheimer's diseased 
postmortem brains and those of non-demented controls in this experiment. Please see Panel 
1.4 for a discussion of the potential utility of this gene in treatment of central nervous 
system disorders. 

General_screening_panel_vl.4 Summary: Ag3647 Expression of the CG88613-01 gene 
is highest in a gastric cancer cell line (CT = 28). Expression of this gene appears to be 
upregulaited in a number of cancer cell lines when compared to normal tissues. Specifically, 
CG886 13-01 gene expression is somewhat higher in breast and ovarian cancers when 
compared to their respective normal tissues. Thus, therapeutic modulation of the activity of 
this gene or its protein product, using small molecule drugs, antibodies or protein 
therapeutics, may be of benefit in the treatment of gastric, breast and ovarian cancer. 

In addition, this gene is expressed at moderate levels in all regions of the central 
nervous system examined, including amygdala, hippocampus, substantia nigra, thalamus, 
cerebellum, cerebral cortex, and spinal cord. The CG88613-01 gene encodes a protein that 
is identical to a protein now known in the public domain as inositol 1,4,5-triphosphate 3- 
kinase C (ref. 1). Inositol 1,4,5-trisphosphate 3-kinase (ITPK) catalyzes the 
phosphorylation of Ins(l,4,5)P3 to Ins(l,4,5)P4, both of which are modulators of calcium 
homeostasis. Calcium is one of the most important intracellular messengers in the brain, 
being essential for neuronal development, synaptic transmission and plasticity, and the 
regulation of various metabolic pathways (ref. 2). Therefore, this gene may play a role in 
central nervous system disorders such as Alzheimer's disease, Parkinson's disease, epilepsy, 
multiple sclerosis, schizophrenia and depression. Furthermore, this gene is also expressed 
in tissue s with metabolic or endocrine function, including pancreas, adipose, adrenal gland, 
thyroid, pituitary gland, skeletal muscle, heart, liver and the gastrointestinal tract. 
Therefore, therapeutic modulation of the activity of this gene may prove useful in the 
treatment of endocrine/metabolically related diseases, such as obesity and diabetes. 

References: 
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1. Dewaste V, Pouillon V, Moreau C, Shears S, Takazawa K, Erneux C. Cloning 
and expression of a cDNA encoding human inositol 1,4,5-trisphosphate 3-kinase C. 
Biochem J 2000 Dec 1;352 Pt 2:343-51 

2. Mattson MP, Chan SL. Dysregulation of cellular calcium homeostasis in 

5 Alzheimer's disease: bad genes and bad habits. J Mol Neurosci 2001 Oct;l 7(2):205-24 

Panel 4. ID Summary: Ag3647 Results from two experiments using the same 
probe/primer set are in excellent agreement. Expression of the CG886 13-01 gene is highest 
in keratinocytes treated with the inflammatory cytokines TNF-a and IL-lb(CT = 29.5). 
Therefore, modulation of the expression or activity of this protein through the application 
10 of small molecule therapeutics may be useful in the treatment of psoriasis and wound 
healing. 

This gene is also expressed at moderate levels in small airway epithelial cells, 
bronchial epithelium, and lung microvascular endothelial cells. Endothelial cells are known 
to play important roles in inflammatory responses by altering the expression of surface 

1 5 proteins that are involved in activation and recruitment of effector inflammatory cells (ref. 
1). Expression in small airway epithelial cells, bronchial epithelium, lung microvascular 
endothelial cells suggests that the protein encoded by this transcript may be involved in 
lung disorders including asthma, allergies, chronic obstructive pulmonary disease, and 
emphysema. This gene is homologoust o PI-3-kinase which is involved in cell survival and 

20 receptor signaling of a number of cells of importance in the immune response in health and 
disease, including lung pathologies. Therefore, Small molecule antagonists of this gene 
product may lead to amelioration of symptoms associated with asthma, allergies, chronic 
obstructive pulmonary disease, and emphysema. 

This gene is expressed at low levels in the remainder of the samples on this panel, 
25 suggesting that the gene product may play an important role in homeostasis of a number of 
cell types. 

References: 

1 . Siddiqui RA, English D. Phosphatidylinositol 3'-kinase-mediated calcium 
mobilization regulates chemotaxis in phosphatidic acid-stimulated human neutrophils. 
30 Biochim Biophys Acta 2000 Jan 3;1483(l):161-73 
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2. Condliffe AM, Cadwallader KA, Walker TR, Rintoul RC, Cowburn AS, 
Chilvers ER. Phosphoinositide 3-kinase: a critical signalling event in pulmonary cells. 
RespirRes 2000;l(l):24-9 

DK. CG59993-01 and CG59993-02: synaptotagmin II 

Expression of gene CG59993-01 and variant CG59993-02 was assessed using the 
primer-probe set Ag3645, described in Table DKA. Results of the RTQ-PCR runs are 
shown in Tables DKB, DKC and DKD. 



Table DKA . Probe Name Ag3645 



j 

Primers Sequences 


Length 


Start 
Position 


SEQID 
NO: 


Forward |5 1 -gaagaagaccctgaacccatac-3 ' 


22 


1056 


746 


, |tet-5 ' -agctttgagatccccttcgagcagat-3 ' - 
Prot>e Jtamra 


26 


1093 


747 


Reverse '5 • - cgaccactacctggactttctg- 3 ' 


22 


1120 


748 



Table DKB . CNS_neurodegeneration_vl.0 



Tissue Name 


Rel. Exp.(%) Ag3645, 
Run 211019104 


Tissue Name 


Rel. Exp.(%) Ag3645, 
Run 211019104 


AD 1 Hippo 


0.4 


Control (Path) 3 
Temporal Ctx 


0.2 


AD 2 Hippo 


0.2 


Control (Path) 4 
Temporal Ctx 


7.2 


AD 3 H ippo 


0.0 


AD 1 Occipital 
Ctx 


8.7 


AD 4 Hippo 


0.1 


AD 2 Occipital 
Ctx (Missing) 


0.0 


AD 5 hippo 


100.0 


AD 3 Occipital 
Ctx 


0.6 


AD 6 Hippo 


1.5 


AD 4 Occipital 

Ctx 


10.7 


Control 2 Hippo 


0.9 


AD 5 Occipital 
Ctx 


19.2 


Control 4 Hippo 


0.1 


AD 6 Occipital 
Ctx 


57.8 


Control (Path) 3 
Hippo 


0.0 


Control 1 Occipital 
Ctx 


0.3 


AD 1 Temporal Ctx 


0.8 


Control 2 Occipital 

Ctx 


81.8 
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AD 2 Temporal Ctx 


2.8 


jControl 3 Occipital 

JCtx 


11.0 


AD 3 Temporal Ctx 


0.3 


Control 4 Occipital 

[Ctx 


0.5 


AD 4 Temporal Ctx 


2.6 


Control (Path) 1 
jOccipital Ctx 


29.1 


AD 5 Inf Temporal 

( IV 


50.0 


{Control (Path) 2 
Occipital Ctx 


8.5 


AD 5 SupTemporal 

Ctx 


0.9 


Control (Path) 3 
Occipital Ctx 


0.7 


AD 6 Inf Temporal 

Ctx 


2.8 


Control (Path) 4 
Occipital Ctx 


12.8 


AD 6 Sup Temporal 
Ctx 


8.2 


Control 1 Parietal 
Ctx 


0.4 


Control 1 Temporal 
Ctx 


0.1 


jControl 2 Parietal 
[Ctx 


2.3 


Control 2 Temporal 

Ctx 


7.2 


Control 3 Parietal 

Ctx 


28.7 


Control 3 Temporal 
Ctx 


1.5 


Control (Path) 1 
Parietal Ctx 


33.2 


Control 4 Temporal 
Ctx 


0.7 


Control (Path) 2 
Parietal Ctx 


21.8 


Control (Path) 1 
1 e mp >ral Ctx 


4.7 


Control (Path) 3 
Parietal Ctx 


0.5 


Control (Path) 2 
Temporal Ctx 


6.8 


Control (Path) 4 
Parietal Ctx 


68.3 




Table DKC. General screening panel vl.4 


. 

Tissue Name 


Rel. Exp.(%) 
Ag3645, Run 
218341901 


Tissue Name 


Rel. Exp.(%) ! 
Ag3645, Run 
218341901 


Adipose 


o.o fi 


Icnalca. TK-10 


0.1 


Melanoma* 
Hs688(A).T 


0.1 I 


Madder 


0.1 


Melanoma* 
Hs688(B).T 


o, < 


jastric ca. (liver met.) 
<ICI-N87 


0.0 


Melanoma* Ml 4 


0.0 ( 


jastric ca. KATO III 


0.0 


Melanoma* 
LOXIMVI 


0.0 ( 


:olon ca. SW-948 


0.0 


(Melanoma* SK- 
MEL-5 


0.4 ( 


:olon ca. SW480 


0.0 

....... .. _ 


Squamous cell 
carcinoma SCC-4 


o.i ; 


:olon ca.* (S W480 
net) SW620 


0.0 


Testis Pool 


1.0 ( 


:olon ca. HT29 


0.0 
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Prostate ca.* (bone 
met) PC-3 


0.0 


Colon ca. HCT-116 


0.0 


Prostate Pool 


0.1 


Colon ca. CaCo-2 


0.6 


Placenta 


0.0 




0.0 


Uterus P'ool 


0.0 


Colon ca. SW1 1 16 


0.0 


Ovarian ca. 
OVCAR-3 


0.2 


Colon ca. Colo-205 


0.0 


Ovarian ca. SK- 
OV-3 


0.0 


Colon ca. SW-48 


0.2 


Ovarian ca. 

OVPAR A 
\J v v^/\r>.~H' 


0.2 


Colon Pool 


0.2 


Ovarian ca. 
OVCAR-5 


0.1 


Small Intestine Pool 


0.1 


Ov3.ri3.11 C3.. 
IGROV-l 


0.0 


Stomach Pool 


0.1 


Ovarian ca. 
OVCAR-8 


0.0 


Bone Marrow Pool 


0.0 


Ovary 


0.7 


Fetal Heart 


0.1 


Breast ca. MCF-7 


0.0 


Heart Pool 


0.0 


Breast ca. MDA- 
MB-231 


1.4 


Lymph Node Pool 


1.0 


Breast ca BI 54 c ) 


0.2 


Fetal Skeletal Muscle 


0.0 


Breast ca. T47D 


0.1 


Skeletal Muscle Pool 


0.0 

— ~— — ■ 


Breast ca. MDA-N 




opieen rooi 




Breast Pool 


0.2 


thymus Pool 




Trachea 


0.2 


CNS cancer 

(glio astro) i 87-MG 


0.0 


Lung 


0.3 


CNS cancer 
(glio/astro) U-118-MG 


0.0 


Fetal Lung 


0.5 


CNS cancer 
fncuro-tnctt SK-N-AS 


0.2 


Lung ca. NCI-N417 


0.5 


CNS cancer (astro) SF- 
539 


0.0 


Lung ca. LX-1 


0.0 


CNS cancer (astro) 
SNB-75 


0.2 


(Lung ca. NCI-H146 


0.1 


CNS cancer (glio) 
SNB-19 


0.0 


Lung ca. SHP-77 


0.3 


CNS cancer (glio) SF- 
295 


0, 


iLung ca. A549 


0.0 


Brain i Vm>gdala) Poul 


5.9 


[Lung ca.NCI-H526 


0.0 


Bram feud -el Sum ( 


[ 100.0 


|Lung ca.NCI-H23 


0.2 


Brain (fetal) 


! 0.9 


ILung ca. NCI-H460 


0.0 


Brain (Hippocampus) 
Pool 


i " 
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D 



Lung ca. HOP-62 


0.0 p:rebral Cortex Pool 


19.3 


Lung ca. IN L^l-rl J LA 

— »,^.^J 


Q Q Brain (Substantia 
Inigra) Pool 


? 

19 5 


Liver 


0.0 Brain (Thalamus) Pool 


14.1 1 


Fetal 1 h er 


0.0 Brain (whole) 


10.0 


Liver ca. HepG2 


0.0 ; Spinal Cord Pool 


15.0 


Kidney Pool 


0.1 


Adrenal Gland 


0.9 


Fetal Kidney 


0.6 


Pituitary gland Pool 


0.0 


Renal ca. 786-0 


0.0 


Salivary Gland 


0.0 


Renal ca. A498 


0.0 


Thyroid (female) 


0.1 


Renal ca. ACHN 


0.0 


Pancreatic ca. 
CAPAN2 


0.0 


Renal ca.UO-31 


0.0 


IIIUl- !•- Pool 


0.8 


Table DKD. Panel 4. ID 


Tissue Name 


Rel. Exp.(%) 
Ag3645, Run 
169975206 


Tissue Name 


Rel. Exp.(%) 
Ag3645, Run 
I09«)7520(> 


Secondary 1 h 1 act 


0.0 HUVEC IL-lbeta 


0.0 


Secondary Th2 act 


0.0 


HUVEC IFN gamma 


0.0 


Secondary Trl act 


0.0 


HUVEC TNF alpha + 
1 FN gamma 


0.0 


Secondary Thl rest 


0.0 


HUVEC TNF alpha i 
IL4 


0.0 


Secondly Th2 rest 


0.0 


HUVEC II -1 1 


0.0 


Secondary Trl rest 


0.0 


Lung Microvascular EC 
none 


0.0 




Primary Thl act 


— — — — — 

0.0 


Lung Microvascular EC 
TNFalpha + IL-lbeta 


0.0 


Primary Th2 act 


0.0 


Microvascular Dermal 
EC none 


0.0 


Primary Trl act 


0.0 


Microsvasular Dermal 
EC l\Falpha + IL-lbeta 


0.0 


Primary Thl rest 


0.0 


Bronchial epithelium 
TNFalpha - ILlbeta 


0.0 


Primary Th2 rest 


0.0 


Small airway epithelium 
none 


0.0 


! Primary Trl rest 


0.0 


Small airway epithelium 
TNFalpha + IL-lbeta 


0.0 


CD45RA CD4 
lymphocyte act 


0.0 


Coronery artery SMC rest 


0.0 


CD45RO CD4 
lymphocyte act 


n n jCoronery artery SMC 
TNFalpha + IL-lbeta 


0.0 
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CDS lymphocyte act 0.0 


Astrocytes rest 0.0 


Secondary CD8 Q0 
lymphocyte rest 


Astrocytes TNFalpha + 
IL-1 beta 


100.0 


Secondary CDS 
Kniphocyte act 


0.0 


KU-812 (Basophil) rest 


0.0 


CD4 lymphocyte none 


0.0 


KU-812 (Basophil) 
PMA/ionomycin 


0.0 


2ry Thl/Th2/Trl_anti- 
CD95 CH11 


0.0 


CCD1106 

(Keratinocytes) none 


0.0 


J^/VFv ceila real 


0.0 


CCD 1106 
(Keratinocytes) 
TNFalpha + IL-1 beta 


0.0 


If A V r*n\ lo TT 1 

i y/\ n. cens ll.-z 


0.0 (Liver cirrhosis 


0.0 


*T A V ^allo TT 1J-TT 10 

LAK cells IL-_;i 1JL- li 


0.0 (NCI-H292 none 


15.0 


|LAK cells IL-2+IFN 
(gamma 


8.3 JNCI-H292 IL-4 0.0 




U.O 


NCI-H202 IL-9 


22.5 


It a v gallic 
iLAJS. ceil IS 

jPMA/ionomycin 


0.0 


NCI-H292 IL-1 3 


0.0 | 


NK Cells IL-2 rest 


0.0 


NCI-H292 IFN gamma 


0.0 




0.0 


HPAEC none 


0.0 


Two Way MLR 5 day 


jHPAEC TNF alpha + IL- 
UU jlbeta 


0.0 


|Two Way MLR 7 day 


0.0 Lung libroblast none 


0.0 


JPBMC rest 


ft „ ILung fibroblast TNF 
jalpha + IL-1 beta 


4.3 


PBMC PWM 


0.0 Lung fibroblast II -4 


o.o 


PBMC PHA-L 


12.9 Lung fibroblast IL-9 


0.0 


Ramos (B cell) none 


0.0 Lung fibroblasl II -13 


36.9 


Ramos (B cell) 
lonomycin 


0.0 


Lung fibroblast IFN 
gamma 


0.0 


B lymphocytes PWM 


0.0 


Dermal fibroblast 
CCD1070 rest 


0.0 


B lymphocytes CD40L 
and IL-4 


0.0 


[Dermal fibroblast 
CCD 1070 TNFalpha 


0.0 


EOL-1 dbcAMP 


iDermal fibroblast 
CCD 1070 11.- 1 beta 


0.0 


EOL-1 dbcAMP 
PMA/ionomycin 


4g6 (Dermal fibroblast IFN j 0Q 
jgamma 1 


Dendritic cells none 


0.0 


Dermal fibroblast IL-4 | 0.0 


Dendritic cells 1 PS 


0.0 


Dermal Fibroblasts rest 


0.0 


Dendritic cells anti- 
CD40 


20.9 


Neutrophils TNFa+LPS 


0.0 


Monocytes rest 


0.0 


Neutrophils rest 


0.0 



1082 



Monocytes L PS 


0.0 


Colon 


25.7 


Macrophages rest 


0.0 


Lung 


28.7 


Macrophages LPS 


0.0 


Thymus 


43.2 


HUVEC none 


0.0 


Kidney 


25.9 


HUVEC starved 


0.0 







CNSneurodegenerationvl.O Summary: Ag3645 While no association between the 
expression of the CG59993-01 gene and the presence of Alzheimer's disease is detected in 
this panel, these results confirm the expression of this gene in areas that degenerate in 
Alzheimer's disease, including the cortex and hippocampus. Synaptotagmin expression is 
5 altered in the brain of Alzheimer's patients, possibly explaining impaired synaptogenesis 
and/or synaptosomal loss secondary to neuronal loss observed in the neurodegenerative 
disorder. It may also represent, reflect or account for the impaired neuronal transmission in 
Alzheim er's disease (AD), caused by deterioration of the exocytic machinery. Since the this 
gene is a homolog of synaptotagmin, agents that potentiate the expression or function of 
1 0 the protein encoded by the this gene may be useful in the treatment of Alzheimer's disease. 

Generall_screening_panel_vl.4 Summary: Ag3645 The CG59993-01 gene is a homolog 
of synaptotagmin, and shows high to moderate expression across all brain regions with 
highest expression in the cerebellum (CT = 26.4) Synaptotagmin is a presynaptic protein 
involved in synaptic vesicle release, making this an ideal drug target for diseases such as 

15 epilepsy, in which reduction of neurotransmission is beneficial. Selective inhibition of this 
gene or its protein product may therefore be useful in the treatment of seizure disorders. 
Furthermore, selective inhibition of neural transmission through antagonism of the protein 
encoded by this gene may show therapeutic benefit in psychiatric diseases where it is 
believed! that inappropriate neural connections have been established, such as schizophrenia 

20 and bipolar disorder. In addition, antibodies against synaptotagmin may cause Lambert- 
Eaton m yasthenic syndrome. Therefore, peptide fragments of the protein encoded by this 
gene may serve to block the action of these antibodies and treat Lambert-Eaton myasthenic 
syndrome. 

Panel 4.1D Summary: Ag3645 Expression of the CG59993-01 gene is restricted to a 
25 sample derived from astrocytes treated with TNF-alpha and IL-1 beta (CT=33.9). This 

expression in samples related to the central nervous system is consistent with results of the 
previous panels and suggests that modulation of this protein could be beneficial in the 
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treatment of CNS disease-associated inflammation or neurodegeneration, including 
mutliple sclerosis. 

DL. CG59991-01: OOPLASM SPECIFIC PROTEIN 

Expression of gene CG5 999 1-01 was assessed using the primer-probe set Ag3644, 
described in Table DLA. Results of the RTQ-PCR runs are shown in Tables DLB and 
DLC. 



Table DL A. Probe Name Ag3644 



Primers 

Forward 


Sequences 


Length 


Start 
Position 


SEQID 
NO: 


5 1 -gggagtaatgcctctcagtga-3 * 


21 


2294 


740 


Probe 


TET-5 ' -cttgagagtctcccagtgcgccct-3 ' - 

TAMRA 


24 


2318 


750 


Reverse 


5 ' -atgccacagtcctccagtatc-3 ' 


21 


2351 


751 



Table DLB . General_screening_panel_v 1 .4 



■ ""■ — " 

Tissue Name 


Rel. Exp.(%) I 
Ag3644, Run 1 Tissue Name 
218306573 


Rel. Exp.(%) | 
Ag3644, Run 
218306573 ) 


Adipose 


0.0 Renal ca. I k - 1 0 


0.0 


Melanoma* 
Hs688(A).T 


0.0 Bladder 


0.0 


Melanoma* 
Hs688(B). 1 


0.0 


Gastric ca. (liver met.) 
NCI-N87 


0.0 


Melanoma* Ml 4 


0.0 


Gastric ca. KATO III 


0.1 


Melanoma* 
LOXIMVI 


0.0 


Colon ca. SW-948 


0.0 


Melanoma* SK- 
MEL-5 


0.0 JColon ca. SW480 


0.0 


Squamous cell 
icarcinoma SCC-4 


|Colon ca.* (SW4X0 
[met) SW620 


0.0 


festis Pool 


0.5 Colon ca. HT2 1 ) 


0.0 


Prostate ca.* (bone 
met) PC-3 


0.0 


Colon ca. HCT-116 


0.1 


Prostate Pool 


0.0 


Colon ca. CaCo-2 


L °-° 


(Placenta 


0.0 


Colon cancer tissue 


0.0 


i Uterus 1*00! 
(Ovarian ca. 
OVCAR-3 


0.0 


Colon ca. SW1116 


s 0.0 


0.0 


Colon ca. Colo-205 


0.0 
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~-4 



Ovarian ca. SK- 
OV-3 


0.0 


Colon ca. SW-48 


0.0 


Ovarian ca. 
OVCAP-4 


0.6 


Colon Pool 


0.0 


OVCAR-5 


0.0 


Small Intestine Pool 


1 

0.0 


Ov&riciri C3.. 
IGROV-l 


0.0 


Stomach Pool 


0.0 


Ovarian ca. 
OVCAR-8 


0.0 


Bone Marrow Pool 


0.0 


Ovary 


0.0 


Fetal Heart 


0.0 | 


Breast ca. MCF-7 


0.0 


Heart Pool 


0.0 J 


Breast ca. MDA- 
MB-231 


0.0 


Lymph Node Pool 


0.0 


Breast ca. BT 549 


0.0 


Fetal Skeletal Muscle 


0.0 


Breast ca. T47D 


0.0 


Skeletal Muscle Pool 


0.0 


DrcdSl La. lVlJ_J/\-rS 


0 0 


Spleen Pool 


0 0 


Breast Pool 


0.0 


Thymus Pool 


00 - _...! 


Trachea 


0.0 


CNS cancer 
(glio/astro) U87-MG 


0.0 


Lung 


0.0 


CNS cancer 
(glio/astro) U-118-MG 


0.0 


Fetal Lung 


0.0 


CNS cancer 
(neuro;met) SK-N-AS 


0.0 


Lung ca.NCI-N417 


0.0 


CNS cancer (astro) SF- 


0.0 


Lung ca. LX-1 


0.0 


SNB-75 


0.0 


Lung ca. NCI-H146 


0.0 


CNS cancer (glio) 
SNB-19 


0.0 


]Lung ca. SHP-77 


100.0 


CNS cancer (glio) SF- 
295 


0.3 


jLung ca. A549 


0.0 


Biain ( A my yd tics) Pool 


________ 


Lung ca. NCI-H526 


0.0 


Brain (ceielvHsnn } 


0.0 


Lung ca. NCI-H23 


0.0 


Brain (fetal) 


0.0 


jLung ca. NCI-H460 


0.0 


Brain (Hippocampus) 
Pool 


0.0 


Lung ca. HOP-62 


0.0 


Cerebral Cortex Pool 


0.0 


Lung ca. NCI-H522 


0.0 


Brain (Substantia 
nigra) Pool 


0.0 


. .. 

Liver 


0.0 


P»i din { Thalamus) Pool 


0.0 


Fetal Liver 


0.0 


Brain (whole) 


0.0 


Liver c'c. HepG2 


0.0 


Spinal Cord Pool 


0.0 


(Kidney Pool 


0.0 


Adrenal Gland 0.0 
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Fetal Kidney 


0.0 


Piuiitary gland Pool 




0.0 


Renal ca. 786-0 1 


0.0 


Salivary Gland 


0.0 


Renal ca. A498 


no 


rhyroid (female) 


0.1 


Renal ca. ACHN | 


0.0 


Pancreatic ca. 
CAPAN2 


0.0 


Renal ci. UO-31 if" 


().() 


'ancn as Pool 


0.1 


Table DLC. Panel 4. ID 


Tissue Name 


Rel. Exp.(%) 
Ag3644, Run 
169975188 


Tissue Name 




Rel. Exp.(%) I 
Ag3644, Run 
169975188 1 


Secondary 1 hi act 


0.0 HUVEC IL-lbeta 


oo 1 


Secondary Th2 act 


0.0 


HUVEC 1FN gamma 




0.0 1 


Secondary Trl act 


0.0 


HUVEC TNF alpha + 
IFN gamma 




0.0 


(Secondary Thl rest 


-— — 

0.0 


HUVEC TNF alpha + 
IL4 




0.0 


Secondary Th2 rest 


0.0 


HUVEC IL-1 1 


0.0 


Secondary Trl rest 


0.0 


Lung Microvascular EC 
none 


0.0 


Primary Thl act 


0.0 


Lung Microvascular EC 
TNFalpha + IL-1 beta 


0.0 


[Primary Th2 act 


0.0 


Microvascular Dermal 
EC none 


0.0 


i 

IPrimary Trl act 


0.0 


Microsvasular Dermal 
EC TNFalpha + IL-lbeta 


0.0 

! 


- - — — — ■ 

j Primary Thl rest 


0.0 


Bronchial epithelium 
TNFalpha i II. Ibeta 


0.0 


IPrimary Th2 rest 


0.0 


Small airway epithelium 
none 


0.0 


|___ 
(Primary Trl rest 


0.0 


Small airway epithelium 
TNFalpha + IL-lbeta 


0.0 


CD45RA CD4 
(lymphocyte act 


0.0 


Coronery artery SMC rest 


0.0 


CD45ROC D4 
l\ mphoeyte act 


0.0 


Coronery artery SMC 
TNFalpha + IL-lbeta 


0.0 

™_„ 4 


( US l>tnpiic«L\tc- act 


0.0 


Astrocytes rest 


0.0 


j Secondary CD8 
j lymphocyte rest 


0.0 


Astrocytes TNFalpha + 
IL-lbeta 


0.0 


[Secondary CD8 
} lymphocyte act 


0.0 


KU-812 (Basophil) rest 


48.3 


jCD4 lymphocyte none 


0.0 


KU-812 (Basophil) 
P,\l -\ '; >nom\cin 




100.0 


|2ry Thl/Th2/Trl_anti- 


0.0 


CCD 11 06 




0.0 
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CD95 CHI 1 




(Keralinocvtes) none 




LAK cells rest 


0.0 


CCD1106 
(Keratinocytes) 
TNFalpha + IL-lbeta 


0.0 


LAK cells IL-2 


0.0 


Liver cirrhosis 


0.0 


LAK cells IL-2+IL-12 | 


0.0 1NCI-H292 none 


0.0 


LAK cells IL-2+1TN 
gamma 


0.0 


NCI-H292 IL-4 


0.0 


LAK cells IL-2+ IL-18 


0.0 


NCI-H292 IL-9 


0.0 


LAK cells 
PMA/ionomycin 


0.0 


NCI-H292IL-13 


o.o 1 


NK Cells IL-2 rest ; 


0 0 


NCI-H292 IFN gamma 


0.0 


Two Way MLR 3 day 


0.0 


HPAEC none 


0.0 


Two Way MLR 5 day 


0.0 


HPAEC TNF alpha + IL- 
1 beta 


0.0 


Two Way MLR 7 day 


0.0 


Lung fibroblast none 


0.0 


PBMC rest 


0.0 


Lung fibroblast TNF 
alpha + IL-1 beta 


0.0 


PBMC PWM 


0.0 


I 1)1% fibroblast IL-i 


0.0 




0.0 


I un r ilh.ohla ,t II 


0.0 


Ram - i M .11) none 


0.0 


■ mij llbroblast IL-I3 


0.0 


Ramos (B cell) 
ionomycin 


0.0 


Lung fibroblast IFN 
gamma 


0.0 


B lymphocytes PWM 


0.0 


Dermal fibroblast 
CCD 1070 rest 


0.0 


id lympnocytes y^u^vL, 
land IL-4 


0.0 


Dermal fibroblast 
CCD 1070 TNF alpha 


0.0 

— — — — — 


p_-_ T - > 

EOL-1 dbcAMP 


0.0 


Dermal fibroblast 
CCD 1070 IL-1 beta 


0.0 


EOL-1 dbcAMP 
PMA ionomycin 


0.0 


Dermal fibroblast IFN 

igamma 


1 

0.0 


Dendritic cells none 


0.0 


Dermal fibroblast IL-4 


"6.0 


Dendrit e cells LPS 


0.0 


Dermal Fibroblasts rest 


CO 


Dendritic cells anti- 
CD40 


0.0 


Neutrophils TNFa+LPS 


j 0.0 

1 


Monocytes rest 


0.0 


Neutrophils rest 


0.0 


jMonocytes LPS 


0.0 


iColon 


0.0 


(Macrophages rest 


0.0 


Lung 


; o.o 


Macrophages LI'S 


0.0 


Thymus 


0.0 


III iVK none 


0.0 


Kidney 


0.0 


1H'\ LC starved 





CNS_neurodegeneration_vl.O Summary: Ag3 644 Expression of the CG59991-01 gene 
is low/undetectable in all samples on this panel (CTs>35). (Data not shown). 
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General_screening_panel_vl.4 Summary: Ag3644 Expression of the CG59991-01 gene 
is restricted to a sample derived from a lung cancer cell line (CT=27.2). Thus, expression 
of this gene could be used to differentiate between this sample and other samples on this 
panel and as a marker to detect the presence of lung cancer. Furthermore, therapeutic 
5 modulation of the expression or function of this gene may be effective in the treatment of 
lung cancer. 

Panel 4. ID Summary: Ag3644 Expression of the CG59991-01 gene is restricted to 
samples derived from the basophil cell line KU-812 (CTs=32). Thus, expression of this 
gene could be used as a marker of this cell type. Basophils release histamines and other 

10 biological modifiers in repose to allergens and play an important role in the pathology of 
asthma and hypersensitivity reactions. Therefore, the specific pattern of expression of this 
gene suggests that therapeutic modulation of the expression or function of the protein 
encoded by this gene may block or inhibit inflammation or tissue damage due to basophil 
activation in response to asthma, allergies, hypersensitivity reactions, psoriasis, and viral 

15 infections. 

DM. CG59987-01 and CG59987-02: RHOPHILIN 

Expression of gene CG59987-01 and full length clone CG59987-02 was assessed 
using the primer-probe set Ag3643, described in Table DMA. Results of the RTQ-PCR 
runs are shown in Tables DMB and DMC. Please note that CG59987-02 represents a full- 
20 length physical clone of the CG59987-01 gene, validating the prediction of the gene 
sequence. 



Table DMA . Probe Name Ag3643 



Primers 


Sequences 


Length 


Start 
Position 


Tsiorol 

NO: ! 


Forward 


5 ' -tactttcggaagggctgtaatc-3 ' 


22 


103 


752 


Probe 


TET-5 ' -cttgcacaaaccggccggagtaaatt-3 ' - 
TAMRA 


26 


127 


| 753 


Reverse 


5 ' -tgattcaaagcagctctttgat-3 1 


22 


158 


1 754 



Table DMB . General_screening_panel_vl.4 



T Rel. Exp.(%) " ** T~ ReI.Exp.(%) 

Tissue Name J Ag3643, Run Tissue Name Ag3643, Run 

j 218306426 218306426 



1088 



wJ 



Adipose 


1.4 


Renal ca. TK- 10 


i7.2 ; 


Melanoma* 

nwoo^n ). l 


0.5 


Bladder 


9.1 


Melanoma* 

njOoo^D }. 1 


0.6 


Gastric ca. (liver met.) 
NCI-N87 


46.7 




6.3 


Gastric ca. KATO III 


39.2 


Melanoma* 

T fYYTMVT 


1.7 


Colon ca. SW-948 


16.6 

, , ^^_J 


Melanoma* SK- 
MEL-5 


8.7 


Colon ca. SW480 


7.0 


Squamous cell 
carcinoma SCC-4 


5.4 


Colon ca.* (SW480 
met) SW620 


0.4 


Testis Pool 


0.5 


Colon ca. HT29 


24.8 | 


Prostate ca.* (bone 
met) PC-3 


7.7 


Colon ca. HCT-116 


26.8 


(Prostate Pool 


6.6 


Colon ca. CaCo-2 


34.2 


(Placenta. i 




Colon cancer tissue 




Ql Fool 


0.4 


Colon ca. SW1116 


4.5 


Ovarian ca. 
OVCAR-3 


44.1 


Colon ca. Colo-205 


5.5 


Ovarian ca. SK- 
OV-3 


36.9 


Colon ca. SW-48 


6.5 


(Ovarian ca. 

\fWfC AD /I 


58.6 


Colon Pool 


0.9 


iOvarian ca. 
OVCAR-5 


50.7 


Small Intestine Pool 


1.9 


(Ovarian ca. 
|lGROV-l 


20.3 


Stomach Pool 


2.3 


Ovarian ca. 
OVCAR-8 


7.9 


Bone Marrow Pool 


0.3 


( )van 


1.6 


Fetal Heart 


0.7 


Breast ca. MCF-7 


17.4 


Heart Pool 


0.4 


Breast ca. MDA- 
MB-231 


13.7 


Lymph Node Pool 


1.0 


Breast ca Bl 54*-) 


8.2 


Fetal Skeletal Muscle 


0. 1 


(Breast ca. T47D 


100.0 


Skeletal Muscle Pool 


0.2 




4.7 




0.2 


Breast Pool 


^ 1.9 


Thymus Pool 


! 1.5 


j 

(Trachea 

1 


8.4 


CNS cancer 
(glio/astro) U87-MG 


1.0 


|Lung 


0.3 


CNS cancer 
(glio/astro) U-118-MG 


1.3 

[ 


Fetal Lung 


6.9 


CNS cancer 
(neuro;met) SK-N-AS 


5.2 



1089 



Lung ca.NCI-N417 


0.9 


CNS cancer (astro) SF- 
539 


2.5 


Lung ca. LX-1 


0.2 


CNS cancer (astro) 
SNB-75 


6.0 


Lung ca. NCI-H146 


1.9 


CNS cancer (glio) 
SNB-19 


21.6 


Lung ca. SHP-77 


0.0 


CNS cancer (glio) SF- i 
295 


14.7 


Lung ca. A549 


54 3 


Brain (Amygdala) Pool- 


1 0 


Lling Ca. iNd-riJZO 


3.7 


Brain (cerebellum) 


5.2 


Lung ca. iNd-Hzj 


3.2 


Brain (fetal) 


0.7 


Lung ca. NCI-H460 


1.2 


Brain (Hippocampus) 
Pool 


1.9 


Lung ca. HOP-62 


6.5 


Cerebral Cortex Pool 


2.3 


Lung ca. NCI-H522 


3.7 


Brain (Substantia 
nigra) Pool 


2.1 


iLiver 


1.5 jBrain (Thalamus) Pool 


2.7 


fFetaT Liver 


5.6 iBrain (whole) 


3.5 


Liver cz. 1 HepG2 


7.0 : Spinal Cord Pool 


3.6 

J 


Kidney Pool 


1.4 Adrenal Gland 


02 


Fetal Kidney 


3.4 


Pituitar\ pi and P> k>1 


1.3 


Renal ca. 786-0 


12.2 


Salisar) Gland 


6.7 


Renal c i \4"8 


8.8 


! hyroid (female) 


^ 1.2 

21.3 


iRenal ca ACHN 


|Pancreatic ca. 
y ' ICAPAN2 


Renal ca. UO-3 1 


9.9 Pancreas Pool 


12.2 


Table DMC. Panel 4. ID 


Tissue Name 


Rel. Exp.(%) | 

Ag3643, Run I Tissue Name 
169975145 


Rel. Exp.(%) I 
Ag3643, Run 
169975145 


Second ir\ 1 M act 


3.8 ;HUVEC IL-lbeta 


6.9 


Secondary Th2 act 


1.1 lll'VECIFN gamma 


4.4 


Secondary Trl act 


HUVECTNF alpha + 
0.9 T __ T 

11 \ gamma 


2.6 


Secondary Thl rest 


III VE( rNF alpha + 

1 1.4 | 


4.0 


Secondary Th2 rest 


0.9 |HUVEC IL-1 1 


2.5 


| Secondary Trl rest 


0 ^ (Lung Microvascular EC 

jnone 


4.3 


Primary Thl act 


iLung Microvascular EC 
L ["NFalpha I IL-lbeta 


- 


Primary Th2 act 


5.0 Microvascular Dermal 
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1 


lEC none 1 | 


Primary Trl act 


SMicrosvasular Dermal 
IEC TNFalpha + IL- 1 beta ] 


77 


Primar) Thl rest 


1.0 


Bronchial epithelium 
TNFalpha - II Ibeta 


17.7 


Primary Th2 rest 


0.8 


Small airway epithelium 
none 


13.7 


Primary Trl rest 


0.9 


Small airway epithelium 
TNFalpha + IL-1 beta 


23.3 


CD45RA CD4 
lymphocyte act 


3.5 


Coronery artery SMC rest 


6.7 


CD45RO CD4 
lymphocyte act 


3.0 


Coronery artery SMC 
TNFalpha + IL-1 beta 


4.3 


CDS lymphocyte act 


6.2 


Astrocytes rest 


35.4 


Secondary CD8 
lymphocyte rest 


3.4 


Astrocytes TNFalpha + 
IL-1 beta 


24.3 


Secondary CD8 
lymphocyte act 


0.0 


KU-812 (Basophil) rest 


0.5 


CD4 lymphocyte none 


00 |KU-8 12 (Basophil) 
|PMA/ionomyciii 


2.3 


2ry Thl/Th2/Trl anti- 
CD95 CH11 


0 0 CCD1106 

j(Keratinocytes) none 


39.8 


LAK cells rest 


ICCD1106 
0.7 j(Keratinocytes) 

[TNFalpha + 1L-1 beta 


32.5 


LAK cells 1L-Z 


0.4 jLiver cirrhosis 


41.2 


L \K cells I L-Z+IL- 1 _ 


1.3 NCl-H292none 


49.3 


LAK cells IL-2+IFN 
gamma 


2.2 JNCI-H292 IL-4 


45.1 


1 A \. r TT O t IT 1 O 

LAk t 1L-Z r IL-ii 


2.0 (NCI-H292 IL-9 


100.0 


LAK cells 

PM A/io nomy cin 


1.7 1NCI-H292 IL-13 


51.8 


X TT/" 1 1 , , T I O raft 

JNK. Cells lL-z rest 


0.5 "NCI-H292 IFN gamma 


!>J.O 


Two Way MLR 3 day 


0.4 ; HPAEC none 


8.0 


Two Way MLR 5 day 


n 0 IHPAEC TNF alpha + IL- 
0 8 jl beta 


6.1 


'Two Way MLR 7 day 


1 .5 (Lung fibroblast none 


13.3 


PBMC rest 


(Lung fibroblast TNF 
alpha + IL-1 beta 


7, 


jPBMC PWM 


4.3 1 una fibroblast IL-4 


3.6 


PBMC PHA-L 


3.6 Lung fibroblast IL-9 


1 ' 8.5 ' ' ' 


Ramos . 15 cell) none 


14.8 


Lung fibroblast IL-13 


3.3 


Ramos (B cell) 
ionomycin 


15.7 


Lung fibroblast IFN 
gamma 


10.5 


B lymp locytes PWM 


6.3 


Dermal fibroblast 


5.1 







CCD 1071 resl 




B lymphocytes CD40L 
and IL-4 


_____ 
6.5 


fDermal fibroblast 
|CCD1070TNF alpha 


5.6 


EOL-1 dbcAMP 


0.0 


|Dermal fibroblast 
1CCD1070 IL-1 beta 


7.7 


FfYF -1 filhrAMP 

EA-/JL/ 1 VI \ : V 1 1 

PMA ionomycin 


0.0 


jDermal fibroblast IFN 
J gam in a 


5.5 


Dendritic ct Is none 


0.5 


Dermal fibroblast IL-4 


3.7 _j 


Dendritic cells LPS 


0.0 


'■ Dermal Fibroblasts rest 


1.5 


Dendritic cells anti- 
CD40 


0.0 


(Neutrophils TNFa+LPS 


0.0 


Monocytes rest 


0.0 


([Neutrophils rest 


0.0 


[Monocytes LPS 


0.3 


Colon 


34.6 


Macrophages rest 


0.4 


iLung 


4.3 


Macrophages 1 I's 


().() 


Thymus 


9.9 


HUVEC: none 


3.3 


(Kidney 


51.4 


HUVEC starved 


3.4 





CNS_neurodegeneration_vl.O Summary: Ag3643 Results from one experiment with the 
CG59987-01 gene are not included. The amp plot indicates that there were experimental 
difficulties with this run. 



General_screening_panel_vl.4 Summary: Ag3643 Expression of the CG59987-01 gene 
5 is highest in a breast cancer cell line (CT=25.3). In addition, significant levels of 

expressi on are seen in clusters of cell lines derived from brain, gastric, colon, lung, and 
ovarian cancers. In addition, expression overall appears to be higher in samples derived 
from cancer cell lines than in normal tissues. Thus, expression of this gene could be used as 
a marker to detect the presence of cancer. This gene encodes a homolog of rhophilin, a rho 
10 GTPase that is involved in a signaling pathway that regulates cell adhesion, among other 
functions. Therefore, therapeutic modulation of the expression or function of this gene may 
be effective in the treatment of these cancers. 

Among tissues with metabolic function, this gene is expressed at moderate to low 
levels in pituitary, adipose, adrenal gland, pancreas, thyroid, skeletal muscle, and adult and 
15 fetal heart, and liver. This widespread expression among these tissues suggests that this 

gene product may play a role in normal neuroendocrine and metabolic and that disregulated 
express ion of this gene may contribute to neuroendocrine disorders or metabolic diseases, 
such as obesity and diabetes. 
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This gene is also expressed at moderate to low levels in the CNS and may be a 
small molecule target for the treatment of neurologic diseases. 

Panel 4.1D Summary: Ag3643 Expression of the CG59987-01 gene is highest in NCI- 
IK 92 cells stimulated by IL-9(CT=29.2). The gene is also expressed in a cluster of treated 
and untreated NCI-H292 mucoepidermoid cell line samples. The transcript is also 
expressed at lower but still significant levels in both small airway and bronchial epithelium 
treated with IL-1 beta and TNF-alpha. In comparison, expression in the normal lung is 
relatively low. The expression of the transcript in activated normal epithelium as well as a 
cell line that is often used as a model for airway epithelium (NCI-H292 cells) suggests that 
this transcript may be important in the proliferation or activation of airway epithelium. 
Therefore, therapuetics designed with the protein encoded by this transcript could be 
important in the treatment of diseases which include lung airway inflammation such as 
asthma and COPD. 

DN. CG59971-01 and CG59971-02: Leucine Rich Repeat protein 

Expression of gene CG59971-01 and variant CG59971-02 was assessed using the 
primer-probe set Ag3639, described in Table DNA. Results of the RTQ-PCR runs are 
shown in Tables DNB and DNC. 



Table DNA . Probe Name Ag3639 



[primers 


Sequences 


Length 


Start 
Position 


1 SEQID 
NO: 


Forward 


5 ' -ttctgccaacttcagctacaat-3 ' 


22 


510 




jprobe 


TET-5 1 -cttagacagctccctgcgcctcttgt-3 ' - 
TAMRA 


26 


543 


| 756 


[Reverse 


5 1 -acttgattgtggcttaggttca-3 1 


22 


584 


757 



Table DNB . General_screening_panel_vl.4 



Tissue Name 

! 


Rel. Exp.(%) 
Ag3639, Run 
218234144 


Tissue Name 


Rel. Exp.(%) 
Ag3639, Run 
218234144 


Adipose 


3.2 


Renal ca. I K- 1 0 


24.0 


Melanoma* 
Hs688(A).T 


17.9 


Bladder 


11.8 


Melanoma* 
Hs688(B).T 


12.6 


Gastric ca. (liver met.) 
NCI-N87 


35.6 
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Melanoma* M14 j 




oasinc ca. iv/\ i vj in 


35 8 


Melanoma* 
t rwiA/fvi 

ivL/A.llvl V 1 


15.5 


Colon ca. SW-948 


8.0 


Melanoma* SK- 
MEL-5 1 


25.2 


Colon ca. SW480 


55.5 


Squamous cell 
carcinoma SCC-4 


10.5 


Colon ca.* (SW480 
met) SW620 


32.8 


Testis Pool i 


8.9 


Colon ca. HT29 


13.7 1 


Prostate ca.* (bone 
met) PC-3 


19.3 


Colon ca. HCT-116 


34.9 


Prostate Pool 


3.2 


Colon ca. CaCo-2 




[Placenta 


16.3 


Colon cancer tissi l 


14.3 


Uterus Pool 


1.0 


Colon ca. SW1116 


7.9 1 

- — 


lovarian ca. 
|OVCAR-3 


19.9 


,„„ ^ — 

Colon ca. Colo-205 


13.3 j 


lOvarian ca. SK- 
|0V-3 


30.1 


Colon ca. SW-48 


8.8 


(Ovarian ca. 
IOVCAR-4 


14.8 


Colon Pool 


8.2 


Ovarian ca. 
OVCAR-5 


60.7 


Small Intestine Pool 


7.7 


jOvarian ca. 
IIGROV-1 


15.1 


Stomach Pool 


3.9 


Ovarian ca. 
OVCAR-8 


9.5 


Bone Marrow Pool 


2.7 


Ovary 


12.6 


Fetal Heart 


7.2 


[Breast ca. MCF-7 


54.0 


Heart Pool 


3.7 


•Breast ca. MDA- 
|MB-231 


31.2 


Lymph Node Pool 


10.4 


(Breast ca. BT 549 


2~*.2 


Fetal Skeletal Muscle 


4.0 


jBreast ca. T47D 


100.0 


Skeletal Muscle P.* ol 


3.6 


Breast ca. MDA-N 


20.4 


Spleen Pool 


8.4 


Breast Pool 


8.9 


Thymus Pool 


15.3 


Trachea 


10.3 


CNS cancer 
(glio/astro) U87-MG 


37.9 




jLung 


1.1 


CNS cancer 
(glio/astro) U-l 18-MG 


26.2 


(Fetal Lung 


21.9 


CNS cancer 
(neuro;met) SK-N-AS 


22.4 


(Lung ca. NCI-N417 


4.5 


CNS cancer (astro) SI - 
539 


15.7 


jLung ca. LX-1 


32.8 


CNS cancer (astro) 
SNB-75 


46.7 


JLung ca. NCI-H146 


7.7 


CNS cancer (glio) 
SNB-19 


12.2 
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Lung ca. SHP-77 




33.2 


CNS cancer (glio) SF- 
295 




31.4 


Lung ca. A549 


30.8 


Brain (Amygdala) Pool 


7.9 


Lung ca. NCI-H526 


8.7 


Brain (cerebellum) 


27.4 


Lung ca. NCI-H23 


23.2 


Brain (fetal) 


24.3 


Lung ca. NCI-H460 


18.0 


Brain (Hippocampus) 
Pool 


6.8 


Lung ca. HOP-62 


9.2 


Cerebral Cortex Pool 


8.0 


Lung ca. NCI-H522 


22.5 


Brain (Substantia 
nigra) Pool 


8.3 


Liver 


0.8 


3iain iTluLuuis) Pool 


10.7 


; et tl Liver 


8.8 


Brain (whole) 


13.8 


Liver ca. HepG2 




19.3 


Spinal Cord Pool 


7.6 


kidney Pool 




12.9 


Adrenal Gland 


14.1 


Fetal Kidney 


8.4 


Pituitary gland Pool 


6.6 


Renal ca. 786-0 


20.0 


Salivary Gland 


3.0 


Renal... A1«>S 




5.2 


Thyroid (female) 


4.9 


Renal ca. ACHN 


34.4 


Pancreatic ca. 
CAPAN2 


20.9 


Renal ca. UO-3 1 


15.5 


Pancreas Pool 


10.7 


Table DNC. Panel 4.1D 


Tissue Name 


Rel. Exp.(%) 
Ag3639, Run 
169975065 


I 

} Tissue Name 




Rel. Exp.(%) 
Ag3639, Run 
169975065 


Secondary Thl act 


51.8 


iHUVEC IL-lbeta 




27.0 


Secondan 1 h2 act 


68.8 


HUVEC [FN gamma 




31.2 


Secondary Trl act 


74.7 


[HUVECTNF alpha + 
[] N ^amma 




28.9 


Secondary Thl rest 


26.4 


IHUVEC TNF alpha + 
jlL4 




26.4 


Secondary Th2 rest 


41.5 


(HUVEC IL-1 1 




22.4 


Secondary Trl rest 


24.1 


Lung Microvascular EC 
none 


55.5 


Primary Thl act 


50.3 


Lung Microvascular EC 
TNFalpha + IL-lbeta 


36.1 


Primary Th2 act 


75.8 


\liei ovascular Dermal 
(EC none 




23.5 


Primary Trl act 


66.4 


IMicrosvasular Dermal 
lEC TNFalpha + IL- 1 beta 


33.0 


Primary Thl rest 


19.8 


Bronchial epithelium 
TNFalpha + IL1 beta 




22.8 


Primary Th2 rest 


33.9 


{Small airway epithelium 


23.2 
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none 




Primary Trl rest 


64.2 


Small airway epithelium 
TNFalpha + IL-lbeta 


37.1 


CD45RA CD4 
lymphocyte act 


35.4 


Coronery artery SMC rest 


21.2 


CD45RO CD4 
lymphocyte act 


59.5 


Coronery artery SMC 
TNFalpha + IL-lbeta 


13.4 


CD8 lymphocyte act 


64.6 


\stn >c\ tes rest 


23.3 


Secondary CD8 
lymphocyte rest 


36.1 


Astrocytes TNFalpha + 
IL-lbeta 


25.2 


Secondary CD8 
lymphocyte act 


45 1 


IN. U o 1 Z ^odbOpilll^ ICbl 


22.4 


CD4 lymphocyte none 


20.0 


KU-8 12 (Basophil) 
PMA/ ionomycin 


32.8 


2ry Thl/Th2/Trl_anti- 
CD95 CHI 1 


44.1 


CCD1106 

(Keratinocytes) none 


61.6 


LAK cells rest 


53.2 


CCD1106 
(Keratinocytes) 
TNFalpha + IL-lbeta 


49.0 

~ 


LAK cells IL-z 


57.4 


Liver cirrhosis 




LAK cells l X..-Z < 1 L- 1 2. 


54 0 


NCI-H292 none 


46 7 


LAK cells IL-2+IFN 
gamma 


59.0 


NCI-H292 IL-4 


45.1 


| A 1 ' ,.,,1 j„ I] ~) i II 1 V 

LAK cells 1 1 1 .- 1 o> 


61.6 


NCI-H292 IL-9 


58 6 


LAK cells 
PMA/ionomycin 


39.0 


NCI-H292 IL-13 


35.6 


inJv i_,ens lju-z resi 


60.7 


MPI TJOQO TT7M reommQ 

i\L i-i li in gamma 


28.9 


1 WO W ay ivllvlN. J Udy 


55.1 


HPAEC none 


18.3 


Two Way MLR 5 day 


36.6 


HPAEC TNF alpha - II - 
1 beta 


44.8 


Twrr\ \X/a\/ "N/TT T? 7 rlav 

i wo w d.y lviijiv / udy 


36.6 


Lung fibroblast nr.p.e 


28.3 


PBMC rest 


22.2 


Lung Fibroblast TNF 
alpha + IL-1 beta 


20.2 


PBMC PWM 


62.4 


[ iul lihvl 1 i ,i IL-i 


29.1 


PBMC PHA-i , 


42.9 


1 ung HI >» obh.st IL-9 


50.7 


Ramos (B cell) none 


43.5 


Lung fibroblast IL-13 


40.3 


Ramos (B cell) 
ionomydn 


46.3 


Lung fibroblast IFN 
gamma 


37.4 

- — 


B lymphocytes PWM 


40.9 


Dermal fibroblast 
CCD1070 rest 


60.3 


B lympltiocytes CD40L 
and IL-4 


78.5 


Dermal fibroblast 
CCD1070 TNF alpha 


92.7 


EOL-1 dbcAMP 


34.2 


Dermal fibroblast 
O 1)1070 U.-l beta 


22.7 


EOl -1 dbcAMP 


62 0 


Oeinsai fibroblast I FN 


24.0 
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PMA/ionomycin 


Igamma 




Dendritic cells none 


65.1 


jDermal fibroblast IL-4 


35.6 


Dendritic cells LPS 


25.3 


Dermal r ibro blasts rest 


21.8 


Dendritic cells anti- 
CD40 


41.8 


(Neutrophils TNFa+LPS 


1.7 


Mono.-} tes rest 


100.0 


NeuirophiK rest 


18.3 


Monocytes LPS 


77.4 


(Colon 


11.2 


Macrophages rest 


62.4 


|Lung 


17.7 


Macrophages LI'S 


15.9 


[ liymus 


81.8 


HUVEC none 


20.0 


(Kidney 


18.6 


HUVEC starved 







CNS_neurodegeneration_vl.O Summary: Ag3639 Results from one experiment with the 
CG5997 1-01 gene are not included. The amp plot indicates that there were experimental 
difficulties with this run. 



Genera:l_screening_panel_vl.4 Summary: Ag3639 Expression of the CG59971-02 gene 
is ubiquitous in this panel, with highest expression in a breast cancer cell line (CT=26.6). 
Overall, expression of this gene appears to be higher in samples derived from cancer cell 
lines than in normal tissues. This widespread expression suggests that this gene product is 
involved in cell growth and prolideration. Thus, expression of this gene could be used as a 
marker to detect the presence of cancer. Furthermore, therapeutic modulation of the 
expressi on or function of this gene may be useful in the treatment of cancer. 

In addition, this gene is expressed at much higher levels in fetal lung and liver 
(CTs=29-30) when compared to expression in the adult counterpart (CTs=33). Thus, 
expressiion of this gene may be used to differentiate between the fetal and adult sources of 
these tissue. 

Among tissues with metabolic function, this gene is expressed at moderate to low 
levels in pituitary, adipose, adrenal gland, pancreas, thyroid, and adult and fetal skeletal 
muscle, heart, and liver. This widespread expression among these tissues suggests that this 
gene product may play a role in normal neuroendocrine and metabolic and that disregulated 
expression of this gene may contribute to neuroendocrine disorders or metabolic diseases, 
such as obesity and diabetes. 

This gene is also highly expressed in the brain, with highest expression in the 
cerebellum (CT = 28.5), with moderate expression in other CNS regions as well including, 
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amygdala, hippocampus, cerebral cortex, substantia nigra and thalamus. This gene encodes 
a leucine-rich repeat protein. Leucine rich repeats (LRR) mediate reversible protein-protein 
interactions and have diverse cellular functions, including cellular adhesion and signaling. 
Several of these proteins, such as connectin, slit, chaoptin, and Toll have pivotal roles in 
neuronal development in Drosophila and may play significant but distinct roles in neural 
development and in the adult nervous system of humans (Ref. 1). In Drosophilia, the LRR 
region of axon guidance proteins has been shown to be critical for their function (especially 
in axon this gene shows high expression in the brain, it is an excellent candidate neuronal 
guidance protein for axons, dendrites and/or growth cones in general. Therefore, 
therapeutic modulation of the levels of this protein, or possible signaling via this protein, 
may be of utility in enhancing/directing compensatory synaptogenesis and fiber growth in 
the CNS in response to neuronal death (stroke, head trauma), axon lesion (spinal cord 
injury), or neurodegeneration (Alzheimer's, Parkinson's, Huntington's, vascular dementia or 
any neurodegenerative disease). 

References: 

1. Battye R., Stevens A., Perry R.L., Jacobs J.R. (2001) Repellent signaling by Slit 
requires the leucine-rich repeats. J. Neurosci. 21: 4290-4298. 

Panel 4.1D Summary: Ag3639 The CG59971-01 gene is expressed at high to moderate 
levels in a wide range of cell types of significance in the immune response in health and 
disease. Highest expression of the gene is seen in resting monocytes (CT=28.6). Significant 
levels of expression are also seen in members of the T-cell, B-cell, endothelial cell, 
macrophage/monocyte, and peripheral blood mononuclear cell family, as well as epithelial 
and fibroblast cell types from lung and skin, and normal tissues represented by colon, lung, 
thymus and kidney. This ubiquitous pattern of expression suggests that this gene product 
may be involved in homeostatic processes for these and other cell types and tissues. This 
pattern is in agreement with the expression profile in General_screening_panel_vl.4 and 
also suggests a role for the gene product in cell survival and proliferation. Therefore, 
modulation of the gene product with a functional therapeutic may lead to the alteration of 
functions associated with these cell types and lead to improvement of the symptoms of 
patients suffering from autoimmune and inflammatory diseases such as asthma, allergies, 
inflammatory bowel disease, lupus erythematosus, psoriasis, rheumatoid arthritis, and 
osteoarthritis. 
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Example D. Identification of Single Nucleotide Polymorphisms in NOVX nucleic acid 
sequences 

Variant sequences are also included in this application. A variant sequence can 
5 include a single nucleotide polymorphism (SNP). A SNP can, in some instances, be 
referred to as a "cSNP" to denote that the nucleotide sequence containing the SNP 
originates as a cDNA. A SNP can arise in several ways. For example, a SNP may be due 
to a substitution of one nucleotide for another at the polymorphic site. Such a substitution 
can be either a transition or a transversion. A SNP can also arise from a deletion of a 

10 nucleotide or an insertion of a nucleotide, relative to a reference allele. In this case, the 
polymorphic site is a site at which one allele bears a gap with respect to a particular 
nucleotide in another allele. SNPs occurring within genes may result in an alteration of the 
amino acid encoded by the gene at the position of the SNP. Intragenic SNPs may also be 
silent, when a codon including a SNP encodes the same amino acid as a result of the 

15 redundancy of the genetic code. SNPs occurring outside the region of a gene, or in an 

intron within a gene, do not result in changes in any amino acid sequence of a protein but 
may result in altered regulation of the expression pattern. Examples include alteration in 
temporal expression, physiological response regulation, cell type expression regulation, 
intensity of expression, and stability of transcribed message. 

20 SeqCalling assemblies produced by the exon linking process were selected and 

extended using the following criteria. Genomic clones having regions with 98% identity to 
all or part of the initial or extended sequence were identified by BLASTN searches using 
the relevant sequence to query human genomic databases. The genomic clones that 
resulted were selected for further analysis because this identity indicates that these clones 

25 contain the genomic locus for these SeqCalling assemblies. These sequences were 
analyzed for putative coding regions as well as for similarity to the known DNA and 
protein sequences. Programs used for these analyses include Grail, Genscan, BLAST, 
HMMER, FASTA, Hybrid and other relevant programs. 

Some additional genomic regions may have also been identified because selected 

30 SeqCall ing assemblies map to those regions. Such SeqCalling sequences may have 
overlapped with regions defined by homology or exon prediction. They may also be 
included because the location of the fragment was in the vicinity of genomic regions 
identified by similarity or exon prediction that had been included in the original predicted 
sequence. The sequence so identified was manually assembled and then may have been 
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extended using one or more additional sequences taken from CuraGen Corporation's 
human SeqCalling database. SeqCalling fragments suitable for inclusion were identified by 
the CuraTools™ program SeqExtend or by identifying SeqCalling fragments mapping to 
the appropriate regions of the genomic clones analyzed. 

Irtie regions defined by the procedures described above were then manually 
integrated and corrected for apparent inconsistencies that may have arisen, for example, 
from miscalled bases in the original fragments or from discrepancies between predicted 
exon junctions, EST locations and regions of sequence similarity, to derive the final 
sequence disclosed herein. When necessary, the process to identify and analyze SeqCalling 
assemblies and genomic clones was reiterated to derive the full length sequence (Alderborn 
et al., Determination of Single Nucleotide Polymorphisms by Real-time Pyrophosphate 
DNA Sequencing. Genome Research. 10 (8) 1249-1265, 2000). 

Variants are reported individually but any combination of all or a select subset of 
variants are also included as contemplated NOVX embodiments of the invention. 



NOV5a has one SNP variant, whose variant positions for its nucleotide and amino 
acid sequences is numbered according to SEQ ID NOs:13 and 14 , respectively. The 
nucleotide sequence of the NOV5a variant differs as shown in Table SNP1. 



Table SNP1. NOV5a variants. 



Variant 


Nucleotides 


Amino Acids 


Position 


Initial 


Modified 


Position 


Initial 


Modified 


13376274 


143 


A 


T 


47 


Gin 


Leu 



NOV9a has eight SNP variants, whose variant positions for its nucleotide and 
amino acid sequences is numbered according to SEQ ID NOs:2 1 and 22 , respectively. The 
nucleotide sequence of the NOV9a variant differs as shown in Table SNP2. 
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Table SNP2. NOV9a variants. 



Variant 


Nucleotides 


Amino Acids 


Position 


Initial 


Modified 


Position 


Initial 


Modified 


13376043 


230 


G 


T 


72 


Glu 


End 


13376044 


341 


G 


A 


109 


Gly 


Arg 


13376045 


441 


A 


C 


142 


Gin 


Pro 


13376046 


532 


C 


T 


172 


His 


His 


13376050 


1680 


T 


C 


555 


Leu 


Ser 


13376049 


1762 


G 


T 


582 


Leu 


Phe 


13376048 


1818 


C 


T 


601 


Ser 


Leu 


13376047 


1900 


A 


G 


628 


Thr 


Thr 



NOV 14a has five SNP variants, whose variant positions for its nucleotide and 
amino acid sequences is numbered according to SEQ ID NOs:43 and 44 , respectively. The 
nucleotide sequence of the NOV 14a variant differs as shown in Table SNP3. 



Table SNP3. NOV14a variants. 



Variant 


Nucleotides 


Amino Acids 


Position 


Initial 


Modified 


Position 


Initial 


Modified 


13376438 


1307 


T 


C 


431 


Val 


Ala 


13376437 


1571 


A 


G 


519 


His 


Arg 


13376436 


1625 


T 


C 


537 


Val 


Ala 


13376435 


1646 


T 


C 


544 


Val 


Ala 


13376434 


1667 


T 


C 


551 


He 


Thr 



NOV 15a has twelve SNP variants, whose variant positions for its nucleotide and 
amino acid sequences is numbered according to SEQ ID NOs:53 and 54 , respectively. The 
nucleotide sequence of the NOV15a variant differs as shown in Table SNP4. 
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Table SNP4. NOV15a variants. 



Variant 


Nucleotides 


Amino Acids 


Position 


Initial 


Modified 


Position 


Initial 


Modified 


13376083 


154 


A 


G 


45 


Pro 


Pro 


13376082 


194 


T 


C 


59 


Ser 


Pro 


13376081 


253 


G 


A 


78 


Arg 


Arg 


13376080 


280 


G 


A 


87 


Gin 


Gin 


13376079 


327 


C 


T 


103 


Ala 


Val 


13376078 


338 


C 


T 


107 


Pro 


Ser 


13376077 


366 


T 


C 


116 


He 


Thr 


13376076 


502 


A 


G 


161 


Lys 


Lys 


13376069 


1069 


A 


G 


350 


Pro 


Pro 


13376072 


1137 


A 


G 


373 


Glu 


Gly 


13376071 


1264 


T 


c 


415 


Leu 


Leu 


13376070 


1367 


T 


c 


450 


Ser 


Pro 



NOV17a has four SNP variants, whose variant positions for its nucleotide and 
amino acid sequences is numbered according to SEQ ID NOs:61 and 62 , respectively. The 
nucleoti de sequence of the NOV 17a variant differs as shown in Table SNP5. 



Table SNP5. NOV17a variants. 



Variant 


Nucleotides 


Amino Acids 


Position 


Initial 


Modified 


Position 


Initial 


Modified 


13377433 


175 


T 


C 


55 


His 


His 


13377432 


223 


C 


G 


71 


Pro 


Pro 


13377431 


538 


G 


A 


176 


Thr 


Thr 


13377430 


680 


T 


C 


224 


Phe 


Leu 



NOV 19a has one SNP variant, whose variant positions for its nucleotide and amino 
acid sequences is numbered according to SEQ ID NOs:71 and 72 , respectively. The 
nucleotide sequence of the NOV19a variant differs as shown in Table SNP6. 
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Table SNP6. NOV19a variants. 



Variant 


Nucleotides 


Amino Acids 


Position 


Initial 


Modified 


Position 


Initial 


Modified 


13377434 


1777 


T 


A 


586 


Thr 


Thr 



NOV21a has one SNP variant, whose variant positions for its nucleotide and amino 
acid sequences is numbered according to SEQ ID NOs:75 and 76 , respectively. The 
5 nucleotide sequence of the NOV2 1 a variant differs as shown in Table SNP7. 



Table SNP7. NOV21a variants. 



Variant 


Nucleotides 


Amino Acids 


Position 


Initial 


Modified 


Position 


Initial 


Modified 


13377435 


7503 


T 


A 


2482 


Ala 


Ala 



NOV38a has two SNP variants, whose variant positions for its nucleotide and 
amino acid sequences is numbered according to SEQ ID NOs:123 and 124 , respectively. 
10 The nucleotide sequence of the NOV38a variant differs as shown in Table SNP8. 



Table SNP8. NOV38a variants. 



Variant 


Nucleotides 


Amino Acids 


Position 


Initial 


Modified 


Position 


Initial 


Modified 


13377439 


801 


G 


A 


232 


Ser 


Ser 


13377441 


1595 


C 


G 


497 


Pro 


Arg 



NOV39a has three SNP variants, whose variant positions for its nucleotide and 
amino acid sequences is numbered according to SEQ ID NOs:125 and 126 , respectively. 
1 5 The nucleotide sequence of the NOV39a variant differs as shown in Table SNP9. 



1103 



Table SNP9. NOV39a variants. 



Variant 


Nucleotides 


Amino Acids 


Position 


Initial 


Modified 


Position 


Initial 


Modified 


13375670 


183 


C 


G 


57 


His 


Gin 


13375669 


187 


C 


T 


59 


Leu 


Phe 


13377389 


1385 


A 


G 


458 


His 


Arg 



NOV46a has four SNP variants, whose variant positions for its nucleotide and 
amino acid sequences is numbered according to SEQ ID NOs:143 and 144 , respectively. 
The nucleotide sequence of the NOV46a variant differs as shown in Table SNP 10. 



Table SNP10. NOV46a variants. 



Variant 


Nucleotides 


Amino Acids 


Position 


Initial 


Modified 


Position 


Initial 


Modified 


13377442 


177 


T 


C 


27 


Val 


Ala 


13377443 


590 


A 


G 


165 


Thr 


Ala 


13377444 


799 


A 


G 


234 


Gin 


Gin 


13377445 


977 


T 


C 


294 


Tyr 


His 



NOV49a has two SNP variants, whose variant positions for its nucleotide and 
amino acid sequences is numbered according to SEQ ID NOs:151 and 152 , respectively. 
The nucleotide sequence of the NOV49a variant differs as shown in Table SNP1 1. 



Table SNP11. NOV49a variants. 



Variant 


Nucleotides 


Amino Acids 


Position 


Initial 


Modified 


Position 


Initial 


Modified 


13377450 


119 


A 


G 


7 


Arg 


Gly 


13377448 


1556 


G 


A 


486 


Ala 


Thr 
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NOV50a has one SNP variant, whose variant positions for its nucleotide and amino 
acid sequences is numbered according to SEQ ID NOs:153 and 154 , respectively. The 
nucleotide sequence of the NOV50a variant differs as shown in Table SNP 12. 



Table SNP12. NOV50a variants. 



Variant: 


Nucleotides 


Amino Acids 


Position 


Initial 


Modified 


Position 


Initial 


Modified 


13377451 


2371 


G 


A 


791 


Ala 


Thr 



NOV51a has five SNP variants, whose variant positions for its nucleotide and 
amino acid sequences is numbered according to SEQ ID NOs:155 and 156 , respectively. 
The nucleotide sequence of the NO V5 1 a variant differs as shown in Table SNP13. 



Table SNP13. NOV51a variants. 



Variant 


Nucleotides 


Amino Acids 


Position 


Initial 


Modified 


Position 


Initial 


Modified 


13374492 


765 


G 


A 


243 


Ala 


Thr 


13374491 


924 


T 


C 


296 


Phe 


Leu 


13377453 


1028 


C 


T 


330 


Pro 


Pro 


13377454 


1052 


A 


C 


338 


Ala 


Ala 


13377455 


1205 


C 


T 


389 


His 


His 



NOV52a has one SNP variant, whose variant positions for its nucleotide and amino 
acid sequences is numbered according to SEQ ID NOs:157 and 158 , respectively. The 
nucleotide sequence of the NOV52a variant differs as shown in Table SNP 14. 



Table SNP14. NOV52a variants. 



Variant 


Nucleotides 


Amino Acids 


Position 


Initial 


Modified 


Position 


Initial 


Modified 


13377458 


221 


C 


T 


37 


Arg 


Trp 
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NOV55a has one SNP variant, whose variant positions for its nucleotide and amino 
acid sequences is numbered according to SEQ ID NOs:163 and 164 , respectively. The 
nucleotide sequence of the NOV55a variant differs as shown in Table SNP15. 

5 Table SNP 15. NOV55a variants. 



Variant 


Nucleotides 


Amino Acids 


Position 


Initial 


Modified 


Position 


Initial 


Modified 


13377462 


514 


C 


T 


165 


Arg 


Tip 


13377461 


993 


T 


C 


324 


Ser 


Ser 



NOV60a has one SNP variant, whose variant positions for its nucleotide and amino 
acid sequences is numbered according to SEQ ID NOs:183 and 184 , respectively. The 
nucleotide sequence of the NOV55a variant differs as shown in Table SNP16. 



Table SNP16. NOV60a variants. 



Variant 


Nucleotides 


Amino Acids 


Position 


Initial 


Modified 


Position 


Initial 


Modified 


13377463 


453 


T 


C 


111 


Gly 


Gly 
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NOV65a has one SNP variant, whose variant positions for its nucleotide and amino 
acid sequences is numbered according to SEQ ID NOs:195 and 196, respectively. The 
nucleotide sequence of the NOV65a variant differs as shown in Table SNP17. 



Table SNP17. NOV65a variants. 



Variant 


Nucleotides 


Amino Acids 


Position 


Initial 


Modified 


Position 


Initial 


Modified 


13377464 


75 


C 


A 


25 


Gin 


Lys 



15 



1106 



NOV68a has one SNP variant, whose variant positions for its nucleotide and amino 
acid sequences is numbered according to SEQ ID NOs:201 and 202, respectively. The 
nucleotide sequence of the NOV68a variant differs as shown in Table SNP 18. 



Table SNP18. NOV68a variants. 



Variant 


Nucleotides 


Amino Acids 


Position 


Initial 


Modified 


Position 


Initial 


Modified 


13377465 


438 


C 


G 


145 


Gly 


Gly 



NOV72a has one SNP variant, whose variant positions for its nucleotide and amino 
acid sequences is numbered according to SEQ ID NOs:209 and 210, respectively. The 
nucleotide sequence of the NOV72a variant differs as shown in Table SNP 19. 



Table SNP19. NOV72a variants. 



Variant 


Nucleotides 


Amino Acids 


Position 


Initial 


Modified 


Position 


Initial 


Modified 


13377466 


839 


C 


T 


271 


Pro 


Ser 



NOV80a has four SNP variants, whose variant positions for its nucleotide and 
amino acid sequences is numbered according to SEQ ID NOs:225 and 226, respectively. 
The nucleotide sequence of the NOV80a variant differs as shown in Table SNP20. 



Table SNP20. NOV80a variants. 



Variant 


Nucleotides 


Amino Acids 


Position 


Initial 


Modified 


Position 


Initial 


Modified 


13377471 


166 


G 


T 


46 


Ala 


Ser 


13377470 


482 


C 


T 


151 


Ala 


Val 


13377469 


685 


A 


G 


219 


Thr 


Ala 


13377468 


1410 


G 


C 


460 


Glu 


Asp 



1107 



NOV81a has four SNP variants, whose variant positions for its nucleotide and 
amino acid sequences is numbered according to SEQ ID NOs:229 and 230, respectively. 
The nucleotide sequence of the NOV8 la variant differs as shown in Table SNP21. 



Table SNP21. NOV81a variants. 



Variant 


Nucleotides 


Amino Acids 


Position 


Initial 


Modified 


Position 


Initial 


Modified 


13377472 


285 


C 


T 


91 


His 


Tyr 


13377473 


553 


A 


G 


180 


His 


Arg 


13377474 


554 


C 


T 


180 


His 


His 


13377475 


2581 


A 


G 


856 


Gin 


Arg 



NOV89a has two SNP variants, whose variant positions for its nucleotide and 
amino acid sequences is numbered according to SEQ ID NOs:249 and 250, respectively. 
The nucleotide sequence of the NOV89a variant differs as shown in Table SNP22. 



Table SNP22. NOV89a variants. 



Variant 


Nucleotides 


Amino Acids 


Position 


Initial 


Modified 


Position 


Initial 


Modified 


13377477 


425 


A 


G 


119 


Met 


Val 


13377478 


1162 


C 


A 


364 


Val 


Val 



NOV94a has one SNP variants, whose variant positions for its nucleotide and 
amino acid sequences is numbered according to SEQ ID NOs:269 and 270, respectively. 
The nucleotide sequence of the NOV94a variant differs as shown in Table SNP23. 



Table SNP23. NOV94a variants. 



Variant 


Nucleotides 


Amino Acids 


Position 


Initial 


Modified 


Position 


Initial 


Modified 


13377479 


1005 


T 


C 


303 


Asp 


Asp 
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NOV96a has two SNP variants, whose variant positions for its nucleotide and 
amino acid sequences is numbered according to SEQ ID NOs:273 and 274, respectively. 
The nucleotide sequence of the NOV96a variant differs as shown in Table SNP24. 



Table SNP24. NOV96a variants. 



Variant 


Nucleotides 


Amino Acids 


Position 


Initial 


Modified 


Position 


Initial 


Modified 


13377480 


150 


A 


G 


45 


Lys 


Arg 


13377482 


2221 


A 


G 


735 


Ser 


Ser 



NOV99a has one SNP variant, whose variant positions for its nucleotide and amino 
acid sequences is numbered according to SEQ ID NOs:283 and 284, respectively. The 
nucleotide sequence of the NOV99a variant differs as shown in Table SNP25. 



Table SNP25. NOV99a variants. 



Variant 


Nucleotides 


Amino Acids 


Position 


Initial 


Modified 


Position 


Initial 


Modified 


13377485 


274 


T 


C 


78, 


Thr 


Thr 



NOV 105a has three SNP variants, whose variant positions for its nucleotide and 
amino acid sequences is numbered according to SEQ ID NOs:299 and 300, respectively. 
The nucleotide sequence of the NOV 105a variant differs as shown in Table SNP26. 



Table SNP26. NOV105a variants. 



Variant 


Nucleotides 


Amino Acids 


Position 


Initial 


Modified 


Position 


Initial 


Modified 


13377488 


453 


C 


T 


145 


He 


He 


13377487 


828 


T 


G 


270 


Thr 


Thr 


13377486 


924 


A 


G 


302 


Thr 


Thr 



1109 



NOV 1 13a has three SNP variants, whose variant positions for its nucleotide and 
amino acid sequences is numbered according to SEQ ID NOs:315 and 316, respectively. 
The nucleotide sequence of the NOV1 1 3a variant differs as shown in Table SNP27. 



Table SNP27. NO VI 13a variants. 



Variant 


Nucleotides 


Amino Acids 


Position 


Initial 


Modified 


Position 


Initial 


Modified 


13377490 


340 


G 


A 


100 


Ala 


Thr 


13377491 


659 


C 


T 


206 


Pro 


Leu 


13377492 


726 


C 


T 


228 


Arg 


Arg 


13377493 


915 


T 


C 


291 


Ala 


Ala 


13377494 


1058 


T 


C 


339 


He 


Thr 


13377495 


1088 


T 


c 


349 


Leu 


Pro 



NOV1 14a has two SNP variants, whose variant positions for its nucleotide and 
amino acid sequences is numbered according to SEQ ID NOs:319 and 320, respectively. 
The nucleotide sequence of the NOV1 14a variant differs as shown in Table SNP28. 



Table SNP28. NOV114a variants. 



Variant 


Nucleotides 


Amino Acids 


Position 


Initial 


Modified 


Position 


Initial 


Modified 


13375633 


185 


T 


C 


54 


Val 


Ala 


13375632 


689 


A 


G 


222 


Gin 


Arg 



NOV1 16a has two SNP variants, whose variant positions for its nucleotide and 
amino acid sequences is numbered according to SEQ ID NOs:325 and 326, respectively. 
The nucleotide sequence of the NOV 1 16a variant differs as shown in Table SNP29. 
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Table SNP29. NOV116a variants. 



Variant 


Nucleotides 


Amino Acids 


Position 


Initial 


Modified 


Position 


Initial 


Modified 


13374815 


203 


A 


G 


63 


Thr 


Ala 


13374814 


384 


T 


C 


123 


Leu 


Pro 



NOV 1 17a has three SNP variants, whose variant positions for its nucleotide and 
amino acid sequences is numbered according to SEQ ID NOs:329 and 330, respectively. 
The nucleotide sequence of the NOV 1 17a variant differs as shown in Table SNP30. 



Table SNP30. NO VI 17a variants. 



Variant 


Nucleotides 


Amino Acids 


Position 


Initial 


Modified 


Position 


Initial 


Modified 


13377507 


453 


A 


G 


149 


Pro 


Pro 


13377506 


755 


A 


T 


250 


Glu 


Val 


13377505 


1128 


G 


A 


374 


Lys 


Lys 



NOV 124a has six SNP variants, whose variant positions for its nucleotide and 
amino acid sequences is numbered according to SEQ ID NOs:343 and 344, respectively. 
The nucleotide sequence of the NOV 124a variant differs as shown in Table SNP31. 



Table SNP31. NOV124a variants. 



Variant 


Nucleotides 


Amino Acids 


Position 


Initial 


Modified 


Position 


Initial 


Modified 


13377511 


186 


C 


T 


39 


Ser 


Ser 


13377512 


372 


C 


T 


101 


Gly 


Gly 


13377513 


970 


G 


T 


301 


Asp 


Tyr 


13377514 


1051 


G 


A 


328 


Val 


Met 


13377515 


1266 


C 


T 


399 


He 


He 


13377516 


1304 


A 


G 


412 


Asp 


Gly 



1111 



NOV 126a has two SNP variants, whose variant positions for its nucleotide and 
amino acid sequences is numbered according to SEQ ID NOs:349 and 350, respectively. 
The nucleotide sequence of the NOV126a variant differs as shown in Table SNP32. 



Table SNP32. NOV126a variants. 



Variant 


Nucleotides 


Amino Acids 


Position 


Initial 


Modified 


Position 


Initial 


Modified 


13377517 


747 


T 


C 


234 


Cys 


Cys 


13377518 


1879 


C 


T 


612 


Gin 


End 



OTHER EMBODIMENTS 

Although particular embodiments have been disclosed herein in detail, this has been done 
by way of example for purposes of illustration only, and is not intended to be limiting with 
respect to the scope of the appended claims, which follow. In particular, it is contemplated 
by the inventors that various substitutions, alterations, and modifications may be made to 
the invention without departing from the spirit and scope of the invention as defined by the 
claims. The choice of nucleic acid starting material, clone of interest, or library type is 
believed to be a matter of routine for a person of ordinary skill in the art with knowledge of 
the embodiments described herein. Other aspects, advantages, and modifications 
considered to be within the scope of the following claims. The claims presented are 
representative of the inventions disclosed herein. Other, unclaimed inventions are also 
contemplated. Applicants reserve the right to pursue such inventions in later claims. 
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